Count number of repeated multi-character strings in file in bash
I am trying to search for consecutive, repeated instances of numbers in a file, with each number on a different line. It's easiest to show what I mean in an example.
For example, my input would be:
16
16
8
8
16
16
4
8
8
8
And my output would be:
16 2
8 2
16 2
4 1
8 3
This may appear to be a duplicate, but though I have found many solutions (like this one) for finding the repetitions of a single character, none of them are able to search for multiple repeated characters.
text-processing
add a comment |
I am trying to search for consecutive, repeated instances of numbers in a file, with each number on a different line. It's easiest to show what I mean in an example.
For example, my input would be:
16
16
8
8
16
16
4
8
8
8
And my output would be:
16 2
8 2
16 2
4 1
8 3
This may appear to be a duplicate, but though I have found many solutions (like this one) for finding the repetitions of a single character, none of them are able to search for multiple repeated characters.
text-processing
Do you mean repeated characters (e.g.111
), repeated strings (e.g.123123
) or repeated lines? Will each line of your input only consist of a string of characters (including numbers) and no spaces? Do you just want repeated lines?
– terdon♦
Jan 19 at 18:16
@terdon - it was a badly worded title - I meant to put repeated lines. My question sort of changed as I was writing the post. I'll edit the title.
– ToasterFrogs
Jan 19 at 18:25
add a comment |
I am trying to search for consecutive, repeated instances of numbers in a file, with each number on a different line. It's easiest to show what I mean in an example.
For example, my input would be:
16
16
8
8
16
16
4
8
8
8
And my output would be:
16 2
8 2
16 2
4 1
8 3
This may appear to be a duplicate, but though I have found many solutions (like this one) for finding the repetitions of a single character, none of them are able to search for multiple repeated characters.
text-processing
I am trying to search for consecutive, repeated instances of numbers in a file, with each number on a different line. It's easiest to show what I mean in an example.
For example, my input would be:
16
16
8
8
16
16
4
8
8
8
And my output would be:
16 2
8 2
16 2
4 1
8 3
This may appear to be a duplicate, but though I have found many solutions (like this one) for finding the repetitions of a single character, none of them are able to search for multiple repeated characters.
text-processing
text-processing
edited Jan 19 at 18:20
Rui F Ribeiro
39.9k1479134
39.9k1479134
asked Jan 19 at 18:06
ToasterFrogsToasterFrogs
443
443
Do you mean repeated characters (e.g.111
), repeated strings (e.g.123123
) or repeated lines? Will each line of your input only consist of a string of characters (including numbers) and no spaces? Do you just want repeated lines?
– terdon♦
Jan 19 at 18:16
@terdon - it was a badly worded title - I meant to put repeated lines. My question sort of changed as I was writing the post. I'll edit the title.
– ToasterFrogs
Jan 19 at 18:25
add a comment |
Do you mean repeated characters (e.g.111
), repeated strings (e.g.123123
) or repeated lines? Will each line of your input only consist of a string of characters (including numbers) and no spaces? Do you just want repeated lines?
– terdon♦
Jan 19 at 18:16
@terdon - it was a badly worded title - I meant to put repeated lines. My question sort of changed as I was writing the post. I'll edit the title.
– ToasterFrogs
Jan 19 at 18:25
Do you mean repeated characters (e.g.
111
), repeated strings (e.g. 123123
) or repeated lines? Will each line of your input only consist of a string of characters (including numbers) and no spaces? Do you just want repeated lines?– terdon♦
Jan 19 at 18:16
Do you mean repeated characters (e.g.
111
), repeated strings (e.g. 123123
) or repeated lines? Will each line of your input only consist of a string of characters (including numbers) and no spaces? Do you just want repeated lines?– terdon♦
Jan 19 at 18:16
@terdon - it was a badly worded title - I meant to put repeated lines. My question sort of changed as I was writing the post. I'll edit the title.
– ToasterFrogs
Jan 19 at 18:25
@terdon - it was a badly worded title - I meant to put repeated lines. My question sort of changed as I was writing the post. I'll edit the title.
– ToasterFrogs
Jan 19 at 18:25
add a comment |
1 Answer
1
active
oldest
votes
You can use uniq
to do this, though the standard format of its output is slightly different.
$ < in uniq -c
2 16
2 8
2 16
1 4
3 8
Assuming in
is a file that contains the input, you'll get what you see above.
which could be piped throughawk '{print $2,$1}'
for a quick'n'dirty way to get output in the order shown by the OP
– steeldriver
Jan 19 at 18:25
add a comment |
Your Answer
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "106"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f495491%2fcount-number-of-repeated-multi-character-strings-in-file-in-bash%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
You can use uniq
to do this, though the standard format of its output is slightly different.
$ < in uniq -c
2 16
2 8
2 16
1 4
3 8
Assuming in
is a file that contains the input, you'll get what you see above.
which could be piped throughawk '{print $2,$1}'
for a quick'n'dirty way to get output in the order shown by the OP
– steeldriver
Jan 19 at 18:25
add a comment |
You can use uniq
to do this, though the standard format of its output is slightly different.
$ < in uniq -c
2 16
2 8
2 16
1 4
3 8
Assuming in
is a file that contains the input, you'll get what you see above.
which could be piped throughawk '{print $2,$1}'
for a quick'n'dirty way to get output in the order shown by the OP
– steeldriver
Jan 19 at 18:25
add a comment |
You can use uniq
to do this, though the standard format of its output is slightly different.
$ < in uniq -c
2 16
2 8
2 16
1 4
3 8
Assuming in
is a file that contains the input, you'll get what you see above.
You can use uniq
to do this, though the standard format of its output is slightly different.
$ < in uniq -c
2 16
2 8
2 16
1 4
3 8
Assuming in
is a file that contains the input, you'll get what you see above.
answered Jan 19 at 18:21
TomaszTomasz
9,51652965
9,51652965
which could be piped throughawk '{print $2,$1}'
for a quick'n'dirty way to get output in the order shown by the OP
– steeldriver
Jan 19 at 18:25
add a comment |
which could be piped throughawk '{print $2,$1}'
for a quick'n'dirty way to get output in the order shown by the OP
– steeldriver
Jan 19 at 18:25
which could be piped through
awk '{print $2,$1}'
for a quick'n'dirty way to get output in the order shown by the OP– steeldriver
Jan 19 at 18:25
which could be piped through
awk '{print $2,$1}'
for a quick'n'dirty way to get output in the order shown by the OP– steeldriver
Jan 19 at 18:25
add a comment |
Thanks for contributing an answer to Unix & Linux Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f495491%2fcount-number-of-repeated-multi-character-strings-in-file-in-bash%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Do you mean repeated characters (e.g.
111
), repeated strings (e.g.123123
) or repeated lines? Will each line of your input only consist of a string of characters (including numbers) and no spaces? Do you just want repeated lines?– terdon♦
Jan 19 at 18:16
@terdon - it was a badly worded title - I meant to put repeated lines. My question sort of changed as I was writing the post. I'll edit the title.
– ToasterFrogs
Jan 19 at 18:25