Count number of repeated multi-character strings in file in bash












2















I am trying to search for consecutive, repeated instances of numbers in a file, with each number on a different line. It's easiest to show what I mean in an example.



For example, my input would be:



16
16
8
8
16
16
4
8
8
8


And my output would be:



16 2
8 2
16 2
4 1
8 3


This may appear to be a duplicate, but though I have found many solutions (like this one) for finding the repetitions of a single character, none of them are able to search for multiple repeated characters.










share|improve this question

























  • Do you mean repeated characters (e.g. 111), repeated strings (e.g. 123123) or repeated lines? Will each line of your input only consist of a string of characters (including numbers) and no spaces? Do you just want repeated lines?

    – terdon
    Jan 19 at 18:16











  • @terdon - it was a badly worded title - I meant to put repeated lines. My question sort of changed as I was writing the post. I'll edit the title.

    – ToasterFrogs
    Jan 19 at 18:25


















2















I am trying to search for consecutive, repeated instances of numbers in a file, with each number on a different line. It's easiest to show what I mean in an example.



For example, my input would be:



16
16
8
8
16
16
4
8
8
8


And my output would be:



16 2
8 2
16 2
4 1
8 3


This may appear to be a duplicate, but though I have found many solutions (like this one) for finding the repetitions of a single character, none of them are able to search for multiple repeated characters.










share|improve this question

























  • Do you mean repeated characters (e.g. 111), repeated strings (e.g. 123123) or repeated lines? Will each line of your input only consist of a string of characters (including numbers) and no spaces? Do you just want repeated lines?

    – terdon
    Jan 19 at 18:16











  • @terdon - it was a badly worded title - I meant to put repeated lines. My question sort of changed as I was writing the post. I'll edit the title.

    – ToasterFrogs
    Jan 19 at 18:25
















2












2








2


1






I am trying to search for consecutive, repeated instances of numbers in a file, with each number on a different line. It's easiest to show what I mean in an example.



For example, my input would be:



16
16
8
8
16
16
4
8
8
8


And my output would be:



16 2
8 2
16 2
4 1
8 3


This may appear to be a duplicate, but though I have found many solutions (like this one) for finding the repetitions of a single character, none of them are able to search for multiple repeated characters.










share|improve this question
















I am trying to search for consecutive, repeated instances of numbers in a file, with each number on a different line. It's easiest to show what I mean in an example.



For example, my input would be:



16
16
8
8
16
16
4
8
8
8


And my output would be:



16 2
8 2
16 2
4 1
8 3


This may appear to be a duplicate, but though I have found many solutions (like this one) for finding the repetitions of a single character, none of them are able to search for multiple repeated characters.







text-processing






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Jan 19 at 18:20









Rui F Ribeiro

39.9k1479134




39.9k1479134










asked Jan 19 at 18:06









ToasterFrogsToasterFrogs

443




443













  • Do you mean repeated characters (e.g. 111), repeated strings (e.g. 123123) or repeated lines? Will each line of your input only consist of a string of characters (including numbers) and no spaces? Do you just want repeated lines?

    – terdon
    Jan 19 at 18:16











  • @terdon - it was a badly worded title - I meant to put repeated lines. My question sort of changed as I was writing the post. I'll edit the title.

    – ToasterFrogs
    Jan 19 at 18:25





















  • Do you mean repeated characters (e.g. 111), repeated strings (e.g. 123123) or repeated lines? Will each line of your input only consist of a string of characters (including numbers) and no spaces? Do you just want repeated lines?

    – terdon
    Jan 19 at 18:16











  • @terdon - it was a badly worded title - I meant to put repeated lines. My question sort of changed as I was writing the post. I'll edit the title.

    – ToasterFrogs
    Jan 19 at 18:25



















Do you mean repeated characters (e.g. 111), repeated strings (e.g. 123123) or repeated lines? Will each line of your input only consist of a string of characters (including numbers) and no spaces? Do you just want repeated lines?

– terdon
Jan 19 at 18:16





Do you mean repeated characters (e.g. 111), repeated strings (e.g. 123123) or repeated lines? Will each line of your input only consist of a string of characters (including numbers) and no spaces? Do you just want repeated lines?

– terdon
Jan 19 at 18:16













@terdon - it was a badly worded title - I meant to put repeated lines. My question sort of changed as I was writing the post. I'll edit the title.

– ToasterFrogs
Jan 19 at 18:25







@terdon - it was a badly worded title - I meant to put repeated lines. My question sort of changed as I was writing the post. I'll edit the title.

– ToasterFrogs
Jan 19 at 18:25












1 Answer
1






active

oldest

votes


















6














You can use uniq to do this, though the standard format of its output is slightly different.



$ < in uniq -c
2 16
2 8
2 16
1 4
3 8


Assuming in is a file that contains the input, you'll get what you see above.






share|improve this answer
























  • which could be piped through awk '{print $2,$1}' for a quick'n'dirty way to get output in the order shown by the OP

    – steeldriver
    Jan 19 at 18:25











Your Answer








StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "106"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});


}
});














draft saved

draft discarded


















StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f495491%2fcount-number-of-repeated-multi-character-strings-in-file-in-bash%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown

























1 Answer
1






active

oldest

votes








1 Answer
1






active

oldest

votes









active

oldest

votes






active

oldest

votes









6














You can use uniq to do this, though the standard format of its output is slightly different.



$ < in uniq -c
2 16
2 8
2 16
1 4
3 8


Assuming in is a file that contains the input, you'll get what you see above.






share|improve this answer
























  • which could be piped through awk '{print $2,$1}' for a quick'n'dirty way to get output in the order shown by the OP

    – steeldriver
    Jan 19 at 18:25
















6














You can use uniq to do this, though the standard format of its output is slightly different.



$ < in uniq -c
2 16
2 8
2 16
1 4
3 8


Assuming in is a file that contains the input, you'll get what you see above.






share|improve this answer
























  • which could be piped through awk '{print $2,$1}' for a quick'n'dirty way to get output in the order shown by the OP

    – steeldriver
    Jan 19 at 18:25














6












6








6







You can use uniq to do this, though the standard format of its output is slightly different.



$ < in uniq -c
2 16
2 8
2 16
1 4
3 8


Assuming in is a file that contains the input, you'll get what you see above.






share|improve this answer













You can use uniq to do this, though the standard format of its output is slightly different.



$ < in uniq -c
2 16
2 8
2 16
1 4
3 8


Assuming in is a file that contains the input, you'll get what you see above.







share|improve this answer












share|improve this answer



share|improve this answer










answered Jan 19 at 18:21









TomaszTomasz

9,51652965




9,51652965













  • which could be piped through awk '{print $2,$1}' for a quick'n'dirty way to get output in the order shown by the OP

    – steeldriver
    Jan 19 at 18:25



















  • which could be piped through awk '{print $2,$1}' for a quick'n'dirty way to get output in the order shown by the OP

    – steeldriver
    Jan 19 at 18:25

















which could be piped through awk '{print $2,$1}' for a quick'n'dirty way to get output in the order shown by the OP

– steeldriver
Jan 19 at 18:25





which could be piped through awk '{print $2,$1}' for a quick'n'dirty way to get output in the order shown by the OP

– steeldriver
Jan 19 at 18:25


















draft saved

draft discarded




















































Thanks for contributing an answer to Unix & Linux Stack Exchange!


  • Please be sure to answer the question. Provide details and share your research!

But avoid



  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f495491%2fcount-number-of-repeated-multi-character-strings-in-file-in-bash%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

Liquibase includeAll doesn't find base path

How to use setInterval in EJS file?

Petrus Granier-Deferre