Using regular expression to divide two words and capture the first word

following part-of-speech tagged sentence: All/DT animals/NNS are/VBP equal/JJ ,/,
but/CC some/DT animals/NNS are/VBP more/RBR equal/JJ than/IN others/NNS ./.

How to write a regular expression that matches only the words of each word/pos-tag in the sentence.

text="""All/DT animals/NNS are/VBP equal/JJ ,/, but/CC some/DT animals/NNS 

are/VBP more/RBR equal/JJ than/IN others/NNS ./."""

tokens=nltk.word_tokenize(text)

pattern="([A-Za-z]+)|[A-Za-z]"

print("Upper case words:")

for tok in tokens:

   if re.search(pattern, tok) is not None:

      print("'{}'".format(tok))

asked Jan 20 at 7:03

Shideh Hm

193

What do you want your regex to match? capital letter text preceded by /? like DT NNS VBP etc?

– Pushpesh Kumar Rajwanshi
Jan 20 at 7:11

2

post the expected result

– RomanPerekhrest
Jan 20 at 7:15

You show some code. What is the problem with it?

– Mad Physicist
Jan 20 at 7:20

Is there any result you are expecting? Please post it.

– Austin
Jan 20 at 7:22

I want my regex to match only words (All, animals , are , equal , etc)

– Shideh Hm
Jan 20 at 7:30

add a comment |

following part-of-speech tagged sentence: All/DT animals/NNS are/VBP equal/JJ ,/,
but/CC some/DT animals/NNS are/VBP more/RBR equal/JJ than/IN others/NNS ./.

How to write a regular expression that matches only the words of each word/pos-tag in the sentence.

text="""All/DT animals/NNS are/VBP equal/JJ ,/, but/CC some/DT animals/NNS 

are/VBP more/RBR equal/JJ than/IN others/NNS ./."""

tokens=nltk.word_tokenize(text)

pattern="([A-Za-z]+)|[A-Za-z]"

print("Upper case words:")

for tok in tokens:

   if re.search(pattern, tok) is not None:

      print("'{}'".format(tok))

asked Jan 20 at 7:03

Shideh Hm

193

What do you want your regex to match? capital letter text preceded by /? like DT NNS VBP etc?

– Pushpesh Kumar Rajwanshi
Jan 20 at 7:11

2

post the expected result

– RomanPerekhrest
Jan 20 at 7:15

You show some code. What is the problem with it?

– Mad Physicist
Jan 20 at 7:20

Is there any result you are expecting? Please post it.

– Austin
Jan 20 at 7:22

I want my regex to match only words (All, animals , are , equal , etc)

– Shideh Hm
Jan 20 at 7:30

add a comment |

following part-of-speech tagged sentence: All/DT animals/NNS are/VBP equal/JJ ,/,
but/CC some/DT animals/NNS are/VBP more/RBR equal/JJ than/IN others/NNS ./.

How to write a regular expression that matches only the words of each word/pos-tag in the sentence.

text="""All/DT animals/NNS are/VBP equal/JJ ,/, but/CC some/DT animals/NNS 

are/VBP more/RBR equal/JJ than/IN others/NNS ./."""

tokens=nltk.word_tokenize(text)

pattern="([A-Za-z]+)|[A-Za-z]"

print("Upper case words:")

for tok in tokens:

   if re.search(pattern, tok) is not None:

      print("'{}'".format(tok))

asked Jan 20 at 7:03

Shideh Hm

193

following part-of-speech tagged sentence: All/DT animals/NNS are/VBP equal/JJ ,/,
but/CC some/DT animals/NNS are/VBP more/RBR equal/JJ than/IN others/NNS ./.

How to write a regular expression that matches only the words of each word/pos-tag in the sentence.

text="""All/DT animals/NNS are/VBP equal/JJ ,/, but/CC some/DT animals/NNS 

are/VBP more/RBR equal/JJ than/IN others/NNS ./."""

tokens=nltk.word_tokenize(text)

pattern="([A-Za-z]+)|[A-Za-z]"

print("Upper case words:")

for tok in tokens:

   if re.search(pattern, tok) is not None:

      print("'{}'".format(tok))

python regex

asked Jan 20 at 7:03

Shideh Hm

193

asked Jan 20 at 7:03

Shideh Hm

193

asked Jan 20 at 7:03

Shideh Hm

193

asked Jan 20 at 7:03

Shideh Hm

193

asked Jan 20 at 7:03

Shideh Hm

193

What do you want your regex to match? capital letter text preceded by /? like DT NNS VBP etc?

– Pushpesh Kumar Rajwanshi
Jan 20 at 7:11

2

post the expected result

– RomanPerekhrest
Jan 20 at 7:15

You show some code. What is the problem with it?

– Mad Physicist
Jan 20 at 7:20

Is there any result you are expecting? Please post it.

– Austin
Jan 20 at 7:22

I want my regex to match only words (All, animals , are , equal , etc)

– Shideh Hm
Jan 20 at 7:30

add a comment |

What do you want your regex to match? capital letter text preceded by /? like DT NNS VBP etc?

– Pushpesh Kumar Rajwanshi
Jan 20 at 7:11

2

post the expected result

– RomanPerekhrest
Jan 20 at 7:15

You show some code. What is the problem with it?

– Mad Physicist
Jan 20 at 7:20

Is there any result you are expecting? Please post it.

– Austin
Jan 20 at 7:22

I want my regex to match only words (All, animals , are , equal , etc)

– Shideh Hm
Jan 20 at 7:30

What do you want your regex to match? capital letter text preceded by /? like DT NNS VBP etc?

– Pushpesh Kumar Rajwanshi
Jan 20 at 7:11

post the expected result

– RomanPerekhrest
Jan 20 at 7:15

You show some code. What is the problem with it?

– Mad Physicist
Jan 20 at 7:20

Is there any result you are expecting? Please post it.

– Austin
Jan 20 at 7:22

I want my regex to match only words (All, animals , are , equal , etc)

– Shideh Hm
Jan 20 at 7:30

add a comment |

2 Answers
2

active

oldest

votes

Using re.findall

import re

print (re.findall(r'([a-zA-Z]+)/[a-zA-Z]+',text))

#['All', 'animals', 'are', 'equal', 'but', 'some', 'animals', 'are', 'more', 'equal', 'than', 'others']

answered Jan 20 at 7:33

Transhuman

2,7761411

Thanks. but how to match the "." at the end of the sentence?

– Shideh Hm
Jan 20 at 8:59

add a comment |

You can use the following regex:

(S+)/S+s?

Explaination:

(S+) is a capturing group which matches any non-whitespace character
/ matches the character /
S+ matches non-whitespace character, but not captured this time
s? optional space at the end

Here's a link to test the regex and get explanation

As @Transhuman suggested use re.findall to get all the matches:

import re

print (re.findall(r'(S+)/S+s?',text))

You can test the python code here:

edited Jan 20 at 10:33

answered Jan 20 at 10:17

adiga

8,50362241

add a comment |

Your Answer

StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});

}
});

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f54274308%2fusing-regular-expression-to-divide-two-words-and-capture-the-first-word%23new-answer', 'question_page');
}
);

Post as a guest

Name

Required, but never shown

2 Answers
2

active

oldest

votes

2 Answers
2

active

oldest

votes

Using re.findall

import re

print (re.findall(r'([a-zA-Z]+)/[a-zA-Z]+',text))

#['All', 'animals', 'are', 'equal', 'but', 'some', 'animals', 'are', 'more', 'equal', 'than', 'others']

answered Jan 20 at 7:33

Transhuman

2,7761411

Thanks. but how to match the "." at the end of the sentence?

– Shideh Hm
Jan 20 at 8:59

add a comment |

Using re.findall

import re

print (re.findall(r'([a-zA-Z]+)/[a-zA-Z]+',text))

#['All', 'animals', 'are', 'equal', 'but', 'some', 'animals', 'are', 'more', 'equal', 'than', 'others']

answered Jan 20 at 7:33

Transhuman

2,7761411

Thanks. but how to match the "." at the end of the sentence?

– Shideh Hm
Jan 20 at 8:59

add a comment |

Using re.findall

import re

print (re.findall(r'([a-zA-Z]+)/[a-zA-Z]+',text))

#['All', 'animals', 'are', 'equal', 'but', 'some', 'animals', 'are', 'more', 'equal', 'than', 'others']

answered Jan 20 at 7:33

Transhuman

2,7761411

Using re.findall

import re

print (re.findall(r'([a-zA-Z]+)/[a-zA-Z]+',text))

#['All', 'animals', 'are', 'equal', 'but', 'some', 'animals', 'are', 'more', 'equal', 'than', 'others']

answered Jan 20 at 7:33

Transhuman

2,7761411

answered Jan 20 at 7:33

Transhuman

2,7761411

answered Jan 20 at 7:33

Transhuman

2,7761411

answered Jan 20 at 7:33

Transhuman

2,7761411

Thanks. but how to match the "." at the end of the sentence?

– Shideh Hm
Jan 20 at 8:59

add a comment |

Thanks. but how to match the "." at the end of the sentence?

– Shideh Hm
Jan 20 at 8:59

Thanks. but how to match the "." at the end of the sentence?

– Shideh Hm
Jan 20 at 8:59

add a comment |

You can use the following regex:

(S+)/S+s?

Explaination:

(S+) is a capturing group which matches any non-whitespace character
/ matches the character /
S+ matches non-whitespace character, but not captured this time
s? optional space at the end

Here's a link to test the regex and get explanation

As @Transhuman suggested use re.findall to get all the matches:

import re

print (re.findall(r'(S+)/S+s?',text))

You can test the python code here:

edited Jan 20 at 10:33

answered Jan 20 at 10:17

adiga

8,50362241

add a comment |

You can use the following regex:

(S+)/S+s?

Explaination:

(S+) is a capturing group which matches any non-whitespace character
/ matches the character /
S+ matches non-whitespace character, but not captured this time
s? optional space at the end

Here's a link to test the regex and get explanation

As @Transhuman suggested use re.findall to get all the matches:

import re

print (re.findall(r'(S+)/S+s?',text))

You can test the python code here:

edited Jan 20 at 10:33

answered Jan 20 at 10:17

adiga

8,50362241

add a comment |

You can use the following regex:

(S+)/S+s?

Explaination:

(S+) is a capturing group which matches any non-whitespace character
/ matches the character /
S+ matches non-whitespace character, but not captured this time
s? optional space at the end

Here's a link to test the regex and get explanation

As @Transhuman suggested use re.findall to get all the matches:

import re

print (re.findall(r'(S+)/S+s?',text))

You can test the python code here:

edited Jan 20 at 10:33

answered Jan 20 at 10:17

adiga

8,50362241

You can use the following regex:

(S+)/S+s?

Explaination:

(S+) is a capturing group which matches any non-whitespace character
/ matches the character /
S+ matches non-whitespace character, but not captured this time
s? optional space at the end

Here's a link to test the regex and get explanation

As @Transhuman suggested use re.findall to get all the matches:

import re

print (re.findall(r'(S+)/S+s?',text))

You can test the python code here:

edited Jan 20 at 10:33

answered Jan 20 at 10:17

adiga

8,50362241

edited Jan 20 at 10:33

answered Jan 20 at 10:17

adiga

8,50362241

answered Jan 20 at 10:17

adiga

8,50362241

answered Jan 20 at 10:17

adiga

8,50362241

add a comment |

draft saved

draft discarded

Thanks for contributing an answer to Stack Overflow!

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Name

Required, but never shown

Name

Required, but never shown

This page is only for reference, If you need detailed information, please check here

搜尋此網誌

Brtdku