This may be a silly question but…
Say you have a sentence like:
The quick brown fox
Or you might get a sentence like:
The quick brown fox jumped over the lazy dog
The simple regexp (w*) finds the first word «The» and puts it in a group.
For the first sentence, you could write (w*)s*(w*)s*(w*)s*(w*)s* to put each word in its own group, but that assumes you know the number of words in the sentence.
Is it possible to write a regular expression that puts each word in any arbitrary sentence into its own group? It would be nice if you could do something like (?:(w*)s*)* to have it group each instance of (w*), but that doesn’t work.
I am doing this in Python, and my use case is obviously a little more complex than «The quick brown fox», so it would be nifty if Regex could do this in one line, but if that’s not possible then I assume the next best solution is to loop over all the matches using re.findall() or something similar.
Thanks for any insight you may have.
Edit: For completeness’s sake here’s my actual use case and how I solved it using your help. Thanks again.
>>> s = '1 0 5 test1 5 test2 5 test3 5 test4 5 test5'
>>> s = re.match(r'^d+sd+s?(.*)', s).group(1)
>>> print s
5 test1 5 test2 5 test3 5 test4 5 test5
>>> list = re.findall(r'd+s(w+)', s)
>>> print list
['test1', 'test2', 'test3', 'test4', 'test5']
This may be a silly question but…
Say you have a sentence like:
The quick brown fox
Or you might get a sentence like:
The quick brown fox jumped over the lazy dog
The simple regexp (w*) finds the first word «The» and puts it in a group.
For the first sentence, you could write (w*)s*(w*)s*(w*)s*(w*)s* to put each word in its own group, but that assumes you know the number of words in the sentence.
Is it possible to write a regular expression that puts each word in any arbitrary sentence into its own group? It would be nice if you could do something like (?:(w*)s*)* to have it group each instance of (w*), but that doesn’t work.
I am doing this in Python, and my use case is obviously a little more complex than «The quick brown fox», so it would be nifty if Regex could do this in one line, but if that’s not possible then I assume the next best solution is to loop over all the matches using re.findall() or something similar.
Thanks for any insight you may have.
Edit: For completeness’s sake here’s my actual use case and how I solved it using your help. Thanks again.
>>> s = '1 0 5 test1 5 test2 5 test3 5 test4 5 test5'
>>> s = re.match(r'^d+sd+s?(.*)', s).group(1)
>>> print s
5 test1 5 test2 5 test3 5 test4 5 test5
>>> list = re.findall(r'd+s(w+)', s)
>>> print list
['test1', 'test2', 'test3', 'test4', 'test5']
Use of English B2 for all exams
UNIT 2 Section 2 (A)
Words easily confused
Use the correct form of the words in
the boxes to complete the sentences in each group A-G below. You may use some
of the words more than once. In some cases, more than one word may be
correct.
A
look |
see |
watch |
notice |
regard |
stare |
glance |
observe |
1 |
We spent weeks in Africa ________________________ |
2 |
Did you________________________ the tie he was |
3 |
The students________________________ the new teacher with curiosity. |
4 |
Always________________________ left and then right |
5 |
Don’t ________________________ at people like that! It’s really |
6 |
I ________________________ a great science fiction |
7 |
Before I bought the magazine, I ________________________ through it |
8 |
I couldn’t help________________________ the big red |
9 |
Bill________________________ at his watch and started running. He |
B
find out |
invent |
discover |
detect |
1 |
Many serious illnesses may |
2 |
“We must________________________ as much as we can |
3 |
Was it Captain Cook w ho________________________ Australia? |
4 |
The first camera, the Kodak 1, was________________________ |
C
explore |
investigate |
look for |
look up |
(do) research |
1 |
The police came to |
2 |
I still have________________________ to do for my |
3 |
I must________________________ this word in the dictionary, because |
4 |
Mum, I’m ________________________ my trainers. Have |
5 |
As soon as the five friends got to the cave, they decided to |
D
attempt |
effort |
trial |
experiment |
1 |
It takes a lot of |
2 |
John’s case came to ________________________ and in |
3 |
The athlete failed in his last________________________ to break the |
4 |
Many cosmetic companies claim they don’t carry |
5 |
I worked for the company for a(n)________________________ period of |
E
audience |
spectators |
viewers |
sightseers |
onlookers |
witnesses |
1 |
The ___________________ |
2 |
The two teenagers claimed they were just _______________________ |
3 |
Paris attracts thousands of ________________________ all year round. |
4 |
At the end of the play, the ________________________ |
5 |
The ________________________ were asked to give a detailed |
6 |
The concert was broadcast on TV and attracted one |
F
memorise |
remind |
recall |
recognise |
1 |
I didn’t ______________ her |
2 |
I had to ________________________ his phone number |
3 |
I’ll ring Dad to ________________________ him to buy coffee, |
4 |
My grandfather can still ________________________ scenes |
G
view |
sight |
image |
vision |
scene |
1 |
The sun affects my |
2 |
We have a superb________________________ of the sea |
3 |
The child started to cry at the ________________________ of the |
4 |
The television show was about the |
5 |
The police arrived at the ________________________ of the accident |
6 |
An actor’s ________________________ is important for |
7 |
I ran out of paint, so I couldn’t finish the sky for the |
8 |
When we were leaving the flower show, we were asked |
ANSWER KEY
A
1 |
observing |
2 |
see/notice |
3 |
regarded/ watched/ observed |
4 |
look |
5 |
stare |
6 |
saw/ watched |
7 |
glanced/ looked |
8 |
noticing |
9 |
glanced/ looked |
B
1 |
detected |
2 |
find out |
3 |
discovered |
4 |
invented |
C
1 |
investigate |
2 |
research |
3 |
look up |
4 |
looking for |
5 |
explore |
D
1 |
effort |
2 |
trial |
3 |
attempt |
4 |
experiments |
5 |
trial |
E
1 |
spectators |
2 |
onlookers |
3 |
sightseers |
4 |
audience |
5 |
witnesses |
6 |
viewers |
F
1 |
recognise |
2 |
memorise |
3 |
remind |
4 |
recall |
G
1 |
vision/ sight |
2 |
view |
3 |
sight |
4 |
sights |
5 |
scene |
6 |
image |
7 |
scene |
8 |
views |
поделиться знаниями или
запомнить страничку
- Все категории
-
экономические
43,633 -
гуманитарные
33,652 -
юридические
17,917 -
школьный раздел
611,709 -
разное
16,898
Популярное на сайте:
Как быстро выучить стихотворение наизусть? Запоминание стихов является стандартным заданием во многих школах.
Как научится читать по диагонали? Скорость чтения зависит от скорости восприятия каждого отдельного слова в тексте.
Как быстро и эффективно исправить почерк? Люди часто предполагают, что каллиграфия и почерк являются синонимами, но это не так.
Как научится говорить грамотно и правильно? Общение на хорошем, уверенном и естественном русском языке является достижимой целью.
- Размер: 211 Кб
- Количество слайдов: 39
WORD-GROUPS Lecture
Word-groups vs. phraseological units Words put together to form lexical units make phrases or word-groups. The largest two-facet lexical unit comprising more than one word is the word-group observed on the syntagmatic level of analysis. The degree of structural and semantic cohesion of word-groups may vary. Functionally and semantically inseparable word-groups like at least, point of view, by means of, take place are phraseological units. Semantically and structurally more independent word-groups a week ago, man of wisdom, take lessons, kind to people are defined as free or variable word-groups or phrases
Valency of words The two main linguistic factors to be considered in uniting words into word-groups are: 1) the lexical valency of words 2) the syntactic valency of words.
Lexical valency Words are used in certain lexical contexts, i. e. in combination with other words. The noun question is often combined with such adjectives as vital, pressing, urgent, disputable, delicate , etc. This noun is a component of a number of other word-groups, e. g. to raise a question, a question of great importance, a question of the agenda, a question of the day , and many others. Lexical valency is the possibility of lexical-semantic connections of a word with other words. Lexical collocability is the realisation in speech of the potential connections of a word with other words.
Lexical valency acquires special importance in case of polysemy as through the lexical valency different meanings of a polysemantic word can be distinguished, e. g. 1. heavy weight (safe, table , etc. ), 2. heavy snow (storm, rain , etc. ), 3. heavy drinker (eater , etc. ), 4. heavy sleep (disappointment, sorrow , etc. ), 5. heavy industry (tanks , etc. ), and so on. These word-groups are called collocations or such combinations of words which condition the realization of a certain meaning
The range of the lexical valency of words is linguistically restricted by the inner structure of the English word-stock. Though the verbs lift and raise are treated as synonyms, only raise is collocated with the noun question. The verb take may be interpreted as ‘grasp’, ’seize’, ‘catch’, etc. but only take is found in collocations with the nouns examination, measures, precautions , etc. , only catch in catch smb. napping and grasp in grasp the truth.
The restrictions of lexical valency of words may manifest themselves in the lexical meanings of the polysemantic members of word-groups. The adjective heavy , e. g. , is combined with the words food, meals, supper , etc. in the meaning ‘rich and difficult to digest’. But not all the words with the same component of meaning can be combined with this adjective *heavy cheese or *heavy sausage. The lexical valency of correlated words in different languages is different: pot flowers – комнатные цветы
Syntactic valency — the aptness of a word to appear in different syntactic structures. The minimal syntactic context in which words are used when brought together to form word-groups is described as the pattern of the word-groups. E. g. , the verb to offer can be followed by the infinitive ( to offer to do smth ) and the noun ( to offer a cup of tea ). The verb to suggest can be followed by the gerund ( to suggest doing smth ) and the noun ( to suggest an idea ). The syntactic valency of these verbs is different.
The adjectives clever and intelligent are seen to possess different syntactic valency as clever can be used in word-groups having the pattern: Adjective-Preposition at+Noun : clever at mathematics , whereas intelligent can never be found in exactly the same word-group pattern. The syntactic valency of correlated words in different languages is not identical, in English to influence a person, a decision, a choice ( verb +noun ) — in Russian влиять на человека, на решение, на выбор ( verb+ preposition+noun ).
T he individual meanings of a polysemantic word may be described through its syntactic valency: Keen + N: keen sight, hearing, etc. Keen + on + N : keen on sports, tennis, etc. Keen + V(inf): keen to know, to find out, etc. Thus word-groups may be regarded as minimal syntactic (or syntagmatic) structures that operate as distinguishing clues for different meanings of a polysemantic word.
INTERDEPENDENCE OF STRUCTURE AND MEANING IN WORD-GROUPS Syntactic structure and pattern of word-groups is the description of the order and arrangement of member-words in word-groups as parts of speech. The syntactic structure of the word-group an old woman, a blue dress, clever man, red flower is an adjective and a noun, i. e. A+N ; The syntactic structure of the word-groups wash a car, read books, take books, build houses – as a verb and a noun, i. e. V+N. The syntactic structure of the word-groups a touch of the sun, a matter of importance — as a preposition and a noun, i. e. N+prp+N.
Structural formulas: 1. V+N: ( to build houses ), 2. V+prp+N: ( to rely on somebody ), 3. V+N+prp+N: ( to hold something against somebody ), 4. V+N+V(inf. ): ( to make somebody work ), 5. V+ V(inf. ): ( to get to know ), and so on.
Syntactic structure of word-groups Word-groups may be described through the order and arrangement of the component members: To see sth – verbal-nominal group; To see to sth – (If you see to something that needs attention, you deal with it) verbal-prepositional-nominal, etc.
Word-groups may be classified according to their headwords into: 1. Nominal: red flower ; 2. Adjectival: kind to people ; 3. Verbal: to speak well , etc. The head is not necessarily the component that occurs first in the word-group: great bravery, bravery in the struggle the noun bravery is the head whether followed or preceded by other words.
Thus the structure of word-groups may also be described in relation to the head-word. In this case it is usual to speak of the pattern but not of formulas. E. g. , the patterns of the verbal groups to read a book, to wash a car are to read + N, to wash + N ; to rely on somebod y – to rely+on+N. Syntactic pattern implies the description of the structure of the word-group in which a given word is used as its head.
The interdependence of the pattern and meaning of head-words can be easily perceived by comparing word-groups of different patterns in which the same head-word is used. Three patterns with the verb ‘get’ as the head-word represent three different meanings of this verb: 1. get+ N ( get a letter, information, money , etc. ); 2. get+ to +N ( get to Moscow, to the Institute , etc. ); 3. get+N+V(inf. ) ( get somebody to come, to do the work, etc. ).
Notional member-words are habitually represented in conventional symbols whereas prepositions and other form-words are given in their usual graphic form. This is accounted for by the fact that individual form-words may modify or change the meaning of the word with which it is combined, as in, e. g. : 1. anxious+for+ N ( anxious for news ), anxious+about+ N ( anxious about his health ). the difference in the meaning of the head-word is conditioned by a difference in the pattern of the word-group in which this word is used
Syntactic patterns are classified into: 1. predicative word-groups have a syntactic structure similar to that of a sentence, they comprise the subject and the predicate, e. g. he went, John works. 2. non-predicative word-groups do not comprise the subject and the predicate and may be subdivided into a) subordinative (e. g. red flower, a man of wisdom ); b) coordinative (e. g. women and children, do or die ).
Classification of word-groups 1. ENDOCENTRIC WORD-GROUPS have one central member functionally equivalent to the whole word-group. In the word-group blue dress, friendly to people , the head-words are the noun dress and the adjective friendly correspondingly. According to their central members word-groups may be classified into: a) nominal groups or phrases ( blue dress ), b) adjectival groups ( friendly to people ), c) verbal groups ( to sing well ), etc.
2. EXOCENTRIC WORD-GROUPS have no central component and the distribution of the whole word-group is different from either of its members. For instance, the distribution of the word-groups side by side, at first, grow smaller is not identical with the distribution of their component-members, i. e. the component-members are not syntactically substitutable for the whole word-group.
TYPES OF MEANING OF WORD-GROUPS The lexical meaning – the combined lexical meaning of the component words, e. g. a blind man may be described denotationally as the combined meaning of the words blind and man. In most cases the lexical meanings of the word-group predominates over the lexical meanings of its components, e. g. blind alley, blind date. Polysemantic words are used in word-groups only in one of their meanings. These meanings of the component words in such word-groups are mutually interdependent and inseparable. Semantic inseparability of word-groups treats them as self-contained lexical units.
The structural meaning of the word-group is the meaning conveyed mainly by the pattern of arrangement of its components, e. g. , such word-groups as school grammar and grammar school are semantically different because of the difference in the pattern of arrangement of the component words. The structural meaning is the meaning expressed by the pattern of the word-group.
Interrelation of lexical and structural meaning in word-groups The lexical and structural components of meaning in word-groups are interdependent and inseparable. The structural pattern in all the day long, all the night long, all the week long in ordinary usage and the word-group all the sun long is identical. The generalised meaning of the pattern ‘a unit of time’. Replacing day, night, week by another noun the sun structural meaning of the pattern does not change. The group all the sun long functions semantically as a unit of time. But the noun sun included in the group continues to carry the semantic value or the lexical meaning that it has in word-groups of other structural patterns (cf. the sun rays, African sun , etc. ).
It follows that the meaning of the word-group is derived from the combined lexical meanings of its constituents and is inseparable from the meaning of the pattern of their arrangement. a factory hand − ‘a factory worker’ a hand bag − ‘a bag carried in the hand’. Though the word hand makes part of both its lexical meaning and the role it plays in the structure of word-groups is different which accounts for the difference in the lexical and structural meaning of the word-groups under discussion. Thus, the meaning of the word-group is derived from the combined lexical meanings of its constituents and is inseparable from the meaning of the pattern of their arrangement.
Polysemantic and monosemantic patterns Word-groups represented by different structural formulas are as a rule semantically different because of the difference in the grammatical component of meaning. Structurally identical patterns, e. g. heavy+ N , may be representative of different meanings of the adjective heavy which is perceived in the word-groups heavy rain (snow, storm), heavy smoker (drinker), heavy weight (table), etc. all of which have the same pattern — heavy +N.
Structurally simple patterns are as a rule polysemantic, i. e. representative of several meanings of a polysemantic head-word, whereas structurally complex patterns are monosemantic and condition just one meaning of the head-member. The simplest verbal structure V+N and the corresponding pattern are as a rule polysemantic (compare, e. g. take +N (take tea, coffee); take the bus, the tram, take measures, precautions, etc. ), whereas a more complex pattern, e. g. take+to+ N is monosemantic (e. g. take to sports ).
MOTIVATION IN WORD-GROUPS A word-group is lexically-motivated if the combined lexical meaning of the group is deducible from the meaning of its components, e. g. red flower, heavy weight, take lessons. If the combined lexical meaning of a word-group is not deducible from the lexical meanings of its constituent components, such a word-group is lexically non-motivated , e. g. red tape (official bureaucratic methods) take place (occur).
The degree of motivation can be different. Between the extremes of complete motivation and lack of motivation there are innumerable intermediate cases. E. g. , the degree of lexical motivation in the nominal group black market is higher than in black death , but lower than in black dress , though none of the groups can be considered completely non-motivated.
Completely motivated word-groups are correlated with certain structural types of compound words. Verbal groups having the structure V+N , e. g. to read books, to love music , etc. , are habitually correlated with the compounds of the pattern N+(V+er) (book-reader, music-lover); adjectival groups such as A+prp+N (e. g. rich in oil, shy before girls ) are correlated with the compounds of the pattern N+A , e. g. oil-rich, girl-shy.
Seemingly identical word-groups are sometimes found to be motivated or non-motivated depending on their semantic interrelation. Thus, apple sauce is lexically motivated when it means ‘a sauce made of apples’ but when used to denote ‘nonsense’ it is clearly non-motivated. Completely non-motivated or partially motivated word-groups are called phraseological units or idioms.
Summary and Conclusions 1. Words put together to form lexical units make up phrases or word-groups. The main factors active in bringing words together are lexical and syntactic valency of the components of word-groups.
2. Lexical valency is the aptness of a word to appear in various collocations. All the words of the language possess a certain norm of lexical valency. Restrictions of lexical valency are to be accounted for by the inner structure of the vocabulary of the English language.
3. Lexical valency of polysemantic words is observed in various collocations in which these words are used. Different meanings of a polysemantic word may be described through its lexical valency.
4. Syntactic valency is the aptness of a word to appear in various syntactic structures. All words possess a certain norm of syntactic valency. Restrictions of syntactic valency are to be accounted for by the grammatical structure of the language. The range of syntactic valency of each individual word is essentially delimited by the part of speech the word belongs to and also by the specific norm of syntactic valency peculiar to individual words of Modern English.
5. The syntactic valency of a polysemantic word may be observed in the different structures in which the word is used. Individual meanings of a polysemantic word may be described through its syntactic valency.
6. Structurally, word-groups may be classified by the criterion of distribution into endocentric and exocentric. Endocentric word-groups can be classified according to the head-word into nominal, adjectival, verbal and adverbial groups or phrases.
8. Semantically all word-groups may be classified into motivated and non-motivated. Non-motivated word-groups are usually described as phraseological units.
References 1. Зыкова И. В. Практический курс английской лексикологии. М. : Академия, 2006. – С. 121 -124. 2. Гинзбург Р. З. Лексикология английского языка. М. : Высшая школа, 1979. – С. 64 -74. 3. Антрушина Г. Б. , Афанасьева О. В. , Морозова Н. Н. Лексикология английского языка. М. : Дрофа, 2006. – С. 225 — 256.
WORD-GROUPS