Interaction of semantic and syntactic ambiguity in the light of discrimination learning
Conference object (Published version)
Metadata
Show full item recordAbstract
In this research we wanted to investigate whether the effects observed during simultaneous
processing of semantic and syntactic lexical ambiguity could be interpreted within the framework
of error-driven learning. To investigate the interaction of the two types of ambiguity we focused
on polysemy in a highly inflected Serbian noun system and attempted to simulate their processing
effects using naive discriminative learning (Baayen et al., 2011).
Polysemy is the type of lexical ambiguity where one word can have multiple related
senses. For example, isolated word paper could refer to the writing paper, i.e., paper as the
material, but also to scientific paper, and even a daily paper. Polysemous words are a complex
phenomenon, whose processing is affected by number of senses, probability distribution of those
senses, and degree of relatedness among the senses (Filipović Đurđević & Kostić, 2009; 2017,
under revision; Klepousniotou, 2002; Rodd et al., 2002). Additionally, in Serbi...an, words can take
up to seven inflected forms. Syntactic ambiguity of an isolated inflected form is reflected in the
multitude of syntactic roles the given inflected form can take in the sentence. For example,
inflected masculine noun konja (horse) can indicate the subject in the sentence (Dva konja su
trčala / Two horses were running), but also the object (Jahao sam konja / I rode the horse). It has
been demonstrated that different aspects of syntactic ambiguity affect lexical processing as well:
information load based on relative frequency of the inflected form within its inflectional class and
the number of syntactic functions and meanings (Kostić, 1991), inflectional entropy (Baayen et al,
2006), relative entropy (Milin et al., 2009), etc.
By definition, polysemous words are equally ambiguous in all of the inflected forms
(Gortan-Premk, 2004). Hence, the inflected form does not serve as the cue for their true meaning.
However, some research suggested that meaning can shape the way a noun is used in a
sentence (Kostić et al., 2003).
We presented 35 polysemous nouns of masculine gender in a visual lexical decision task
to 74 participants (data collection still ongoing). Each noun was presented in one of its seven
forms in a latin-square design. Entropy of the sense frequency distribution was estimated in a
norming study (Filipović Đurđević & Kostić, 2017). Relative frequencies of inflected forms and the
number of syntactic functions and meanings were taken from Kostić (1965). Discrimination
learning based predictors were derived from cue-outcome weights matrix calculated by
equilibrium equations (Danks, 2003), implemented in the ndl package (Arppe, et al. 2015). Cues
were bigrams of the polysemous words’ inflected forms presented in the experiment. Outcomes
were their lemmata and 1000 co-occurring context words.
We modelled processing times by applying GAMMs (Wood, 2006) and compared two
models. Information-theoretic model revealed the expected facilitatory effect of entropy of word
senses, but only in the nominative form. Discriminative predictors (Milin et al, 2017) affected
processing in some word forms, again revealing the interaction. AIC comparison showed that the
discrimination-based model was superior to the information-theoretic one. Correlations between
the two sets of measures revealed some interesting relations. Activation was more sensitive to
semantic ambiguity, whereas diversity captured both semantic and syntactic ambiguity. This
suggests a more complex interplay of semantics and morpho-syntax than previously thought and
the possibility of capturing such an interaction with discrimination-based diversity measures.
Keywords:
semantic ambiguity / polysemy / simultaneous processingSource:
Book of Abstracts, International Conference on Error-Driven Learning in Language (EDLL 2021), March 10 - 12, University of Tübingen, 2021, 32-Funding / projects:
- Ministry of Science, Technological Development and Innovation of the Republic of Serbia, institutional funding - 200163 (University of Belgrade, Faculty of Philosophy) (RS-MESTD-inst-2020-200163)
Institution/Community
Psihologija / PsychologyTY - CONF AU - Mišić, Ksenija AU - Filipović Đurđević, Dušica PY - 2021 UR - http://reff.f.bg.ac.rs/handle/123456789/5146 AB - In this research we wanted to investigate whether the effects observed during simultaneous processing of semantic and syntactic lexical ambiguity could be interpreted within the framework of error-driven learning. To investigate the interaction of the two types of ambiguity we focused on polysemy in a highly inflected Serbian noun system and attempted to simulate their processing effects using naive discriminative learning (Baayen et al., 2011). Polysemy is the type of lexical ambiguity where one word can have multiple related senses. For example, isolated word paper could refer to the writing paper, i.e., paper as the material, but also to scientific paper, and even a daily paper. Polysemous words are a complex phenomenon, whose processing is affected by number of senses, probability distribution of those senses, and degree of relatedness among the senses (Filipović Đurđević & Kostić, 2009; 2017, under revision; Klepousniotou, 2002; Rodd et al., 2002). Additionally, in Serbian, words can take up to seven inflected forms. Syntactic ambiguity of an isolated inflected form is reflected in the multitude of syntactic roles the given inflected form can take in the sentence. For example, inflected masculine noun konja (horse) can indicate the subject in the sentence (Dva konja su trčala / Two horses were running), but also the object (Jahao sam konja / I rode the horse). It has been demonstrated that different aspects of syntactic ambiguity affect lexical processing as well: information load based on relative frequency of the inflected form within its inflectional class and the number of syntactic functions and meanings (Kostić, 1991), inflectional entropy (Baayen et al, 2006), relative entropy (Milin et al., 2009), etc. By definition, polysemous words are equally ambiguous in all of the inflected forms (Gortan-Premk, 2004). Hence, the inflected form does not serve as the cue for their true meaning. However, some research suggested that meaning can shape the way a noun is used in a sentence (Kostić et al., 2003). We presented 35 polysemous nouns of masculine gender in a visual lexical decision task to 74 participants (data collection still ongoing). Each noun was presented in one of its seven forms in a latin-square design. Entropy of the sense frequency distribution was estimated in a norming study (Filipović Đurđević & Kostić, 2017). Relative frequencies of inflected forms and the number of syntactic functions and meanings were taken from Kostić (1965). Discrimination learning based predictors were derived from cue-outcome weights matrix calculated by equilibrium equations (Danks, 2003), implemented in the ndl package (Arppe, et al. 2015). Cues were bigrams of the polysemous words’ inflected forms presented in the experiment. Outcomes were their lemmata and 1000 co-occurring context words. We modelled processing times by applying GAMMs (Wood, 2006) and compared two models. Information-theoretic model revealed the expected facilitatory effect of entropy of word senses, but only in the nominative form. Discriminative predictors (Milin et al, 2017) affected processing in some word forms, again revealing the interaction. AIC comparison showed that the discrimination-based model was superior to the information-theoretic one. Correlations between the two sets of measures revealed some interesting relations. Activation was more sensitive to semantic ambiguity, whereas diversity captured both semantic and syntactic ambiguity. This suggests a more complex interplay of semantics and morpho-syntax than previously thought and the possibility of capturing such an interaction with discrimination-based diversity measures. C3 - Book of Abstracts, International Conference on Error-Driven Learning in Language (EDLL 2021), March 10 - 12, University of Tübingen T1 - Interaction of semantic and syntactic ambiguity in the light of discrimination learning SP - 32 UR - https://hdl.handle.net/21.15107/rcub_reff_5146 ER -
@conference{ author = "Mišić, Ksenija and Filipović Đurđević, Dušica", year = "2021", abstract = "In this research we wanted to investigate whether the effects observed during simultaneous processing of semantic and syntactic lexical ambiguity could be interpreted within the framework of error-driven learning. To investigate the interaction of the two types of ambiguity we focused on polysemy in a highly inflected Serbian noun system and attempted to simulate their processing effects using naive discriminative learning (Baayen et al., 2011). Polysemy is the type of lexical ambiguity where one word can have multiple related senses. For example, isolated word paper could refer to the writing paper, i.e., paper as the material, but also to scientific paper, and even a daily paper. Polysemous words are a complex phenomenon, whose processing is affected by number of senses, probability distribution of those senses, and degree of relatedness among the senses (Filipović Đurđević & Kostić, 2009; 2017, under revision; Klepousniotou, 2002; Rodd et al., 2002). Additionally, in Serbian, words can take up to seven inflected forms. Syntactic ambiguity of an isolated inflected form is reflected in the multitude of syntactic roles the given inflected form can take in the sentence. For example, inflected masculine noun konja (horse) can indicate the subject in the sentence (Dva konja su trčala / Two horses were running), but also the object (Jahao sam konja / I rode the horse). It has been demonstrated that different aspects of syntactic ambiguity affect lexical processing as well: information load based on relative frequency of the inflected form within its inflectional class and the number of syntactic functions and meanings (Kostić, 1991), inflectional entropy (Baayen et al, 2006), relative entropy (Milin et al., 2009), etc. By definition, polysemous words are equally ambiguous in all of the inflected forms (Gortan-Premk, 2004). Hence, the inflected form does not serve as the cue for their true meaning. However, some research suggested that meaning can shape the way a noun is used in a sentence (Kostić et al., 2003). We presented 35 polysemous nouns of masculine gender in a visual lexical decision task to 74 participants (data collection still ongoing). Each noun was presented in one of its seven forms in a latin-square design. Entropy of the sense frequency distribution was estimated in a norming study (Filipović Đurđević & Kostić, 2017). Relative frequencies of inflected forms and the number of syntactic functions and meanings were taken from Kostić (1965). Discrimination learning based predictors were derived from cue-outcome weights matrix calculated by equilibrium equations (Danks, 2003), implemented in the ndl package (Arppe, et al. 2015). Cues were bigrams of the polysemous words’ inflected forms presented in the experiment. Outcomes were their lemmata and 1000 co-occurring context words. We modelled processing times by applying GAMMs (Wood, 2006) and compared two models. Information-theoretic model revealed the expected facilitatory effect of entropy of word senses, but only in the nominative form. Discriminative predictors (Milin et al, 2017) affected processing in some word forms, again revealing the interaction. AIC comparison showed that the discrimination-based model was superior to the information-theoretic one. Correlations between the two sets of measures revealed some interesting relations. Activation was more sensitive to semantic ambiguity, whereas diversity captured both semantic and syntactic ambiguity. This suggests a more complex interplay of semantics and morpho-syntax than previously thought and the possibility of capturing such an interaction with discrimination-based diversity measures.", journal = "Book of Abstracts, International Conference on Error-Driven Learning in Language (EDLL 2021), March 10 - 12, University of Tübingen", title = "Interaction of semantic and syntactic ambiguity in the light of discrimination learning", pages = "32", url = "https://hdl.handle.net/21.15107/rcub_reff_5146" }
Mišić, K.,& Filipović Đurđević, D.. (2021). Interaction of semantic and syntactic ambiguity in the light of discrimination learning. in Book of Abstracts, International Conference on Error-Driven Learning in Language (EDLL 2021), March 10 - 12, University of Tübingen, 32. https://hdl.handle.net/21.15107/rcub_reff_5146
Mišić K, Filipović Đurđević D. Interaction of semantic and syntactic ambiguity in the light of discrimination learning. in Book of Abstracts, International Conference on Error-Driven Learning in Language (EDLL 2021), March 10 - 12, University of Tübingen. 2021;:32. https://hdl.handle.net/21.15107/rcub_reff_5146 .
Mišić, Ksenija, Filipović Đurđević, Dušica, "Interaction of semantic and syntactic ambiguity in the light of discrimination learning" in Book of Abstracts, International Conference on Error-Driven Learning in Language (EDLL 2021), March 10 - 12, University of Tübingen (2021):32, https://hdl.handle.net/21.15107/rcub_reff_5146 .