[The English version of this paper can be found at http://ssrn.com/abstract=2015741. And this paper is theoretically influenced by the anthropological findings of Christopher Small, Alpons Siberman (who did works on Afro-Asian Music). The haunting question is: how do so-called primitive societies(?) preserved highly melodious speech? Whereas so-called civilized technocratic societies are losing speech melody. Is it due to the “crippled creativity hypothesis” (cf. http://papers.ssrn.com/sol3/papers.cfm?abstract_id=2015334 , http://papers.ssrn.com/sol3/papers.cfm?abstract_id=2014708 and http://ssrn.com/abstract=2019504) of the present author? It is also influenced by the so-called “illiterate” (?)singers, who can make notation without any academiocratic training in acoustics-science and of course Barthes’ musica practica and Rabindranath Tagore’s concept of music. I am repeating, I am deeply indebted to “illiterate” “primitive” “tribes” (homo sapiens?) and have made a sin by patenting it. In this way, socially accumulated knowledge has become intellectual private property!]
Existing literature on gauging intonational contours in Linguistics, mainly depends on the three parameters (High, Mid, Low) to be determined by the trained ears of the professional Phoneticians. Though there are software and machines for analyzing sound waves, those software do not provide us with a generalized picture of intonational contours with varied and specific reference points with calculated intervals. Parameters like “High, Mid, Low” (as they are used in Linguistics) as reference points for intonational contours do not tell us anything as none exactly knows the proportionate distances or intervals among such naïve parameters. Therefore, I thought to take cue from music, where there is a rich system of archewriting or making notation of musical pieces.
This paper deals with the differAnce (a la Derrida) between so-called normal speech and music. The author, for the time being, is deferring the difference between speech and music. Though the melody of music is “heavy” and the melody of speaking is “light”, still both of them follows the same algorithm — the algorithm of do, re, mi, fa, so, la, ti and do 2, i.e., the ad hoc hypothesis is that, any normal speech falls within the octave that has a definite range. If the lower DO 1 is x, upper Sa DO 2 is 2x and this x to 2x octave range is considered as y, the intermediate noticeable frequency-points, as per Indian musical system, are 22. Therefore, our point of concentration is Y/22. In a scale y that has a range of x to 2x (i.e., any points on y, say yp, satisfies the relation x yp 2x). And the interval between two notes follows logarithmic pattern as the gap between two notes (say yp and yq ) is 2 multiplied by root 22 (please note the human love for binary-figure in the formulated representation). Thus, this hypothesis leads to an activity — i.e., making notation of “normal” speech to understand the intonation pattern of speaking vis a vis musicking. However, it does not pay any heed to something called “(word) stress.” In this work, the status of word in a sentence or discourse is contested. . What is “word” really, especially in this type of pre-lexical studies? (cf. http://ssrn.com/abstract=2016136) To the author, “word” is a culture-specific concept or rather citation form, which has only visual representation — there is no such representation in the game of speaking. A literate speaking subject, in his/her printing culture, has only a visual sensation of word. If word is to be defined as a something (visual black or any other colored figure) in between two (white or any other colors) spaces (grounds), the boundaries of word depend on the particular literate community’s way of manipulating blank spaces in their printing/writing. The boundaries/spaces as defined by morphology do not exist when a speaking subject is engaged in a speaking. At that moment of speaking, from the subject’s position, it is not word-stress, but it is rather a harmony of a discourse, which s/he is expressing as a continuum without ontologically being conscious about the grammarians’ order of things (different levels of language, viz. phoneme, morpheme, word, phrase, sentence…). As word does not exist, the word-stress is also an absentee at the moment of speaking. It is meaningless to account stress by isolating a ‘word’ from the speech continuum. When, in Linguistics, intonation pattern is attested by surveying a sample population, a crucial variable of that particular community is totally ignored. That is their cultural audio-exposure to the unintended sounds or non-discursive sonority (that is, the noises, music, and rhythm of the habitat or the non-discursive sounds in which the particular population inhabits). It is not possible to gauge the intonation pattern of particular speech community without noticing this context-specificity of non-discursive sonority. This work considers this non-discursive sonority as one of the variables (other variables are age and gender and speed of speech, speech Act) for attesting intonation.