I try to learn Norwegian by reading the CLUE series of Jørn Lier Horst. But there is a problem: I cannot find the pronunciation of a lot of words.
So, as other posts already pointed out, Norwegian is a language with two written standards but no spoken standard assigned to any of them, which is an enviable policy. On the other hand, the orthographic system is close phonemic (compared to which variety, is a great question, in light of the first sentence, but it's probably Urban East Norwegian). As a consequence, most dictionaries don't list pronunciations, either to avoid prescribing one dialect's pronunciation over another or because they feel that it's obvious from spelling. Which is not.
And it's not a great help for language learners, especially like me, who prefer to learn languages as mostly logical systems, whith phonemes as basic building blocks. I also think that the spoken language is the "original", and the written language merely tries to represent it (with more or less success). (The opposite can also happen, that's called spelling pronunciation, and usually is a form of hypercorrection, and from what I read, the Bokmål side of the language maybe started as pronouncing written Danish in a Norwegian way, oversymplifying the situation, but that's another story.) This also means, that I would be more than happy to learn to speak in any dialect - in fact, I started learning Bergensk first (and also developed an Australian accent in English consciously)-, but sadly enough, Bokmål with an Urban East Norwegian pronunciation has the most resources, including Duolingo, so I had to stick to it.
However phonemic Bokmål orthography looks, it doesn't mark 1) the stressed syllable 2) the tone of the word, 3) when the letter ⟨e⟩'s pronunciation reduces to /ə/ and 4) the length of vowels at all. It has the means to, but marks inconsistently 5) the sounds /o/ and /u/ with the letter ⟨o⟩, 6) the sounds /u/ and /ʉ/ with the letter ⟨o⟩, 7) the sounds ⟨e,æ⟩ by either of the letters [e,æ], 8) the length of consonants (mostly in the case of word final ⟨m⟩, e.g. ⟨kom⟩=/komː/, a lot of other things, probably for etymological reasons, for example 9) ⟨d⟩, ⟨g⟩, ⟨t⟩ are inconsistently silent in certain enviroments or ⟨g⟩ is pronounced /j/ or /ŋ/ unexpectedly 10) and there are loandwords spelled in a foreign way as well. The retroflex sounds are hopefully predictable, except the 'tykk l' /ɽ/, which is maybe avoided in educated speech.
Based on what I read and observed, my theory is that 2) tone, 4) long vowels and 8) long consonants are properties of 1) the stressed syllable, so they cannot appear outside the stressed syllable and the stressed syllable must have either of the two tones and either a long vowel or a long consonant. On the other hand 3) can only appear outside stressed sillables. This means, that if I mark a long consonant or a vowel with a ':' put after them, this will be the stressed syllable with tone 1. If it's tone 2, I can use ';' instead. Example: 'Gut:ten læ;rer sam:men me:d jen;ta.'
Since the original text is mostly phonemic, one doesn't have to use IPA to rewrite the whole text. You can modify the original by small signs and diacritics where the spelling of a word is inconsistent or insufficient, so that both the pronunciation and the original orthography is recovereble. For example, I can use a dot under a consonant to denote, that it's retroflex, e.g. 'barṇ'. I can put silent letters into angle brackets: 'Han: sel;⟨g⟩er ḷan:⟨d⟩e⟨t⟩'. In the opposite case, when a sound is pronounced but not written, I can use angle brackets: 'oran:[g]sje'. You can combine the two brackets, when a part of the word is pronounced differently than written: 'p⟨en⟩[ang]sjona:t'. For the most commont combinations, its convenient to make new symbols:
⟨o⟩[å]=ò, ⟨u⟩[o]=ù, ⟨e⟩[æ]=è, ⟨g⟩[j]=ġ. If a letters pronunciation is optional, I use parentheses, and I use [ŧ] at the end of neuter definite nouns, since they are pronounced in the genitive: 'ste:(d)eŧ', 'ste:(d)eŧs'.
In the cases that are not ambiguous, I stick to the original Norwegian orthography, for example the combinations: tj, sj, skj, lj, hj, hv; kj, ki, ky; gj, gi, gy; skj, ski, sky.
The different pronunciations of ⟨e⟩ are more complicated. When it's reduced, I use [ə], if it's absorbed in the following consonant making it syllabic, I use [']. When it turns to /æ/ in front of ⟨r⟩, I write [è]. I don't know the exact rule to these changes, so I have to mark them every time they happen.
Okay, sorry, it's quite a lot of text, I don't know how to make it more compact. The main thing is, this is the part in front of the CLUE book, where the main characters are introduced, with the pronunciation indicated:
###########################################################
Cecilia Gaathe bo:r på Pè;rḷ'ṇ p⟨en⟩[ang]sjona:t sam:m'n me:⟨d⟩ fa:r'ṇ ṣin, Alan W. Gaathe, som: ei;ər ṣṭe:(d)əŧ. Mo:r'ṇ ṣin, Iselin Gaathe, drùk;nət på mys:tisk vi:s i Skutebukta fòr;r⟨i⟩ġə sòm;mər.
Leo Bast ha:r ṇet:tòpp flyt;tət til: Skutebukta. Han: bo:r i den priva:tə de:lən av p⟨en⟩[ang]sjona:təŧ sam:m'n me:⟨d⟩ mo:r'ṇ ṣin, Rebekka, sòm: è:r ṇy:an:satt i stil;lingən sòm: hotel:lsje:f. Fòrel:drənə han:s è:r skil:t. Fa:r'ṇ è:r [s]jo⟨u⟩rṇalis:t ò⟨g⟩ bo:r i Dubai.
Une Flaker bo:r ṣam:m'n me:⟨d⟩ fòrel:drənə ò⟨g⟩ d⟨e⟩[i]: to: brø;drənə si;nə i et gam;m'lt skip:pərhu:s på øs:tsi;d'n av Skutebukta. Dè:r ha:r hun: bod:d bestan:di⟨g⟩. Fa:r'ṇ, Widar Flaker, è:r fi;skər, men:s mo:r'ṇ è:r ḷæ;rər på sko;l'n in;nə i by:ən.
Egon è:r hun:⟨d⟩'n til: Une. Den he;tər Egon fòrḍi: den è:r ṣå e;g'n ò⟨g⟩ sta:, ò⟨g⟩ sto:rṭ ṣet:t gjø:r ṣòm: den vil:. Deŧ è:r en kne:høy: blan;⟨d⟩ingshun:⟨d⟩, me:⟨d⟩ bru:n, krøl;lətə pel:s, lan:g ha;lə ò⟨g⟩ vå:t snu;tə. Deŧ si;əs at han: stam;mər fra: en politi:hun:⟨d⟩.
Pè;rḷ'ṇ p⟨en⟩[ang]sjona:t
P⟨en⟩[ang]sjona:təŧ Pè;rḷ'ṇ ble: byg;gət av Cecilias tip:pòldəfa:r fòr: ò:vər hun;drə å:r ṣi;d'n. Den gan:gən kòm: gjes;t'nə me:⟨d⟩ dam:pbå:t til: dy:pvan:nskai:a, el:lər me:d jè:rnba;n'n til: by:ən ò⟨g⟩ vi;dərə dèr:fra til: kys:t'n me:⟨d⟩ hes:t ò⟨g⟩ kjèr:rə. Un:⟨d⟩ər kri:g'n ble: ei;əndòm'n tat:t av tys:kərṇə, sòm: lo:t òfise:r'ṇə si;nə bo: dè:r. Et;tərpå: ha:r ḍeŧ vært: fle:rə fòrskjel:li⟨g⟩ə ei;ərə he:lt til: mo:r'ṇ ò⟨g⟩ fa:r'ṇ ṭil: Cecilia kjøp;tə deŧ tilba:;kə, pus;sət deŧ òp:p ò⟨g⟩ begyn:tə å lei;ə deŧ u:t til: gjes;tər. Cecilia ò⟨g⟩ fa:r'ṇ bo:r i en priva:t de:l i an;drə eta:sjə på p⟨en⟩[ang]sjona:təŧ. Deŧ sam;mə gjø:r Ḷeo ò⟨g⟩ mo:r'ṇ. Høy:t ò:vər res:t'n av p⟨en⟩[ang]sjona:təŧ ra;gər ṭå:rṇvæ;r{əls/'ḷṣ}əŧ.
###########################################################
I looked up the words from the site of (NAOB)[https://naob.no/], which only provides the pronunciations of base words, and not their inflected forms such as definite or plural forms, or compound words. I heard that exactly those words have tone 2, which were multisillabic in Old Norse, so I guessed, that the plural form of words and definite form of adjectives must have tone 2, but not the definite forms of nouns, which is the same as what this source says. This is why I also think that loanwords have tone 1, no matter how inflected. In the case of compound words, I have no idea, so I just wrote the pronunciation of their components, and underlined them. I wouldn't be surprized if they retained their long vowels and consonants even after the stress gets removed from one of the components.
Verbs seem a bit more complicated, the infinitive of most have tone 2, but prefixed verbs seem to have tone 1. I don't know about the inflected forms like present, past, passive and preteritum, however. They are in bold.
Also, I didn't mark the pronunciation of these short unstressed words frequently appearing in the text:
på(:) i(:) den:/d(ə)n de:ŧ/dəŧ e:n/ən et(:) a(:)(v) ò(:)⟨g⟩ så(:) at(:) ṣi{:n/n:}
and I have no idea how to pronounce these proper names:
⟨Cecilia Gaathe⟩?[Səsi:liə Gå;tə]
⟨Alan W. Gaathe⟩?[Ala:n Dåb:'ltve: Gå;te]
⟨Iselin Gaathe⟩?[Isəli:n Gå;tə]
⟨Leo Bast⟩?[Le:å Bas:t]
⟨Rebekka⟩?[Rəbek:ka]
⟨Une Flaker⟩?[U;nə Fla;kər]
⟨Widar Flaker⟩?[Vi;(d)ar Fla;kər]
⟨Egon⟩?[E:gån]
⟨Skutebukta⟩?[Sku;təbuk:ta]
⟨Dubai⟩?[Dubai:]
So, I mainly need your feedback about my theories, if there are any mistakes in the text (even if you think it's only a typo), and mainly, the pronunciations of compound words and names and inflected words. After this long text, I'm pretty sure I left out something important, so ask, if something is not clear. Although my question is mainly about Urban East Norwegian, I'm also interested in other dialects, actually, but of course, they will work according to a different system. Also, if you have any advice for formatting, let me know.


