Proto-Albanian
Albanoid
Reconstruction ofAlbanian (Albanian dialects)
RegionBalkan Peninsula
Erac. 1000 BCE[1] – 600 CE[2]
Reconstructed
ancestor
NotesPaleo-Balkan

Proto-Albanian is the ancestral reconstructed language of Albanian, before the GhegTosk dialectal split. Albanian and other Paleo-Balkan languages had their formative core in the Balkans after the Indo-European migrations in the region.[3][4] Whether descendants or sisters of what was called Illyrian by classical sources, Albanian and Messapic, on the basis of shared features and innovations, are grouped together in a common branch in the current phylogenetic classification of the Indo-European language family.[5][6][7][8][9] The earlier phase of Albanian is also labelled Albanoid in reference to a specific ethnolinguistically pertinent and historically compact language group.[7]

Proto-Albanian is reconstructed by way of the comparative method between the Tosk and Gheg dialects, as well as the treatment of loanwords, which can be traced back as early as the first contacts with Ancient Greek, but the most important of which are those from Latin (dated by De Vaan to the period 167 BCE to 400 CE) and from Slavic (dated from 600 CE onward).[10] The evidence from loanwords allows linguists to construct in great detail the shape of native words at the points of major influxes of loans from well-attested languages.[11]

Proto-Albanian is broken up into different stages which are usually delimited by the onset of contact with different well-attested languages.[10] Its earliest stages are dated to the early Roman Empire, just before the period of intense Latin-Albanian contact, while in its late stages it experienced contact with Slavic languages.[12][13][14] The Tosk-Gheg split is known to predate Slavic contact circa 600 CE, as evidenced by the fact that Latin and ancient Greek loanwords are treated like native words with regard to taxonomical differences between Gheg and Tosk, but the same is not true of Slavic loans.[15][16][17]

History

Indo-European phylogenetic tree where the IE dialect that gave rise to Albanian splits from Post-Tocharian Indo-European, that is the residual Indo-European unity which remained after Tocharian's splitting from Post-Anatolian Indo-European (by Chang et al. 2015).[18]

Albanian and other Paleo-Balkan languages had their formative core in the Balkans after the Indo-European migrations in the region.[3][4] Although research is ongoing, in current phylogenetic tree models of the Indo-European language family, the IE dialect that gave rise to Albanian splits from "Post-Tocharian Indo-European", that is the residual Indo-European unity ("Core Indo-European") which remained after Tocharian's splitting from "Post-Anatolian Indo-European".[19][18]

Paleo-Balkanic

Armenian

Graeco-Albanian

Graeco-Phrygian

Illyric

Messapic

Albanian

Gheg

Tosk

Arvanitika

Arbëresh

Mainland Tosk

Albanian and Messapic in the Paleo-Balkanic branch based on "The Indo-European Language Family" by Brian D. Joseph and Adam Hyllested (2022)

Albanian in antiquity is often thought to have been an Illyrian language for obvious geographic and historical reasons,[20][21][22][23][24][25] or otherwise an unattested Balkan Indo-European language that was closely related to Illyrian and Messapic, an ancient language of Balkan provenance attested through inscriptions from Iron Age Apulia, southeast Italy.[6][26][8][9] The earlier phase of Albanian is also labelled 'Albanoid' in reference to a "specific ethnolinguistically pertinent and historically compact language group".[7] Whether descendants or sisters of what was called 'Illyrian' by classical sources, Albanian and Messapic, on the basis of shared features and innovations, are grouped together in a common branch in the current phylogenetic classification of the Indo-European language family.[5][6][7][8][9]

Nomenclature of periodization of Proto-Albanian

Vladimir Orel distinguishes the following periods of Proto-Albanian:

  • Early Proto-Albanian (EPA): spoken before the 1st century CE, when Albanian had not yet acquired extensive influence via language contact from Latin/Proto-Romance
  • Late Proto-Albanian (LPA): after extensive Latin contact, with the end of the period seeing contacts between ancient Slavic idioms still close to the Proto-Slavic language, in the 6th and 7th centuries CE. During this period the structure of Proto-Albanian was "shattered" by major changes.[12]

However, another periodization paradigm does exist, and is used by some scholars in the field, such as Ranko Matasović:

  • Pre-Proto-Albanian: essentially equivalent to Vladimir Orel's "Early Proto-Albanian", except that the newer paradigm of Matasović dates Latin/Albanian contact a century earlier, and thus it ends for Matasović in the 1st century BCE rather than the 1st century CE.[27] After this period ends, Latin contact begins to transform the language.
  • Early Proto-Albanian: corresponds to the earlier phases of what is for Orel "Late Proto-Albanian". For Matasović, the period spans the 1st century BCE to the 6th century CE, halting before contact with Slavic idioms begins.[27]
  • Late Proto-Albanian: includes the last two centuries of LPA for Orel, plus most of the unattested period of Old Albanian, halting before Turkish influence begins.[27] In this paradigm, Gheg and Tosk split from Early Proto-Albanian, not Late Proto-Albanian, consistent with our knowledge that the split preceded Slavic contact.
  • Early Albanian: corresponds to the late, Ottoman, phase of Old Albanian in the traditional paradigm, ending in 1800, at which point it transitions to Modern Albanian.[27]

Demiraj, like Matasović and unlike Orel, observes the 5th/6th centuries as a boundary between stages, but instead places the "emergence of Albanian" from its parent after this point, rather than the 14th.[14]

In an Albanian chapter penned by Michiel de Vaan within Klein, Joseph and Fritz' 2018 Handbook of Comparative and Historical Indo-European Linguistics,[10] Demiraj's periods are adhered to. Orel's "Later Proto-Albanian", which is for them also definitively placed before Slavic contact, is referred to as simply "Proto-Albanian" (PAlb) or, in German, Uralbanisch, reflecting the terminology of earlier writing in German.[28][29][30] What is for Orel "Early Proto-Albanian" (EPA), dated definitively before the onset of Latin contact, is for De Vaan, "Pre-Proto-Albanian" (PPAlb); in German, this stage is called Voruralbanisch or Frühuralbanisch.[10] De Vaan also discusses the possibility of breaking Pre-Proto-Albanian into two stages: one before the first Greek loanwords, and one that is after the first Greek loanwords, but before contact with Latin.[10]

This page at present is using the paradigm of Orel.

History of study

Vladimir Orel is one of the main modern international linguists to have dealt with the passage from Proto-Indo-European to Proto-Albanian to Modern Albanian. According to Orel, the study of Proto-Albanian syntax remains in its infancy so there are some limitations to the work. However, there have been developments in the understanding of the historical development of phonetics and vocabulary.[31] Other major work has been done by Eqrem Çabej and Shaban Demiraj as well as by major scholars in the field of Romanian historical linguistics as it relates to Albanian (see Albanian–Romanian linguistic relationship) as well as other Balkan linguists. A large amount of work done on Proto-Albanian is published in German, rather than English.

Phonology

Extensive recent studies on Proto-Albanian phonology have been published by Huld (1984), Beekes (1995), Shaban Demiraj (1996), Bardhyl Demiraj (1997), Orel (2000), Hock (2005), Matzinger (2006), Vermeer (2008), Schumacher (2013), and De Vaan (2018).[10]

At present, this page follows Orel's paradigm for periods of Proto-Albanian, and presents the relationship between the synchronic phonologies of both "EPA" and "LPA" with diachronic relationships to each other and to ancestral Indo-European forms as well as descendant Albanian forms.

Stress

In Early Proto-Albanian, stress was paradigmatic, and behaved according to morphological class,[32] with a base on the first syllable.[13] In different paradigms, the stress pattern was varyingly barytonic, oxytonic, and mobile.[32] Unstressed vowels lost one mora—long vowels were shortened, already short vowels were often deleted. In Later Proto-Albanian, however, a new system of unstressed vowel reduction emerged where *a reduced to *ë while all others were simply deleted (except for post-tonic inlaut vowels, which became *ë).[32] Orel gives the following examples:

  • EPA *dáusas "ram" (sg) > *dauš > ... > modern dash
  • EPA *dáusai "rams" (pl) > *dauši > ... > modern desh
  • EPA *dwáigā "branch" (sg) > *déga > ... > modern degë
  • EPA *dwáigāi "branches" (pl) > *dégai > ... > modern degë

Vowels

Simple vowels in EPA[33]
Front Central Back
High *i • *iː *u • *uː
Mid *e • *eː • *oː
Low *a • *aː
Diphthongs in EPA[33]
Nucleus -i -u
*e *ei *eu
*a *ai *au

Early Proto-Albanian possessed four distinctive short vowels: *a, *e, *i and *u. Proto-Indo-European *o had merged into *a by the Early Proto-Albanian stage. A five-way distinction was maintained for long vowels: *aː, *eː, *iː, *oː and *uː. Early Proto-Albanian also had four diphthongs: *ei, *ai, *eu and *au.

Early Proto-Albanian's vowel inventory began to change as a result of Latin contact. Initially Albanian was resistant to the restoration of short *o as a separate phoneme, with Latin unstressed *o being replaced by *a, and stressed Latin *o being replaced by *u. However, in later loans, Latin *o is maintained in Albanian as *o. Additionally, some Latin loans with short *u saw Latin *u replaced by *o, as well as *ə specifically in unstressed positions before sonorants. In two cases, Orel argues that Latin short /u/ was lengthened in Albanian to /u:/, ultimately to render /y/. On the other hand, whatever effect Ancient Greek loanwords had at their time of absorption is unclear, but diachronically the vowels always agree with regular internal Albanian developments.

LPA simple vowels
Front Back
High *i *u •
Mid (*e)[Note 1] (*o)[Note 2]
Low *a • *å[34]
(Earlier) LPA diphthongs
-u -i -e
u- *ui *ue
i- *ie
e- *eu *ei
a- *au *ai

Late Proto-Albanian

Late Proto-Albanian exhibited *a, *i and *u throughout its development as distinctive short vowels. *o was restored to the phonemic inventory as a result of loanwords where it was increasingly maintained instead of replaced. Although *e was eliminated by breaking to *ie (which would render je and ja), it was restored by the leveling of /ai/ to /e/ and other phenomena that replaced /a/, /ie/, and /ue/ with /e/. The only long vowel preserved in its original form was *iː. *o: was replaced by *ue, *eː was merged into *aː and both were rounded and eventually raised to *o, while *uː merged with the diphthong *ui, ultimately rendering *y. By Late Proto-Albanian, all the original Indo-European diphthongs had now leveled, but new diphthongs were absorbed in loans, and were also innovated by breaking phenomena: *ie, *ue and *ui. *ai in Latin words with AE shared the fate of inherited Early Proto-Albanian *ai, becoming *e, while Latin AU similarly shared the fate of inherited *au and became *a.

Phonemically nasal vowels emerged in Late Proto-Albanian.[35] First, all vowels standing before nasal consonants were nasalized. The following nasal consonant was then lost in certain morphological contexts, while the vowel remained nasalized, resulting in the emergence of LPA phonemes denoted , , , and .[35] Except in certain Gheg varieties, merged into .[35] The traditional view presented by Orel[35] and Desnickaja[36] is that distinctive nasalization was lost by Tosk but retained by Gheg and that this is a taxonomical difference between the two.[35] However this has now been challenged,[36] after Sheper and Gjinari discovered Lab dialects (Lab is a subdialect of Tosk) in the Kurvelesh region that still had distinctive nasal vowels,[36] and Totoni likewise found that the Lab speech of Borsh also still has nasal vowel phonemes.[37] This means that, instead of the traditional view, it is possible that denasalization happened in most Tosk dialects only after the split from Gheg.[36]

Slavic *uː appears to still have been back and round when it was loaned into Albanian, but it is after the diphthongization and resulting fronting of the original Early Proto-Albanian *uː to *y was no longer absorbing new *uː segments, as they are, with only three exceptions, reflected as *u. Slavic *o had already become *a in the Slavic languages that contacted Albanian by the time of contact, and was loaned as *a for the most part; as is reflected also in other non-Slavic languages absorbing these words. After /v/, this *a became *o again in two attested cases: kos ("yogurt", from Proto-Slavic *kvasъ) and vorbë ("clay pot").

It was at the end of the LPA period that length became no longer distinctive in Albanian,[38] although many Gheg and some Lab dialects preserved it and/or re-innovated it. Furthermore, by Old Albanian, all diphthongs had been lost: those ending in -i were all leveled, the -u was lost in those ending in -u, and those ending in -e were converted to glide + vowel sequences; further changes including the frequent effacement of the former first element or otherwise its hardening into an occlusive (typically /v/ for former u-, and gj /ɟ/ for former i-) rendering the former presence of a diphthong rather opaque in many reflexes.

Vowels of late LPA transitioning to Old Albanian
Front Central Back
High *i • *y *u
MId *e *o
Low *a

Diachronic development

This table differentiates short vowels form long vowels with the IPA symbol <ː> being applied to the long vowels.

Specifically contextualized reflex results are placed in parentheses.

Proto-Indo-European developments before Proto-AlbanianEarly Proto-AlbanianLate Proto-AlbanianTosk AlbanianGheg Albanian Example
Latin short /a/ merges with EPA /a/ /a/[31] /a/ /a/[31] /a/[31] PIE *kap- "to seize" > EPA *kapa > kap "to grasp";

Latin APTUM > Alb aftë "capable";

PIE *n̩bʰ(u)lo- > EPA *abula > Alb avull "steam, vapor";[39]

PIE *septm̩ > EPA *septati > Alb shtatë "seven"[39]

*a Proto-Indo-European * and * merge with result of *a[39]
*a > /e/ under umlaut and subsequent analogy[40] /e/ /e/ EPA *albr̩ > elb "barley";

Latin GALBINUS "yellow" > Alb gjelbër "green";

PIE *u̯īḱm̩tī > EPA *wīdžatī > Alb zet "twenty";[39]

PIE *n̩- > EPA *a- > Alb e- (privative prefix)[39]

>/ɑ̃/ before nasals /ə/ <ë> /ɑ̃/ <â, an> EPA *ksanda > Alb hënë ("moon", Gheg: hanë);

Latin CANTICUM > Alb këngë "song"

/ə/ deleted after a stressed syllable[41] IE *bʰolətom > EPA *baltaː > Alb baltë "swamp"
> /e/ after absorption of following laryngeal H_e /e/ /ie/
(> /e/ before *ts, *dz, *nd, *nt, *mb)
/ie/
/je/
/ja/
/ie/
/je/
/ja/
IE *dheHi "to suck"> dheːi > EPA dela > Alb djalë "boy"
> /o/ elsewhere /a/[42] /a/ /a/ /a/ IE *apo "away", "off" > EPA *apa > Alb pa "without"
>/ɑ̃/ before nasals /ə/ <ë> /ɑ̃/ <â, an> IE *sont-s "being" > EPA *san(s) > Alb gjë "thing" (Gheg: gjâ sen/send "thing" or sene/sende "things")
/o/ /o/
(/e/ under umlaut and subsequent analogy[40]) /e/ /e/ IE *ǵʰorios > EPA *darja > Alb derr "pig";

IE *kʷəp- "smoke"(?) > EPA *kapna > Alb kem "incense"(Gheg: kall "burn")

> /ɑ̃/ before nasal /ə/ /ɑ̃/ IE *sont-s "being" > EPA *san(s) > Alb gjë
/e/ /e/ /e/[43] IE *lent- > EPA *lenta: > Alb lëndë "timber" (Gheg: landë)
/ie/
/ie//ie/ IE *bʰer- "to bring, carry" > EPA *berja > Alb bie "to bring"
/je/ /je/ IE *smeḱu > EPA smekraː > Alb mjekër "beard"
je > e after affricates, palatals, and liquids je > e after affricates, palatals, and liquids Alb fle "to sleep"
/ja/ /ja/ IE *esmi > EPA *esmi > Alb jam;


ja > a after affricates, palatals and liquids ja > a after affricates, palatals and liquids IE *seḱs + ti > EPA *seksti > Alb gjashtë "six"
/ie/ > /e/ before *ts, *dz, *nd, *nt, *mb /e/ /e/ IE *en per en tod > EPA *(en) per en ta > Alb brenda
/ie/ + /i/ /i/ /i/ IP *gʷedijos > EPA d͡ʒedija > Alb zi "black"
/e/ before *m followed by sibilant or affricate /i/ /i/ /i/ IE *sem-ǵʰo > EPA *semdza > Alb gjithë "all"
Classical Latin /e/ > EPA /ie/ in "usual" layer > /ie/ in EPA for "usual layer" (not identical to development of inherited /e/ which also went through /ie/) /je/ /je/ Lat VERSUM > Alb vjershë "verse"
>/e/ in various contexts after sh, before ng/nd, etc. /e/ Lat CONVENTUS > Alb kuvend
/ja/ /ja/ Lat HEBDOMAS > Alb javë "week"
/ja/ > /a/ after palatals /a/ Lat SELLA > Alb shalë "saddle"
Unstressed /ei/ in Latin loans /e/ /e/ Lat DEBITU꞉RA > Alb detyrë "duty"
Latin /e/ via an unknown different intermediary /e/ /e/ Lat INFERNUM > Alb ferr "hell"; Lat COMMERCIUM > Alb kumerq "toll, duty"
Latin /e/ loaned into Late Proto-Albanian while it lacked any short /e/ phoneme /i/ /i/ IE *ambʰi > EPA *ambi > Alb mbi "on, upon"; Lat PARENTEM > Alb prind "parent"; PIE t(e)r-m- > Alb trim "brave"
Between *r̥ and C /i/ /i/ /i/
/i/
> /ĩ/ before nasals /i/ /ĩ/ <î> EPA *rinja > Alb rij "to make humid" (Gheg: )
/u/ /u/ /u/ /u/ /u/ /u/ IE *bʰugʰ > EPA *bugta > Alb butë "smooth"
> /ũ/ before nasals /u/ /ũ/ <û> IE *ǵenu "knee" > EPA *ganuna > LPA glûna > Alb gju "knee" (Gheg: gjû)
Latin /o/ raises before nasals[44] Lat MONACHVS > murg "monk", CONTRĀ > kundër "against"[44]
Latin stressed[45] /o/ ultimately merges with PIE *ā elsewhere /o/ /o/ Lat COXA > kofshë "hip",[44] ROTA > rrotë "wheel" [45]
/aː/ /aː/ /aː/ /ɒː/ /o/ /o/ IE maːter "mother > EPA maːter > Alb motër "sister"
/eː/ /e:/ /e:/ everywhere except gliding to /j/ in clusters: /ɒː/ /o/ /o/ IE *meː-kwe > LPA mɒːts > Alb mos "don't"
/o:/ /o:/ /o:/ /we/ /e/ /e/ IE *bʰloːros > EPA bloːra > Alb blertë "green"
/i:/ /i:/ /i:/ /i:/ /i/ /i/ IE *piː- "to drink" > EPA *pi꞉ja > Alb pi "to drink"
/u:/ /u:/ /u:/ /ui/ /y/ /y/, /i/ in certain conditions EPA *suːsa > Alb gjysh "grandfather"
/wi/ > /i:/ at word coda after loss of nominative final s /i/ /i/ IE *suːs "pig" > EPA *tsu꞉s > LPA tθui > Alb thi "pig"
/wi/ > /i:/ after labial /i/ /i/ IE *bʰuː- "to grow" > EPA enbuːnja > Alb mbij "to thrive"
/wi/ > /i/ before labial /i/ /i/ IE *kreup > EPA kruːpaː > LPA krwipa > Alb kripë "salt"
/wi/ > /i/ before j, i, other palatal elements /i/ /i/ IE *dreu "tree" > EPA druːnjaː > drinjë "brushwood"
/ai/ /ai/ /ai/ > /ẽ/ > /ɑ̃/ before nasal /ə/ <ë> /ɑ̃/ <â, an> EPA *laidna > Alb ("to let"; Gheg: )
/e/ /e/ /e/ IE *aidʰos > EPA *aida > Alb ethe "fever"
/oi/ /oi/ IE *ḱloitos > EPA *klaita: > Alb qetë "jagged rock"
/ei/ /ei/ /ei/ /i/ /i/ /i/ IE *ǵʰeimen- "winter" > EPA *deimena > Alb dimër "winter"(Gheg: dimën)
Diphthongs of long vowel + j j elided, long vowel develops regularly
/au/ /au/ /au/ >/ɑ̃/ before nasals /ə/ <ë> /ɑ̃/ <â, an> IE *dreu- "tree" > EPA *draunja: > Alb drënjë
/a/ /a/ /a/ *IE aug- > EPA *auga > Alb ag "dusk"
/a/ > /e/ /e/ /e/ EPA *ausra > Alb err "darkness"
/ou/ /ou/ /a/ /a/ /a/ IE *poujo- > EPA *pauja > Alb pah "scab, dust"
/a/ > /e/ /e/ /e/ EPA *gaura > Alb ger
/eu/ /eu/ /eu/ > /ẽ/ > /ɑ̃/ before nasal /ə/ <ë> /ɑ̃/ <â, an> IE *newn̩ "nine" > EPA *neunti > Alb nëntë ("nine", Gheg: nand)
/e/ /e/ /e/ IE *skeud- "to throw" > EPA *skeuda > Alb hedh

Development of Indo-European sonorants

The nasal sonorants *n̩ and *m̩ both rendered Early Proto-Albanian *a, which remains *a in modern Albanian (PIE *g'hn̩taː "goose" > EPA *gataː > modern Albanian gatë "heron"). Like EPA *a elsewhere, in some cases it was raised to *e, as seen in PIE *ln̩gwh- > EPA *laga > Albanian lehtë (suffixed with -të).[46]

Diachronic development of sonorants[47][48]
Proto-Indo-European Intermediate developments Early Proto-Albanian Later Proto-Albanian Old Albanian Tosk Albanian Gheg Albanian Example
*m̩ *a continue regular developments of *a from EPA in vowel chart. shtatë ‘seven’ < PIE *septm ̥-
*n̩ *a (i) gjatë ‘long’ < PIE *dln ̥gʰ-t-
*l̩ *il before consonant clusters, *i or *j il, li
*ul elsewhere ul, lu
*r̩ *ir before consonant clusters, *i or *j ir, ri
*ur elsewhere ur, ru
*l *l *l *l l l SA lagje ‘quarter’ < PIE *logʰ-
*l *λ (ly/-li)? j

l (Cham/Arbresh/

Arvanitika)

j SA gjuhë vs A/A gljuhë, SA mijë vs Cham milë
*ɫ (V_V) ll

/γ/ (some Arbresh)

/ð/ (some Lab)

ll

/ð/ (some dialects)

SA hell ‘spear, spit’ < PIE *skōl-
*r *r *r (V_V) *r r r SA (i) mirë ‘good’ < PIE *miHr-
*r (V_V) *λ (-ri)? j j SA bij 'sons' < PA *bir-i
*r: (#_) *r: rr rr SA rrjedh ‘to flow’ < PIE *h₃reǵ-
*m *m *m m m SA motër ‘sister’ < PIE *meh₂-tr-
*n *n *n n

r (-n-)

n

ng /ŋ/ (from /ng/)

SA natë ‘night’ <PIE *nokʷt-
*n: (*-sn-, *-Cn-, *-nC-) n

n (-n-)

n

ng /ŋ/ (from /ng/)

Dialectal anë < PIE *h₂ewk(ʷ)- ~ *h₂uk(ʷ)-.
*ɲ (*gn-, before front vowels) nj nj

ni~n (northern)

SA njeri ‘man’ <PIE *h₂nḗr

Consonants

EPA Consonants[49]
Labial Dental Alveolar Palatal Velar
Nasal *m *n
Plosive *p • *b *t • *d *ts • *dz *t͡ʃ • *d͡ʒ *k • *g
Fricative *s • *z *x
Glide *w *j
Lateral *l
Trill *r
LPA Consonants
Labial Dental Alveolar Palatal Velar
Nasal *m *n
Plosive *p • *b *t • *d *ts *c *k • *g
Affricate *t͡ʃ
Fricative *f *θ • *ð *s • *z *x
Glide *w *j
Lateral *l
Trill *r
Diachronic development[48]
Proto-Indo-European Pre-Proto-Albanian Early Proto-Albanian Later Proto-Albanian Modern Alb (Tosk/Gheg) Examples
*s *s *z > *j ɟ ~ d͡ʒ <gj> IE *serp- "to crawl" > EPA *serpena > Alb gjarpër "snake"
*s > *ʃ after *iː, *uː or -i, -u diphthongs ʃ <sh> IE *dʰouso- > EPA *dausa > Alb dash ("ram")
*ʃ word-initially (sometimes) ʃ <sh> SA shi ‘rain’ < PIE *suH-
> t͡s (if next consonant was *s) > *θ θ <th> IE *suːs ("pig") > EPA *t͡suːs > Alb thi ("pig")
>*x intervocalically or between EPA sonorant and vowel h IE *golso- ("sound") > EPA *gulxa > SA gjuhë
Ø IE *nosom > EPA *naxa > Alb na ("us")
*sK *sK *sK *x h SA hedh ‘to throw’ < PIE *skewd-
*sp- *sp- *sp- f- f- SA farë < PIE *spor-
*st *st *st ʃt ʃt SA shteg ‘path, road’ < PIE *stoygʰ-
*sd *[zd] *zd *zd ð dh SA pidh ‘female pudenda’ < PIE *peysd(ʰ)-
*s from Greek, Latin loanwords ʃ <sh> Lat summus > Alb shumë "more", "much"
*p *p *p *p p IE *eːp ("to take") > EPA *eːpa > Alb jap ("to give")(Gheg: jep ep)
*b, *bʰ *b *b *b b IE *serbʰ- ("to suck in") > EPA *serba > Alb gjerb ("to gulp")
*w between a vowel and *u v EPA *abula > Alb avull ("vapor")
*t *t *t *t t IE *trejes ("three") > EPA *treje > Alb tre ("three")
*d, *dʰ *d *d *d d IE *dʰegʷʰ- ("to burn") > EPA *dega > Alb djeg ("to burn")
> *ð intervocalically or between r and vowel,

in 5th or 6th centuries[50]

ð <dh> IE *skeudV- ("to throw, shoot") > Alb hedh ("to throw, shoot")
*ḱ (*c?) *ḱ (*c?) >*t͡s > *θ θ <th> IE *ḱi-ḱer- ("pea"? cf Latin: cicer) > EPA *tsera > Alb thjerrë ("lentil")
> *t͡ʃ > *s before i, j, u, or w s IE *ḱupo- ("shoulder") > Alb sup ("shoulder");

IE *ḱiā dīti > EPA tsja(i) diti > Alb sot ("today")

*t͡s retained, conditions unclear t͡s <c> IE *h₂eḱ- ("sharp") > EPA *atsara > Alb acar "cold/ice cold"

(but doublet: > athët ("sour")

> *t͡ʃ, conditions unclear t͡ʃ <ç> IE *ḱentro- ("to stick") > Alb çandër ("prop")
> *k before sonorant *k k IE *smeḱr- ("chin") > Alb mjekër ("chin, beard")
*ǵ, *ǵʰ (*ɟ?) *ǵ (*ɟ?) *dz dh SA dhëmb, Gheg dhãmb ‘tooth’ < PIE *ǵombʰ-
*dz~d? *ð~d? d SA dorë < PIE *ǵʰesr-
*d͡ʒ before w *z z SA zë, Gheg. zã (< zãn) < PIE *ǵʰweno-
*k *k *k *k k SA kam ‘to have’ < PIE *keh₂p-
*c (palatalised) q SA qaj 'to weep, cry' < PIE *kluH-i̯o-
*g, *gʰ *g *g *g g SA gardh ‘fence’ < PIE *gʰordʰ-
*j (palatalised) gj SA gjej 'to find' < PIE *gʰédni̯e/o-
*kʷ *kʷ? *t͡ʃ before front vowels *s s SA sjell ‘to bring’ < PIE *kʷelh₁-
*k elsewhere *k k SA pjek ‘to bake’ < PIE *pekʷ-
*c (palatalised) q SA që 'that, which' < PIE *kʷṓd
*gʷ, *gʷʰ *gʷ? *d͡ʒ(w) before front vowels *z z SA zorrë ‘gut’ < PIE *gʷʰērn-
*g elsewhere *g g SA djeg ‘to burn’ < PIE *dʰegʷʰ-
*j (palatalised) gj SA gjeth 'leaf' < PIE *gʷos(d)- 'wood'
*y *j *z (#_V) *j gj SA gjem ‘bridle’ < PIE *yom-
Ø (V_V) Ø Ø SA tre ‘three’ < PIE *treyes
*w *w *w (#_V) *w v SA vesh ‘to put on (clothes)’ < PIE *wes-
Ø (V_V) Ø Ø SA ve ‘widow’ < PIE *widʰewh₂

The development of IE dorsal consonants in Albanian

Indo-European languages are traditionally divided into two groups based on the development of the three series of dorsal (often called "guttural") stops, viz. the palatal (*ḱ *ǵ *ǵʰ), velar (*k *g *gʰ) and labiovelar (*kʷ *gʷ *gʷʰ) series. In the "centum" languages (e.g. Italic languages, Germanic languages, Greek) the palatal series has merged with velar series, while the labiovelar series remained distinct; whereas in the "satem" languages (Indo-Iranian languages, Balto-Slavic languages), the labiovelars merged with the plain velars, while the palatals shifted to sibilant consonants.[51]

Many Indo-Europeanists have classified Albanian as a satem language since it has dental fricatives /θ/ and /ð/ as the common reflex of the palatal series, while velar and labiovelar stops in most cases have merged.[52] However, there is clear evidence that all three IE dorsal series remained distinct (at least before front vowels) in Proto-Albanian:[53][54][55]

  • *ḱ > /θ/, *ǵ/*ǵʰ > /ð/
  • *k > /k/, *g/*gʰ > /g/
  • *kʷ > /s/ (before high vowels), /k/ (elsewhere); *gʷ/*gʷʰ > /z/ (before high vowels), /g/ elsewhere.

In the later phonologocal history of Albanian, the velars /k/ and /g/ were subject to further palatalizations.[56]

The (partial) retention of the Proto-IE three-way contrast for dorsal stops is an archaic feature that links Albanian with the wider Paleo-Balkanic group and is shared with Messapic[57] and Armenian.[58]

Classification & isoglosses with other Indo-European branches

The closest language to Albanian is Messapic, with which it forms a common branch titled Illyric in Hyllested & Joseph (2022).[59] Hyllested & Joseph (2022) in agreement with recent bibliography identify Greco-Phrygian as the IE branch closest to the Albanian-Messapic one. These two branches form an areal grouping – which is often called "Balkan IE" – with Armenian.[59] Hyllested & Joseph (2022) identify the highest shared number of innovations between (Proto-)Albanian and (Proto-)Greek.[60]

Innovative creations of agricultural terms shared only between Albanian and Greek, such as *h2(e)lbh-it- 'barley' and *spor-eh2- 'seed', were formed from non-agricultural Proto-Indo-European roots through semantic changes to adapt them for agriculture. Since they are limited only to Albanian and Greek, they can be traced back with certainty only to their last common Indo-European ancestor, and not projected back into Proto-Indo-European.[61] Shortly after they had diverged from one another, Greek, Armenian, and partly Albanian undoubtedly also underwent a longer period of contact (as can be seen, for example, in the irregular correspondence: Greek σκόρ(ο)δον, Armenian sxtor, xstor, and Albanian hudhër, hurdhë "garlic"). Furthermore, intense Greek–Albanian contacts certainly occurred thereafter,[62] and ongoing connections between them have been in the Balkans from the ancient times, continuing up to the present-days.[63]

The deictic element *ḱjā- in PPAlb *ḱjā-dīti > Albanian sot ("today") has the same source as *kjā- in Proto-Greek *kjā-wētes (cf. Mycenean Greek za-we-te, Attic Greek τῆτες, and Ionic Greek σῆτες "this year"). These words are built combining the deictic element and a form of the word for "day" in Albanian (PPAlb *dīti-) and for "year" in Greek (PGk *wētes-). The deictic element resulted from a reanalysis of the word for "today" *kjāmer-, which contains the restricted word for "day" *āmer- (cf. Gk ἡμέρα, Doric Greek ἁμέρα, and Armenian awr). In PPalb only later the word āmer- was replaced by *dīti-, when the latter became the usual word for day in this language.[64] Another remarkable Greek/Albanian isogloss is a very ancient form for "hand": *mər-, cf. the Albanian verb marr ("hold") and the Greek márē ("hand"), and also Greek márptō ("grab").[65]

A common Balkan Indo-European root *aiğ(i)- ("goat") can be reflected in Albanian edh ("goat, kid") < PAlb *aidza and dhi ("nanny goat) < PAlb *aidzijɑ̄ with Greek αἴξ ("goat", gen. αἰγός) and Armenian ayc ("(nanny) goat"). It has been noted that the Balkan IE root and all the alleged Balto-Slavic and Indo-Iranian roots with a meaning "goat" are likely to be not Proto-Indo-European, as they may all originate as independent and relatively early, post-PIE borrowings, from the substrate languages spoken by the sedentary farmers who were encountered by immigrating Indo-European pastoralists. The view of a substrate borrowing can be corroborated by areal words for "goat" in other IE languages, such as Gothic gaits ("goat") and Latin haedus ("kid"), reflecting *gʰaid̯(-o)-, considered as a substrate word usually linked with Semitic languages (cf. Akkadian gadû, Aramaic gaδiā ̄"kid"). However it was most likely not directly borrowed from Semitic, but from a European substrate language that in turn had loaned the word from a common third source. Hence it can be viewed as an old cultural word, which was slowly transmitted to different European languages, and then adopted by the newcoming Indo-European speakers. Within this scenario it should be remarked the exclusive sharing of a common proto-form between Albanian, Greek, and Armenian, which could have been borrowed at a pre-stage that was common to these languages.[66] Specifically Indo-Iranian/Greek/Albanian and Greek/Armenian/Albanian isoglosses are both relatively rare, examples including ndaj (to divide; Indo-Greek-Albanian) and ëndërr ("dream"; Greek/Armenian/Albanian). Whereas Armenian/Albanian isoglosses are "insignificant", there are a considerable number of Indo-Iranian/Albanian isoglosses, which are notably often connected with horses, horse tending, and milk products.[67]

In older literature, Orel (2000) argues that Albanian has a large number of isoglosses that are common to Albanian, Germanic, Baltic and Slavic, as part of a "North Eastern" lexical grouping, with a large number of these referring to wood or objects made out of wood.[68] Orel (1998) noted 24 isoglosses between Balto-Slavic and Albanian, 48 common words between Baltic and Albanian and 24 between Albanian and Slavic. Hyllested & Joseph (2022) review Orel's common items and argue that a substantial number don't have convincing etymologies or do not constitute isoglosses between Balto-Slavic and Albanian. An example is Albanian murg (dark) and Lithuanian margas (colourful) which Orel considers to be isoglosses but both are equally related to Proto-Germanic *murkaz, ancient Greek ἀμορβός amorbos and Proto-Slavic *mergъ.[69]

Orel identifies only one Albanian/Italic/Celtic isogloss, blertë ("green"), cognate to Latin flōrus ("bright") and Irish blár ("gray").[70] Specifically Celtic/Albanian vocabulary was previously thought to be limited although including at least one core vocabulary item (hënë "moon", cognate to Welsh cann "white" and Breton cann "full moon"),[71] but recent work by Trumper in 2018 has proposed a larger though still not overwhelming set, with the notable addition of dritë ("light").[72]

Although knowledge of Tocharian is fragmentary, the one known Albanian/Tocharian isogloss is "very important" as noted by Orel: kush ("who", cognate to Tocharian A kus, with the same meaning).[73]

References

Notes

  1. Absent for early part of period
  2. Absent for early part of period

Citations

  1. Matzinger 2016, p. 6: "Folgende Lautwandel charakterisieren u.a. das Uralbanische (Protoalbanische) und grenzen es dadurch als eine eigenständige idg. Sprache von anderen idg. Sprachen ab. Diese Phase kann präzisiert als Frühuralbanisch bezeichnet werden. Da das Hethitische (im antiken Kleinasien) und das mykenische Griechische schon im 2. Jahrtausend v.Chr. als voll ausgebildete, d.h. individuelle Sprachen dokumentiert sind, kann auch die Vorstufe des Albanischen (das Frühuralbanische) mindestens ab dem ersten Jahrtausend v.Chr. als eine ebenso schon voll ausgebildete, d.h. individuelle Sprache angesetzt werden"
  2. De Vaan 2018, p. 1732: "Internal comparison between the Tosk and Geg dialects allows us to reconstruct a Proto-Albanian stage (PAlb.; in German Uralbanisch; see Hock 2005; Klingenschmitt 1994: 221; Matzinger 2006: 23; B. Demiraj 1997: 41–67; Hamp 1992: 885–902). Additional external information on the development of the phonology is provided by different layers of loanwords, of which those from Slavic (from ca. 600 CE onward) and from Latin (ca. 167 BCE−400 CE) are the most important. Since the main phonological distinction between Tosk and Geg, viz. rhotacism of n, is found in only a few Slavic loanwords in Tosk (Ylli 1997: 317; Svane 1992: 292 f.), I assume that Proto-Albanian predated the influx of most of the Slavic loanwords.;
    Matzinger 2006, p. 41: "Diese Zeitspanne von der Antike bis ca. 600 n.Chr. wird in der Geschichtsschreibung die uralbanische Zeit genannt."
  3. 1 2 Friedman, Victor (2022). "The Balkans". In Salikoko Mufwene, Anna Maria Escobar (ed.). The Cambridge Handbook of Language Contact: Volume 1: Population Movement and Language Change. Cambridge Handbooks in Language and Linguistics. Cambridge University Press. ISBN 9781009115773.
  4. 1 2 Lazaridis, Iosif; Alpaslan-Roodenberg, Songül; et al. (26 August 2022). "The genetic history of the Southern Arc: A bridge between West Asia and Europe". Science. 377 (6609): eabm4247. doi:10.1126/science.abm4247. PMC 10064553. PMID 36007055. S2CID 251843620.
  5. 1 2 Hyllested & Joseph 2022, p. 235.
  6. 1 2 3 Friedman 2020, p. 388.
  7. 1 2 3 4 Trumper 2018, p. 385.
  8. 1 2 3 Ismajli 2015, p. 45.
  9. 1 2 3 Hamp & Adams 2013, p. 8.
  10. 1 2 3 4 5 6 De Vaan 2018, p. 1732
  11. Matasović 2019, p. 6
  12. 1 2 Orel 2000, p. XII
  13. 1 2 Matasović 2019, p. 7
  14. 1 2 Demiraj 2006, p. 483
  15. Fortson 2010, p. 392: "The dialectal split into Gheg and Tosk happened sometime after the region become Christianized in the fourth century AD; Christian Latin loanwords show Tosk rhotacism, such as Tosk murgu "monk" (Geg mungu) from Lat. monachus."
  16. Mallory & Adams 1997, p. 9: "The Greek and Latin loans have undergone most of the far-reaching phonological changes which have so altered the shape of inherited words while Slavic and Turkish words do not show those changes. Thus Albanian must have acquired much of its present form by the time Slavs entered into Balkans in the fifth and sixth centuries AD"
  17. Brown & Ogilvie 2008, p. 23: "In Tosk /a/ before a nasal has become a central vowel (shwa), and intervocalic /n/ has become /r/. These two sound changes have affected only the pre-Slav stratum of the Albanian lexicon, that is the native words and loanwords from Greek and Latin"
  18. 1 2 Chang, Will; Chundra, Cathcart (January 2015). "Ancestry-constrained phylogenetic analysis supports the Indo-European steppe hypothesis" (PDF). Language. 91 (1): 194–244. doi:10.1353/lan.2015.0005. S2CID 143978664. Retrieved 30 September 2020.
  19. Koch, John T. (2020). Celto-Germanic, Later Prehistory and Post-Proto-Indo-European vocabulary in the North and West. University of Wales Centre for Advanced Welsh and Celtic Studies. pp. 24, 50, 54. ISBN 9781907029325.
  20. Coretta, Stefano; Riverin-Coutlée, Josiane; Kapia, Enkeleida; Nichols, Stephen (16 August 2022). "Northern Tosk Albanian". Journal of the International Phonetic Association: 1–23. doi:10.1017/S0025100322000044. hdl:20.500.11820/ebce2ea3-f955-4fa5-9178-e1626fbae15f. Though the origin of the language has been debated, the prevailing opinion in the literature is that it is a descendant of Illyrian (Hetzer 1995).
  21. Matasović 2019, p. 5: "Much has been written about the origin of the Albanian language. The most probable predecessor of Albanian was Illyrian, since much of the present-day Albania was inhabited by the Illyrians during the Antiquity, but the comparison of the two languages is impossible because almost nothing is known about Illyrian, despite the fact that two handbooks of that language have been published (by Hans Krahe and Anton Mayer)... examination of personal names and toponyms from Illyricum shows that several onomastic areas can be distinguished, and these onomastic areas just might correspond to different languages spoken in ancient Illyricum. If Illyrians actually spoke several different languages, the question arises -from which 'Illyrian' language did Albanian develop, and that question cannot be answered until new data are discovered.The single "Illyrian" gloss preserved in Greek (rhínon 'fog') may have the reflex in Alb. (Gheg) re͂ 'cloud' (Tosk re)< PAlb. *ren-."
  22. Beekes 2011, p. 25: "It is often thought (for obvious geographic reasons) that Albanian descends from ancient Illyrian (see above), but this cannot be ascertained as we know next to nothing about Illyrian itself."
  23. Fortson 2010, p. 446: "Albanian forms its own separate branch of Indo-European; it is the last branch to appear in written records. This is one of the reasons why its origins are shrouded in mystery and controversy. The widespread assertion that it is the modern–day descendant of Illyrian, spoken in much the same region during classical times ([...]), makes geographic and historical sense but is linguistically untestable since we know so little about Illyrian."
  24. Mallory & Adams 1997, p. 11: "Although there are some lexical items that appear to be shared between Romanian (and by extension Dacian) and Albanian, by far the strongest connections can be argued between Albanian and Illyrian. The latter was at least attested in what is historically regarded as Albanian territory and there is no evidence of any major migration into Albanian territory since our records of Illyrian occupation. The loan words from Greek and Latin date back to before the Christian era and suggest that the ancestors of the Albanians must have occupied Albania by then to have absorbed such loans from their histori-cal neighbors. As the Illyrians occupied Albanian territory at this time, they are the most likely recipients of such loans."
  25. Villar, Francisco (1996). Los indoeuropeos y los orígenes de Europa (in Spanish). Madrid: Gredos. pp. 313–314, 316. ISBN 84-249-1787-1.
  26. Matzinger 2018, p. 1790.
  27. 1 2 3 4 Matasović 2019, p. 39
  28. Demiraj 1997, pp. 41–67
  29. Matzinger 2006, p. 23
  30. Klingenschmitt 1994, p. 221
  31. 1 2 3 4 Orel 2000, p. 1
  32. 1 2 3 Orel 2000, pp. 20–21
  33. 1 2 Orel 2000, p. 270
  34. Orel 2000, pp. 8–12
  35. 1 2 3 4 5 Orel 2000, pp. 15–16
  36. 1 2 3 4 Paçarizi 2008, pp. 101–102
  37. Totoni 1964, p. 136
  38. Orel 2000, p. 15
  39. 1 2 3 4 5 Orel 2000, pp. 42
  40. 1 2 Orel 2000, pp. 143–144
  41. Orel 2000, p. 3
  42. Orel 2000, pp. 2–3
  43. Orel 2000, pp. 3–4
  44. 1 2 3 de Vaan, Michiel (2018). "The phonology of Albanian". In Klein, Jared; Joseph, Brian; Fritz, Matthias (eds.). Handbook of Comparative and Historical Indo-European Linguistics. Vol. 3. Walter de Gruyter. p. 1735.
  45. 1 2 Orel 2000, p. 27
  46. Orel 2000, p. 42
  47. Orel 2000, pp. 271–272
  48. 1 2 Rusakov 2017, pp. 566–571
  49. Orel 2000, pp. 273–274
  50. Orel 2000, p. 65
  51. Fortson 2010, p. 58–59.
  52. Fortson 2010, p. 449.
  53. Fortson 2010, p. 450.
  54. Rusakov 2017, p. 559.
  55. Hyllested & Joseph 2022, pp. 238–239.
  56. Rusakov 2017, p. 571.
  57. Matzinger 2005, p. 47.
  58. Hyllested & Joseph 2022, pp. 229.
  59. 1 2 Joseph & Hyllested 2022, p. 235.
  60. Joseph & Hyllested 2022, p. 226.
  61. Kroonen et al. 2022, pp. 11, 26, 28
  62. Thorsø 2019, p. 258.
  63. Joseph 2013, p. 7.
  64. Joseph 2013, pp. 15–16.
  65. Bubenik 1997, p. 104.
  66. Thorsø 2019, p. 255.
  67. Orel 2000, pp. 259–260
  68. Orel 2000, pp. 250–251
  69. Joseph & Hyllested 2022, p. 223.
  70. Orel 2000, p. 257
  71. Orel 2000, pp. 256–257
  72. Trumper 2018, p. 379.
  73. Orel 2000, p. 260

Bibliography

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.