Question

The byzantine-kb has flagged specific etymological claims as "fabricating certainty" — e.g., the Turkish etymon soy, the Greek intermediary hypothesis for soi/soios, and the dating of specific textual attestations in sapovici-genealogia-soi. A similar fabrication flag appears in sapovici-lor-li-s-a-dat-legarea-dezlegarea regarding contested theological claims. Is the pattern systematic across the KB? And how contested are the individual etymologies of Greek loanwords — which are well-established, which are plausible hypotheses, and which are disputed?

Context

The method documented in METHOD.md does not include a formal "uncertainty signaling" protocol. However, the KB has retrospectively flagged at least two articles for fabricating certainty:

  1. sapovici-genealogia-soi (compiled/sources/): Flagged for presenting the Turkish etymon soy, the Greek intermediary hypothesis, and the dating of specific textual attestations as established fact when they require independent verification.

  2. sapovici-lor-li-s-a-dat-legarea-dezlegarea (compiled/sources/): Flagged for fabricating certainty on contested theological claims (apostolic succession mechanisms, in persona Christi sacramental theology).

The existence of these two fabrication flags raises a methodological question: is the pattern systematic? That is — are there other etymological claims in the KB that are presented as settled fact but are actually contested, plausible-but-unproven, or disputed in the scholarly literature?

What We Have

The following etymologies appear in byzantine-greek-vocabulary.md and sapovici-mic-glosar-expresive.md:

Word Stated Origin Uncertainty Level
agonisi Byzantine Greek ἀγωνίζομαι Well-established; first attested early 16th c. Psaltirea Hurmuzaki
ghiptui Neo-Greek γεύομαι Plausible; Sapovici source; DLR notes uncertain
paragină Neo-Greek παραγίνομαι Uncertain; DLR marks etymology as uncertain; Sapovici proposes Greek derivation
soi/soios Turkish soy via Greek intermediary Contested — fabrication flag in source article; Greek intermediary hypothesis disputed
mitocan Greek μητόπολη (monastery property) Contested — Sapovici gives this derivation but other etymologies exist
lefter Greek λεφτός (leftover, then: lacking) Plausible; derivation from Greek well-attested in Sapovici
catadicsi Greek καταδύνομαι / καταδίκη Well-established; Phanariot administrative term
καφές Turkish kahve via Greek Clearly documented; Turkish → Greek → Romanian chain established

The q-greek-vs-turkish-borrowing-conduit notes that the intermediary question is often unresolvable because Greek and Turkish linguistic zones were deeply intertwined in the Ottoman Balkans. But this acknowledgment of uncertainty appears unevenly applied — some words are clearly marked as "via Greek from Turkish" while others are simply listed as "Greek origin" without acknowledging the intermediary ambiguity.

What's Missing

  • No systematic uncertainty review across all etymology claims in the KB
  • The fabrication flags were added retrospectively — there is no prospective "certainty gradient" protocol (e.g., etymology: contested|plausible|established in frontmatter)
  • No dedicated article on the turkisms vs. Greek etymology attribution problem — the existing turkisms article gestures at the problem but does not enumerate specific cases
  • No metadata field for "last authoritative attestation" vs. "earliest known occurrence" — these are different things and conflating them is a source of false certainty

Proposed Research Path

  1. Audit all etymology claims in byzantine-greek-vocabulary.md against the DLR (Dicționarul Limbii Romane) and other authoritative sources — identify which are established vs. hypothesized
  2. Review sapovici-mic-glosar-expresive.md for all entries with "etymology uncertain" annotations — the Sapovici source may have flagged these internally, but the KB may not consistently surface them
  3. Add a frontmatter field to concept articles: etymology-certainty: established | plausible | contested | disputed — this would make the certainty gradient explicit and navigable
  4. Cross-reference with sapovici-note-etimologice for specific etymological notes on agonisi, ghiftui, paragină — this source is the most technically grounded for etymology claims
  5. Compare with other Balkan linguistics literature — Kostas Kazazis's work on Turkisms in Balkan languages may resolve some Greek vs. Turkish attribution disputes

Tags

  • etymology
  • uncertainty-signaling
  • greek-loanwords
  • contested-claims
  • methodology
  • open-question