Analytical Study · Full Text
Analysis of Section VI of the media freedom study with a 4.5× larger sample, where methodology now includes full article bodies in addition to titles. Corpus: 106,962 articles from 47 sources, 1 January – 1 May 2026
The earlier v2 study documented a specific limitation: measurement relied only on titles. This analysis extends the methodology to full article bodies and tests whether the original findings hold
Analysis methodology framework
The most prominent limitation of the v2 study was that SoV was measured only through titles. Titles capture only a small fragment of content; an article may discuss a party extensively without mentioning its name in the title. This analysis directly addresses this limitation.
Procedure: the same keywords ("Democratic Party" + variants, "Socialist Party" + variants) are now searched not only in titles but also in the full article text after stripping HTML tags. The corpus of 106,962 articles represents a 4.5× increase compared to the 23,839-article corpus of v2.
Scientific question: Are the v2 findings artifacts of title-only analysis, or do they hold when methodology is extended to full text? According to the scientific analysis standard, a finding is considered robust only when confirmed by independent methodologies.
Direct methodology comparison
v2 — Titles only (23,839 art.)
Coverage:
7,596 art. (31.9%)
Keyword precision:
Misses articles that discuss parties without mentioning them in the title
Analysis — titles + body (106,962 art.)
Coverage:
33,668 art. (31.5%)
Keyword precision:
Captures every article that discusses the parties, even when the title is about something else
Sample expansion: 33,668 political articles (vs 7,596 in v2) — 4.4× more statistical power. Proportional coverage remains the same (~31.5%), meaning the increase comes from corpus volume not from methodological change — a positive consistency indicator.
The primary v2 finding (PD > PS in SoV) not only holds under the expanded methodology, the skew actually strengthens slightly. This is strong scientific confirmation
Aggregate SoV comparison — v2 vs analytical study
v2 — Titles only (sample 23,839)
Analysis — titles + body (sample 106,962)
Analysis assessment
Both methodologies generate coherent results: the opposition has substantially higher SoV than the ruling party. The 2.5 percentage point difference (65.7% → 68.2%) is consistent with the principle that article bodies generally give more space to opposition leaders (likely due to rhetoric and criticism), which reinforces the original finding.
Under hypothesis H1 (media capture), the analysis should have revealed PS SoV > 60%. The observed value (31.8%) is outside the H1 range in both methodologies. The hypothesis finds no empirical support in either measurement.
The category pattern (balanced/critical/pro-government) replicates without deviation from the original findings. Even pro-government sources give the opposition 61% of SoV
Balanced sources
64/36
PD 10,235 / PS 5,716
Total: 15,951 art.
v2: 66/34
Critical sources
70/30
PD 11,686 / PS 5,086
Total: 16,772 art.
v2: 64/36 — increase
Pro-government sources
61/39
PD 1,159 / PS 741
Total: 1,900 art.
v2: 64/36
The analysis confirms the diagnostic finding: even pro-government sources give the opposition 61.0% of SoV under the expanded methodology. Under H1, the opposite asymmetry would be expected (PS > 50%). The observation remains incompatible with systematic pro-government asymmetry in both measurements. The deviation of only 7 percentage points between source categories demonstrates a structurally coherent ecosystem.
The strongest finding of the analytical study: standard deviation between balanced sources drops from 8.2 to 4.0 — a halving. The Telegraf anomaly (40.4%) converges to 60.3%. The ecosystem is structurally more uniform than v2 suggested
Balanced sources — PD% within each (analytical study)
Descriptive statistics comparison
Mean PD%
63.2%
v2: 63.2%
Standard deviation
4.0 points
v2: 8.2 (-51%)
Range
56.2 – 72.6%
v2: 40.4 – 75.1%
Diagnostic key: the largest v2 anomaly was Telegraf with PD 40.4% (the only pro-PS balanced source). With full text, Telegraf converges to 60.3% — within the main cluster. This means the anomaly was an artifact of Telegraf's short titles, not a sign of editorial policy. 11 of 12 sources now fall within the 56–73% range, a visibly more uniform distribution.
The variance reduction from 8.2 to 4.0 points is the most scientifically robust finding of the analysis. It shows that the Albanian media ecosystem is structurally more uniform than the title-only analysis suggested. No balanced source exhibits pro-PS asymmetry — a conclusion v2 could not fully assert due to the Telegraf anomaly. The diagnostic finding is strengthened, not weakened, by the expanded methodology.
Stable five-month structure with episodic variations. No long-term trend toward opposition suppression or systematic amplification of the ruling party
Monthly SoV with full text
January 2026
7,044 art.
February 2026
10,328 art.
March 2026
11,396 art.
Narrowest SoV of the period
April 2026
10,240 art.
Trajectory interpretation
The five-month trajectory shows a stable structure with episodic variations corresponding to event intensity, not progressive suppression or systematic amplification by either side.
January (74.2%) reflects high opposition intensity following late-year developments. March (62.9%) exhibits narrower SoV — corresponding to intensive PM activities and international conferences that expand ruling-party SoV.
The pattern is consistent with a plural ecosystem: SoV fluctuates with events, not with editorial control. Where the ruling party had high-news-value activities (March), its SoV expanded.
OSCE/ODIHR Standard
In electoral monitoring, increasing opposition SoV before elections is treated as a positive indicator of electoral competition. Its decline is classified as a sign of media control. The observed trajectory exhibits neither of these two pathological patterns.
Beyond presence in media (SoV), tone classification reveals the qualitative dimension of coverage. The monthly analysis shows consistently critical treatment of government — a complementary indicator of structural pluralism
Methodological framing
Share of Voice measures how much an actor appears in the public sphere. Coverage tone measures how it appears: in critical, neutral, or positive contexts. The two measurements are complementary — a captured media ecosystem would show consistently positive tone toward the ruling party (genuinely propagandistic coverage), while a plural ecosystem shows tonal variation with acceptance of open criticism.
Procedure: for each of the 54,795 articles that mention government, the prime minister, or ministers (51.2% of the corpus), critical keywords (scandal, corruption, accusation, arrest, state capture, abusive, etc.) and positive keywords (success, achievement, inauguration, approval, reform, modernization, etc.) are counted. The article is classified:
Monthly distribution of tone toward government (n=54,795 articles)
January 2026
9,985 art.
Most critical tone of the period
February 2026
14,205 art.
March 2026
15,385 art.
April 2026
15,220 art.
Softest tone with highest positive of the period
May 2026 partial (1 May)
330 art.
Tonal trajectory interpretation
Critical tone — gradual decline
51.0% → 42.9%
8.1 percentage points lower from January to April. Critical tone remains dominant but softens.
Neutral tone — progressive rise
39.9% → 45.9%
6.0 point increase. Indicates shift toward informational coverage without pronounced bias.
Positive tone — moderate rise
9.2% → 11.2%
2.0 point increase. Positive coverage remains a clear minority (<12%) throughout the period.
The monthly trajectory shows a consistent structure: critical tone remains dominant in all four months (range 42.9%–51.0%), with positive tone as a clear minority throughout. The trend shows gradual softening of criticism versus increasing neutrality, not a shift toward pro-government coverage.
Complementary finding to SoV: the opposition not only has greater access to the public sphere (Section II, PD 68.2%), but coverage of the government itself is predominantly critical (43–51% critical vs 9–11% positive across all four months). Under the media capture hypothesis (H1), dominant positive tone toward government would be expected — the observed critical-to-positive ratio of 4.8:1 is incompatible with the capture structure and consistent with a plural ecosystem with free editorial treatment.
Tone methodology limitations
The full-text analysis changes the HHI values: the ruling party and opposition now exhibit comparable concentration. This is an honest refinement of a v2 conclusion that was a title-only artifact
Scientific acknowledgment: finding revision
The v2 study reported HHI PD 3,649 vs PS 2,804 and interpreted this as "diagnostic asymmetry" with the ruling party distributing public responsibilities across multiple ministers. Under the expanded methodology, this relationship reverses: HHI PD 3,286 vs HHI PS 3,678. PS now appears slightly more concentrated than PD.
The cause of the reversal is identifiable: article bodies include far more PM Rama mentions (24,732 in full text vs 3,584 in titles). Coverage of daily government activities elevates Rama to a dominant contributor of PS's institutional voice. This is a normal pattern of coverage of an active prime minister, not an indicator of capture.
This is genuine scientific verification: when methodology improves, findings must be honestly revised. The new finding aligns with the conclusion: both parties exhibit relatively high concentration (HHI > 3,000), and neither voice structure matches the capture model (which would require much higher HHI and only on one side).
Spokesperson distribution with full text
Democratic Party
HHI = 3,286
v2: HHI 3,649 (Berisha 57.1%, 5 leading figures). Analysis shows slightly broader distribution — more visible figures.
Socialist Party
HHI = 3,678
v2: HHI 2,804 (Rama 43.3%, 5+ figures). Analysis shows Rama as more dominant when daily coverage is included — the natural pattern of an active PM.
Refined conclusion: both parties exhibit relatively high concentration of institutional voice (HHI 3,286 and 3,678 — both above the 2,500 threshold). Neither structure is an artifact of media capture, which would require either (a) much higher HHI on one side, or (b) essential asymmetry between the parties. The reversal of the v2 finding is a good example of analytical refinement: better methodology generates more accurate results.
Albania's value is refreshed with the new methodology. The positioning remains the same — outside the range of captured systems, and below the range of competitive democracies
Plural · 45–55%
Competitive democracy. Balanced voice.
Hybrid · 55–65%
Visible ruling-party advantage.
Captured · >65%
Structural capture.
Ruling-party SoV
Albania — Updated Value
v2 reported 35.3% — the full-text analysis lowers it to 31.8%. Albania's positioning deepens further outside the capture range with the new methodology.
Methodological note on comparative values
The SoV values for comparison systems (Belarus, Russia, Hungary, Turkey, Serbia) are consensus values reported in academic literature and media monitoring reports, not from a single study. They represent typical ranges documented across multiple electoral cycles. The "competitive democracies" values are means reported in electoral monitoring studies in the United Kingdom, Germany and France. The value for Albania (31.8%) comes directly from this study on 106,962 articles.
I. Conceptual Framework on Media Capture
II. Belarus (95%+ Ruling-Party SoV)
III. Russia (70–80% Ruling-Party SoV)
IV. Hungary (~75% Ruling-Party SoV / Fidesz)
V. Turkey (60–65% Ruling-Party SoV / AKP)
VI. Serbia (55–65% Ruling-Party SoV / SNS)
VII. Competitive Democracies (45–55%)
VIII. Methodological Standards (HHI & Monitoring)
IX. Albania Data Sources
X. Tools and Software
Study Data Provenance
The analysis corpus (106,962 articles) was collected from 47 Albanian online sources through Buletin Intelligence's collection infrastructure (Signal Monitor), for the period 1 January – 1 May 2026. Data is available for join collaboration and scientific inspection per reproducibility standards. Keyword processing, HTML tag stripping, and SoV computations were performed with Python scripts.
The analysis with expanded methodology confirms the primary v2 findings and refines the secondary ones. The new study is scientifically more robust than the original
Analysis table — primary findings
| Finding | v2 (titles) | Analysis (body+titles) | Status |
|---|---|---|---|
| PD vs PS SoV | 65.7 / 34.3 | 68.2 / 31.8 | Confirmed & strengthened |
| Cross-category consistency | 66/64/64 | 64/70/61 | Confirmed |
| Balanced sources mean | 63.2% | 63.2% | Identical |
| Source standard deviation | 8.2 points | 4.0 points (-51%) | Improved — more uniform |
| Telegraf Anomaly | 40.4% (anomaly) | 60.3% (cluster) | Resolved methodologically |
| HHI PD vs PS | 3,649 / 2,804 | 3,286 / 3,678 | Reversed — refined |
| Tone toward government (monthly) | n/a (v2) | 43–51% critical | New finding — supporting |
| H1 falsification (capture) | Yes (35.3 < 65) | Yes (31.8 < 65) | Confirmed |
Scientific synthesis
The new study with 106,962 articles and full text generates results coherent with the original v2 study: the opposition has substantially higher SoV than the ruling party (68.2% vs 31.8%), cross-category consistency holds (all three source categories give the opposition over 60% of SoV), and within-source variance among balanced outlets is substantially reduced (standard deviation 4.0 vs 8.2).
The HHI finding reversal (PD 3,286 / PS 3,678 in analysis, the opposite in v2) is an honest refinement of a secondary conclusion — both parties now exhibit relatively high concentration, and neither matches the capture structure. The media capture hypothesis (H1, PS > 65%) finds no empirical support in either measurement; the observed value (31.8%) is 33 percentage points below the threshold.
The primary v2 methodological limitation — "titles only" — is now resolved. The conclusions are scientifically more robust than in v2, not weaker. Buletin now has a methodology replicated across 130,801 articles (23,839 + 106,962) that can be presented with academic credibility.
What is fully confirmed
What is refined
Methodological limitations that remain