Methodology — Avarieux

Every public-record source Avarieux surfaces has a peer-reviewed paper establishing why it carries research value. We didn't invent these findings; we make the underlying source legible at consumer pricing. The papers below are why the source is worth showing — not a claim about what any individual event will do.

Lazy Prices

2020

Cohen, Malloy, Nguyen · Journal of Finance

Finding
Year-over-year changes in 10-K and 10-Q language are associated with subsequent stock returns. Companies that materially modified their filings underperformed companies with stable language by ~30 bps/month.

Why it matters here
The basis for surfacing filing-language changes as a documented public event. We show the section-level diff and cite the filing; the interpretation is yours.
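
A minimal Python sketch of what a section-level diff can look like, assuming each filing has already been split into named sections. The function names and the sample text are hypothetical and are not Avarieux's implementation; cosine similarity over word counts is only one of several similarity measures used in the filing-language literature.

```python
import difflib
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def cosine_similarity(old: str, new: str) -> float:
    """Cosine similarity of the two texts' word-count vectors."""
    a, b = Counter(tokenize(old)), Counter(tokenize(new))
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def section_diff(old_sections: dict[str, str], new_sections: dict[str, str]) -> dict[str, dict]:
    """Compare sections by name: similarity score plus added and removed words."""
    report = {}
    for name in sorted(old_sections.keys() | new_sections.keys()):
        old = old_sections.get(name, "")
        new = new_sections.get(name, "")
        old_tokens, new_tokens = tokenize(old), tokenize(new)
        matcher = difflib.SequenceMatcher(a=old_tokens, b=new_tokens, autojunk=False)
        added, removed = [], []
        for tag, i1, i2, j1, j2 in matcher.get_opcodes():
            if tag in ("replace", "delete"):
                removed.extend(old_tokens[i1:i2])
            if tag in ("replace", "insert"):
                added.extend(new_tokens[j1:j2])
        report[name] = {
            "similarity": cosine_similarity(old, new),
            "added": added,
            "removed": removed,
        }
    return report


# Hypothetical section text, for illustration only.
last_year = {"Risk Factors": "We face competition from other firms."}
this_year = {"Risk Factors": "We face intense competition and litigation from other firms."}
for section, result in section_diff(last_year, this_year).items():
    print(section, round(result["similarity"], 3), result["added"], result["removed"])
```

The sketch reports what changed and by how much; it attaches no interpretation to the change.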

When Is a Liability Not a Liability? Textual Analysis, Dictionaries, and 10-Ks

2011

Loughran & McDonald · Journal of Finance

Finding
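Domain-specific sentiment lexicons, not general-purpose ones, are required for finance text. In the authors' sample of 10-K filings, almost three-fourths of the words counted as negative by the general-purpose Harvard dictionary are not typically negative in a financial context.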

Why it matters here
Why we don't pipe SEC text through off-the-shelf sentiment APIs. Filing-language work requires finance-domain vocabulary or it returns noise.
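
To make the point concrete, here is a small Python sketch that counts "negative" words in a sentence under two lexicons. Both word lists are tiny, hypothetical stand-ins written for this example; they are not the Harvard dictionary or the Loughran-McDonald word lists. The words placed in the generic list (tax, cost, capital, board, liability, foreign, vice) are the kind the paper cites as ordinary in filings yet negative in a general-purpose dictionary.

```python
import re

# Toy lexicons for illustration only; NOT the real dictionaries.
GENERIC_NEGATIVE = {
    "tax", "cost", "capital", "board", "liability", "foreign", "vice",
    "loss", "litigation",
}
FINANCE_NEGATIVE = {"loss", "litigation", "impairment", "restated", "default"}


def negative_words(text: str, lexicon: set[str]) -> list[str]:
    """Return the words in `text` that the lexicon flags as negative, in order."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t in lexicon]


# A neutral, boilerplate-style sentence (hypothetical).
sentence = "The board approved a tax liability reserve and capital cost plan."

print(negative_words(sentence, GENERIC_NEGATIVE))
print(negative_words(sentence, FINANCE_NEGATIVE))
```

Under the toy generic list, five of the sentence's eleven words are flagged; under the toy finance list, none are. That gap is the noise a general-purpose lexicon adds to filing text.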

Decoding Inside Information

2012

Cohen, Malloy, Pomorski · Journal of Finance

Finding
Insider trades (Form 4) are associated with future returns when filtered by who is trading and by whether the trade is routine. Non-routine trades by senior insiders were linked to ~7% abnormal annual returns.

Why it matters here
Why Form 4 filings are a documented public event worth surfacing — with the routine vs non-routine distinction shown, not editorialized.
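
The routine vs non-routine split can be sketched in a few lines of Python. This is a simplified reading of the paper's idea, assuming an insider counts as routine when they traded in the same calendar month in each of the three preceding years. The function name, the labels, and the "unclassified" fallback are assumptions made for this example, not Avarieux's implementation.

```python
from datetime import date


def classify_insider(trade_dates: list[date], as_of_year: int, lookback: int = 3) -> str:
    """Label one insider as 'routine', 'opportunistic', or 'unclassified'.

    Assumption: 'routine' means at least one calendar month in which the
    insider traded in every one of the `lookback` years before `as_of_year`.
    """
    months_by_year: dict[int, set[int]] = {}
    for d in trade_dates:
        months_by_year.setdefault(d.year, set()).add(d.month)

    prior_years = range(as_of_year - lookback, as_of_year)
    if any(year not in months_by_year for year in prior_years):
        return "unclassified"  # not enough trading history to judge

    shared_months = set.intersection(*(months_by_year[year] for year in prior_years))
    return "routine" if shared_months else "opportunistic"


# Hypothetical trade histories.
every_march = [date(2019, 3, 12), date(2020, 3, 10), date(2021, 3, 15)]
irregular = [date(2019, 1, 7), date(2020, 6, 22), date(2021, 9, 3)]

print(classify_insider(every_march, as_of_year=2022))
print(classify_insider(irregular, as_of_year=2022))
```

The label is a description of past trading patterns shown next to the filing; it says nothing about what a given trade will do.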

In Search of Attention

2011

Da, Engelberg, Gao · Journal of Finance

Finding
Search-frequency spikes (Google Trends) preceded price moves on small-cap and IPO stocks in the sample studied.

Why it matters here
The academic basis for treating public attention across sources as a documented event worth recording — not a prediction.

Giving Content to Investor Sentiment: The Role of Media in the Stock Market

2007

Tetlock · Journal of Finance

Finding
Media tone (pessimism in WSJ columns) was associated with short-term price movements that reverted within a week in the sample.

Why it matters here
Why we treat a news event as a short-window fact with its timestamp and source, and never as a standing signal.

Prediction Markets

2004

Wolfers & Zitzewitz · Journal of Economic Perspectives

Finding
Prediction markets aggregated dispersed information efficiently for binary, time-bounded outcomes across the 50+ markets studied.

Why it matters here
Why a prediction-market price is a documented public data point worth surfacing alongside filings — shown as a quoted number with its source.

The Cross-Section of Expected Stock Returns

1992

Fama & French · Journal of Finance

Finding
Beta alone does not explain the cross-section of returns. Size and value factors capture most of what beta misses.

Why it matters here
Why, where we report risk context, we report several factors rather than a single number.

The Limits of Arbitrage

1997

Shleifer & Vishny · Journal of Finance

Finding
Mispricing can persist longer than arbitrageurs can stay solvent. Information edges decay over hours-to-days, not seconds.

Why it matters here
Why Avarieux is a research-and-discovery tool, not an execution product. We surface what's public; we don't trade.

How this is built.

Academic literature we build on

Evaluation principles

Random sampling, not event-selected

Point-in-time data only

Out-of-sample, walk-forward

Stated limits

What we explicitly do not claim
