✦ Research-Backed Implementation

Reinforcement
Learning that
actually ships

25+ years of RL research applied to real business problems. We bridge the gap between academic theory and production deployment — depth consultancies can't match, velocity academics never deliver.

Typical engagements start at $250/hr · Project work from $50K

🧠
DataWorks Engagement Reinforcement Learning Consulting
Problem scoped & formalized MDP formulation, state/action space, reward signal design
Algorithm selected & benchmarked PPO vs. SAC vs. DDPG — justified for your environment
Model trained, evaluated, deployed With convergence guarantees and failure mode analysis
Team upskilled Your engineers understand what they're maintaining

Hourly rate from $250 / hr
25+
Years of RL Research
Since 1998
70+
Peer-Reviewed Publications
Nature, PNAS, Psych. Review
6,500+
Citations
Google Scholar
100%
Research-Backed Work
No heuristics, no guesswork
Services

What we do for you

Four core offerings, each grounded in formal methodology. Expand any card for engagement models, pricing, and delivery details.

⚙️

Decision AI Systems

Autonomous decision-making for complex, sequential business processes. Systems built on peer-reviewed algorithms that mimic decision-making in biological brains — improving over time without manual re-tuning.

Sequential Decision-Making Bandit Algorithms Adaptive Optimization
What this covers
  • Drug discovery optimization — adaptive experimental design to accelerate compound screening and dosing studies
  • Portfolio management — multi-asset sequential allocation under uncertainty, beyond mean-variance optimization
  • Autonomous systems — real-time decision agents for robotics, logistics routing, and adaptive control
  • Pricing engines — dynamic pricing with contextual bandits that learn from market feedback
  • Supply chain optimization — inventory and routing decisions that adapt to demand shifts without retraining

Research foundation

Our Decision AI algorithms are grounded in 25+ years of peer-reviewed research on how biological systems make sequential decisions under uncertainty. We bring formal mathematical rigor to problems where off-the-shelf RL frameworks make hidden assumptions that break in production. View publications →

Engagements scoped and priced per project. Typical range: $75K–$250K depending on system complexity.

🔬

Data Science & Analytics

Research-grade data science: experimental design, statistical modeling, and cross-source data curation. We find what your current methodology is missing — and build the infrastructure to get it right.

Experimental Design Statistical Modeling Cross-Source Curation
What we deliver
  • Experimental design — valid study design that produces statistically interpretable results, not just movement on a metric
  • Statistical modeling — formal models chosen for your data structure, not defaults from a library
  • Cross-source data curation — harmonizing incompatible datasets from multiple sources into a single analyzable schema
  • Model auditing — independent research-grade evaluation of existing models; we find the flaws before your customers do
  • Remediation roadmaps — specific, actionable next steps, not a list of concerns

Track record

In a published case study, we harmonized 108 experiments from 14 species across dozens of independent labs into a single unified model — achieving mean correlation 0.88 and 5.9% MAE with no per-study tuning. The same methodology applies to multi-vendor benchmarks, cross-geography analytics, and any environment where data sources weren't designed to talk to each other.

Audit engagements start at $15K. Full analytics projects scoped per requirements.

🧭

AI Strategy Advisory

Strategic guidance for companies evaluating whether RL and ML belong in their product. You get a 25-year expert on call — not to write code, but to make sure the direction is right before you spend the budget.

RL Feasibility Assessment Architecture Review Vendor Evaluation
Monthly Retainer
AI Strategy Advisory
$5K / month · cancel anytime

8 hours per month of dedicated strategic access. Direct line to Stefano Ghirlanda, PhD for architecture decisions, vendor assessments, team interviews, and technology roadmap review.


Who this is for
  • VPs of Engineering or Data evaluating whether reinforcement learning fits a specific use case before committing headcount
  • Heads of Product mapping AI capabilities to product strategy for a 12–24 month roadmap
  • CTOs at growth-stage companies building an ML function for the first time and needing a senior sounding board
  • Teams evaluating vendors who want an independent expert to review proposals and separate signal from sales pitch

What's included
  • Monthly strategy session (60–90 min video call)
  • Async Q&A via email between sessions
  • Architecture and code review (up to 3 PRs or documents per month)
  • Priority access when time-sensitive decisions arise

Most advisory clients convert to project work within 3–6 months. The retainer is intentionally structured so you can move from strategy to implementation without switching providers.

The Researcher

Stefano Ghirlanda, PhD

25+ years building the mathematical foundations that other researchers cite — with 70+ peer-reviewed publications cited over 6,500 times.

Computational modeling, neural networks, reinforcement learning, and the formal study of how systems learn from experience. Not as a practitioner following frameworks — as an architect who built some of them.

📄
70+ Peer-Reviewed Publications Nature, PNAS, Psychological Review, Neural Networks
📊
Cited 6,500+ Times Google Scholar verified impact
🧠
25+ Years of RL Research Formal models of associative learning and reward-based optimization
Reinforcement Learning

Formal models of associative learning and reward-based policy optimization. Research that predates and informs modern deep RL — 20+ years of primary contributions.

Computational Modeling

Quantitative models of complex adaptive systems. The same mathematics that describes biological learning also describes how machines should learn from feedback.

Neural Networks

Architecture, training dynamics, and generalization. Research-grade understanding of why neural networks succeed and fail — not just how to run them.

Applied Methodology

Bridging formal statistical theory and real-world data science practice. The difference between models that look good in papers and models that work in production.

Why DataWorks

Research depth.
Implementation velocity.

Most RL consultancies are one of two things: academics who've never shipped production code, or engineers who've never read a paper. DataWorks is neither — and both.

We don't just recommend an algorithm and leave. We stay until it works, explain why it works, and make sure your team can maintain it.

Research Firms
DataWorks
Dev Shops
RL expertise
~
Ships production code
Formal methodology
Understands your domain
~
~
Explains the work
~
Past Work

Selected projects

Research-backed implementations for real commercial problems — performance engineering, model auditing, predictive analytics, and cross-source data engineering.

🔬
Model Auditing

Auditing Inflated Cognitive Claims with a Simpler Model

Bottom Line

Benchmark auditing applicable to LLM evaluation, AI due diligence, and stress-testing "the model can reason" claims.

58 Experiments Audited 88% Variance Explained 14+ Studies
Situation Published literature claimed animals use "intuitive statistics" — a cognitively complex ability requiring high-level reasoning.
Task Independently audit 58 experiments across 14+ studies in animal cognition.
Action Built a minimal associative-learning model to test whether simpler mechanisms could explain the data — without any per-study tuning.
Result Reproduced every result in the literature; explained 88% of variance on average. Claims of "intuitive statistics" were not warranted.
Ghirlanda & Mendoza, Psychological Review (in press) · osf.io/8wjdb
📈
Predictive Analytics

Cultural Trends as Leading Indicators

Bottom Line

Diffuse cultural-choice data as leading indicators for brand sentiment, product reception, and public-figure risk — decades before traditional polling catches up.

135 Years of Records 90%+ Name Collapse 630 Names Analyzed
Situation Traditional polling missed major public-sentiment shifts — evidenced most sharply by the 2016 forecasting failures.
Task Test whether diffuse cultural-choice data encodes public sentiment earlier than polls do.
Action Analyzed 100% of U.S. Social Security name records (1880–2015). Found "Hillary"/"Hilary" collapsed 90%+ after 1992 — the most extreme cycle-skew among 630 comparable names.
Result Effect crossed party lines and was visible in the late 1990s — decades before 2016 polling failures.
Ghirlanda, Cliodynamics 8(1) · doi.org/10.21237/C7clio0033703
🔗
Data Engineering

Cross-Source Data Curation and Unified Modeling

Bottom Line

Fragmented multi-source evidence → defensible schema → single model → actionable verdict. Directly applicable to multi-vendor benchmark audits and cross-study synthesis.

108 Experiments Harmonized 0.88 Mean Correlation 14 Species, 5.9% MAE
Situation Behavioral science evidence is fragmented across dozens of independent labs with incompatible formats and no shared schema.
Task Harmonize 108 experiments and 1,540 data points from 14 species across dozens of labs into a single analyzable dataset.
Action Built per-source ingestion pipelines and one unified 3-parameter model — no per-study tuning, no special-casing.
Result Mean correlation 0.88, mean absolute error 5.9% across 68 datasets.
Ghirlanda, Lind & Enquist (2017), Royal Society Open Science 4: 161011
Pricing

Straightforward rates

No retainer traps. No surprise overruns. We scope tightly and stick to it.

Hourly Consulting

$250/hr
Minimum 4-hour block

Architecture review, technical advisory, model evaluation, or any focused work that doesn't require a full project scope. Good for getting unstuck fast.

Book time →

Project Engagement

$50K+
Fixed-scope, milestone-based

Full RL system design and implementation. We define the scope, agree on milestones, and deliver. Typical engagements run 6–16 weeks depending on complexity.

Get a scope →

Ongoing Advisory

$5K/mo
Monthly retainer, cancel anytime

8 hours/month of dedicated access. Code review, architecture guidance, and a direct line to a 25-year RL expert when your team needs backup.

Start a conversation →
Work With Us

Tell us about your project

We respond within one business day. If the project is a good fit, we'll set up a 30-minute call to scope it properly.

Inquiry received

We'll review your project and respond within one business day.