25+ years of RL research applied to real business problems. We bridge the gap between academic theory and production deployment — depth consultancies can't match, velocity academics never deliver.
Typical engagements start at $250/hr · Project work from $50K
Four core offerings, each grounded in formal methodology. Pick one engagement or stack them — we scope everything to fit your timeline and budget.
End-to-end reinforcement learning for your business problem. We formalize your environment as an MDP, select and tune the right algorithm, train the policy, and deploy it to production. You get working code, full documentation, and a team that understands it. We use industry-standard algorithms alongside our own peer-reviewed methods that mimic biological intelligence.
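To make the "formalize your environment as an MDP" step concrete, here is a deliberately tiny sketch: a 5-state corridor defined by its states, actions, transitions, and rewards, solved with tabular Q-learning. The environment and every name in it are illustrative, not a client system or our proprietary algorithms.

```python
import random

# Toy example: a 5-state corridor MDP solved with tabular Q-learning.
# Illustrative only: a real engagement formalizes the client's own
# states, actions, transitions, and rewards before choosing an algorithm.
N_STATES = 5
ACTIONS = (-1, +1)          # move left / move right
GOAL = N_STATES - 1

def step(state, action):
    """Transition: clamp to the corridor; reward 1.0 on reaching the goal."""
    nxt = min(max(state + action, 0), GOAL)
    return nxt, (1.0 if nxt == GOAL else 0.0), nxt == GOAL

def train(episodes=300, alpha=0.1, gamma=0.9, eps=0.1, seed=0):
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}

    def greedy(s):
        best = max(q[(s, a)] for a in ACTIONS)
        return rng.choice([a for a in ACTIONS if q[(s, a)] == best])

    for _ in range(episodes):
        s, done = 0, False
        while not done:
            a = rng.choice(ACTIONS) if rng.random() < eps else greedy(s)
            s2, r, done = step(s, a)
            # One-step Q-learning update: bootstrap from the best next action.
            target = r + gamma * max(q[(s2, a2)] for a2 in ACTIONS)
            q[(s, a)] += alpha * (target - q[(s, a)])
            s = s2
    return q

q = train()
policy = [max(ACTIONS, key=lambda a: q[(s, a)]) for s in range(N_STATES)]
```

Swapping the corridor for a real pricing or allocation problem changes `step` and the state/action sets; the formalization discipline stays the same.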
Autonomous decision-making for complex, sequential business processes. Pricing engines, supply chain optimization, adaptive recommendations, resource allocation. Systems that improve over time without manual re-tuning. Our peer-reviewed algorithms mimic decision-making in the brain.
Independent, research-grade evaluation of your existing models and methodology. We find the flaws, quantify the risk, and give you a remediation roadmap. Useful before a major launch, a fundraise, or when something is inexplicably underperforming.
You have a model. It works, but not well enough. We benchmark it rigorously, identify the failure modes, and optimize performance — whether that means better features, better training, or replacing the algorithm entirely.
25+ years building the mathematical foundations that other researchers cite — with 70+ peer-reviewed publications cited over 6,500 times.
Computational modeling, neural networks, reinforcement learning, and the formal study of how systems learn from experience. Not as a practitioner following frameworks — as an architect who built some of them.
Formal models of associative learning and reward-based policy optimization. Research that predates and informs modern deep RL — 20+ years of primary contributions.
Quantitative models of complex adaptive systems. The same mathematics that describes biological learning also describes how machines should learn from feedback.
Architecture, training dynamics, and generalization. Research-grade understanding of why neural networks succeed and fail — not just how to run them.
Bridging formal statistical theory and real-world data science practice. The difference between models that look good in papers and models that work in production.
Most RL consultancies are one of two things: academics who've never shipped production code, or engineers who've never read a paper. DataWorks is neither — and both.
We don't just recommend an algorithm and leave. We stay until it works, explain why it works, and make sure your team can maintain it.
Research-backed implementations for real commercial problems — performance engineering, model auditing, predictive analytics, and cross-source data engineering.
This engagement demonstrates what "research-backed implementation" means in practice: porting a complex, patent-specified inference algorithm to production-grade C while preserving every semantic detail — including subtle correctness properties like fallacy-prevention in backward propagation that are easy to get wrong. The result is a >100× speedup and >10× memory reduction, unlocking production workloads that were previously infeasible. The same approach applies to any client who has a working prototype in a high-level language and needs production performance without sacrificing correctness.
| Network Size | C Engine | Previous Engine | Speedup |
|---|---|---|---|
| 5k nodes | 0.003s | 0.384s | 131× |
| 10k nodes | 0.006s | 0.750s | 128× |
| 50k nodes | 0.031s | 3.905s | 128× |
| 100k nodes | 0.079s | 12.361s | 156× |

| Network Size | C Engine Memory | Previous Engine Memory | Reduction |
|---|---|---|---|
| 5k nodes | 3.3 MB | 12.2 MB | 3.7× |
| 100k nodes | 22.3 MB | 249.0 MB | 11.2× |
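Numbers like those above only mean something if both engines are timed on identical inputs. Below is a generic sketch of that kind of harness, with two stand-in "engines": a pointer-heavy dict-of-lists graph pass versus a flat CSR-style layout, the representation a C port typically adopts for cache locality and memory. These are not the actual engines, and absolute timings will vary by machine.

```python
import array, random, timeit

# Stand-in workload: propagate values along 20k random edges of a 5k-node graph.
random.seed(0)
N, E = 5_000, 20_000
edges = [(random.randrange(N), random.randrange(N)) for _ in range(E)]

# "Previous engine" stand-in: dict of adjacency lists (pointer-heavy).
adj = {i: [] for i in range(N)}
for u, v in edges:
    adj[u].append(v)

# "C engine" stand-in: CSR-style flat arrays (offsets + edge targets).
deg = array.array("i", [0] * (N + 1))
for u, _ in edges:
    deg[u + 1] += 1
for i in range(N):
    deg[i + 1] += deg[i]            # prefix sums -> row offsets
tgt = array.array("i", [0] * E)
fill = list(deg[:N])
for u, v in edges:
    tgt[fill[u]] = v
    fill[u] += 1

def pass_dict(values):
    out = [0.0] * N
    for u in range(N):
        for v in adj[u]:
            out[v] += values[u]
    return out

def pass_flat(values):
    out = [0.0] * N
    for u in range(N):
        for k in range(deg[u], deg[u + 1]):
            out[tgt[k]] += values[u]
    return out

vals = [1.0] * N
# Semantic equivalence check first, then timing on identical inputs.
assert pass_dict(vals) == pass_flat(vals)
t_dict = timeit.timeit(lambda: pass_dict(vals), number=5)
t_flat = timeit.timeit(lambda: pass_flat(vals), number=5)
```

The equivalence assert is the part that matters most: a speedup claim is worthless if the fast engine computes something subtly different.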
Benchmark auditing applicable to LLM evaluation, AI due diligence, and stress-testing "the model can reason" claims.
Diffuse cultural-choice data as leading indicators for brand sentiment, product reception, and public-figure risk, long before traditional polling catches up.
Fragmented multi-source evidence → defensible schema → single model → actionable verdict. Directly applicable to multi-vendor benchmark audits and cross-study synthesis.
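The pipeline above can be sketched in miniature: each source arrives in its own shape, gets normalized into one defensible record type, and a single model produces the verdict. Every source name, field, and weight here is hypothetical, invented purely to illustrate the shape of the work.

```python
from dataclasses import dataclass

# Unified schema: one record type, whatever shape the source data arrives in.
@dataclass(frozen=True)
class Evidence:
    source: str
    subject: str
    metric: str
    value: float     # normalized to [0, 1]
    weight: float    # per-source reliability, set during the audit

# Hypothetical source adapters: each maps a vendor's raw row into the schema.
def from_vendor_a(row):          # e.g. {"item": ..., "score": ...}
    return Evidence("vendor_a", row["item"], "score", float(row["score"]), 0.8)

def from_vendor_b(row):          # e.g. {"name": ..., "rating_pct": ...}
    return Evidence("vendor_b", row["name"], "score", row["rating_pct"] / 100.0, 0.6)

def verdict(records):
    """Single model over the unified schema: a reliability-weighted mean."""
    num = sum(r.value * r.weight for r in records)
    den = sum(r.weight for r in records)
    return num / den

records = [
    from_vendor_a({"item": "model-x", "score": 0.91}),
    from_vendor_b({"name": "model-x", "rating_pct": 78}),
]
score = verdict(records)
```

The design point: disagreements between sources are resolved once, explicitly, in the weights, rather than silently in ad-hoc merging code.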
No retainer traps. No surprise overruns. We scope tightly and stick to it.
Architecture review, technical advisory, model evaluation, or any focused work that doesn't require a full project scope. Good for getting unstuck fast. Book time →
Full RL system design and implementation. We define the scope, agree on milestones, and deliver. Typical engagements run 6–16 weeks depending on complexity. Get a scope →
8 hours/month of dedicated access. Code review, architecture guidance, and a direct line to a 25-year RL expert when your team needs backup. Start a conversation →
We respond within one business day. If the project is a good fit, we'll set up a 30-minute call to scope it properly.
We'll review your project and respond within one business day.