Remote — Germany & South Africa

Tina Koziol

Computational economist by training, automation engineer by habit. About 10 years ago I started my data science and research journey the way most people did: hunting for error messages on Stack Overflow and getting excited about pandas and scikit-learn. Now we have agentic tools that make autonomous decisions and work with data in ways I couldn't have imagined back then. The goal has always been the same, making the most out of data, but the tools have completely revolutionised how we get there.

My PhD put 240,000 agents on HPC clusters to study how agent behaviour propagates through networks. In the last four years, as Senior Data Scientist at Wintershall Dea, I took AI out of notebooks and into operations: LLM pipelines hitting 99% extraction accuracy on messy corporate documents, LSTM anomaly detection on live well sensor data, MLOps infrastructure that kept running long after everyone moved on.

And I can't quite keep my hands off research either. I'm always drawn to questions around monetary policy, asset bubbles, how markets actually behave — you'll find my latest work here.

Today I build automated data platforms end-to-end: Django applications with CI/CD via GitHub Actions, custom AI agent skills and scheduled tasks on Claude Code and MCP, multi-stage workflows from data capture to cloud delivery.

Open to roles PhD Economics · UCT Entrepreneur Active researcher DE / EN / FR
Tina Koziol
Tina Koziol
Data Science · Economics
Experience

Complex systems, messy data.

Complex systems, real stakes, and getting AI to work inside organisations — not just in demos.

2025 – now
Remote / Lingen
ongoing
AI Systems & Automation Engineer
Ceraluna Labs

Architecture and operation of a production Django platform with automated data pipelines and CI/CD via GitHub Actions.

Built custom AI agent skills and scheduled tasks with Claude Code and MCP integration — automated content translation, crime data pipelines, and report generation.

Multi-stage automation workflows for data capture, transformation, R2 cloud upload, and Markdown-based documentation.

API integrations for SEO data, geo services, currency APIs, and analytics dashboards.

Scalable content pipelines: automated blog translation, keyword-driven report generation, and interactive map creation.

2021 – 2025
Hamburg
4 years
Senior Data Scientist
Wintershall Dea GmbH

Ran AI/ML projects from idea through production, including an LLM document pipeline (Claude, OpenAI) that hit 99% extraction accuracy on messy, real-world files.

Led cross-functional teams and mentored junior data scientists through production delivery.

Built the MLOps layer — CI/CD with GitHub Actions, monitoring, model governance — so nothing stayed stuck in a notebook.

Technical implementation of an LSTM-based architecture on well sensor data to support detection of well integrity issues.

Owned data curation pipelines and quality frameworks — because models are only as good as what goes in.

Led EU AI Act and NIST AI RMF alignment across 120+ staff. Not a checkbox exercise — actual workflow redesign, risk classification, and safety guardrails.

2020 – 2021
Remote / SA
1 year
Computational Researcher
GEMS — Univ. of Minnesota & Univ. of Stellenbosch

Large socio-economic datasets, cross-institutional research, policy-facing outputs.

Python, SQL, Postgres, GIS. Worked across time zones and disciplines to support food security modelling.

2016 – 2020
Cape Town
4 years
PhD Researcher & Lecturer
University of Cape Town — School of Economics (AIFMRM)

Built agent-based models with 240,000 agents on HPC clusters to study how financial crises spread.

Published on systemic risk and contagion — peer-reviewed, cited, still relevant.

Taught econometrics and programming. Supervised student projects.

2014 – 2016
Johannesburg
2 years
Consultant
Econometrix Ltd

Market studies for South African energy and finance. Budget management, profitability analysis, client delivery.

Research

Working papers and active research.

My research examines the joint dynamics of financial markets, monetary policy, and wealth distribution in advanced economies. I use agent-based macroeconomic models, computational methods, and empirical analysis to study how behavioural and political-economy feedbacks shape macro-financial outcomes. PhD in Computational Economics, University of Cape Town (2020).

The Monetary Expansion Riddle

Single-authored
Working paper

Why did a decade of aggressive post-GFC easing produce almost no consumer-price inflation, while a single round of post-2020 expansion produced the sharpest inflation episode in forty years? The paper argues that housing is the hinge sector linking both regimes, and that how national CPIs measure shelter acts as a symmetric delay mechanism that systematically disguises the true speed of monetary pass-through in both directions.

Comparable monetary expansions produced radically different inflation outcomes after the Global Financial Crisis and after 2020. The dominant explanations — supply shocks, fiscal support, expectations de-anchoring — treat the divergence as contingent on the shocks themselves. This paper offers a structural alternative: the two episodes are reconciled once housing is placed at the centre of the transmission mechanism and CPI shelter measurement is recognised as a symmetric lag operator that understates inflation on the way up and overstates disinflation on the way down.

The core contribution is a 5×2 transmission taxonomy: five macro-financial channels (credit supply, collateral valuations, rental-market pass-through, wealth effects, and mortgage cash-flow effects) mapped across two inflation regimes (post-GFC and post-2020). The taxonomy is organised around four institutional primitives that vary across advanced economies and jointly determine which channels fire in which regime: (i) CPI shelter construction — rental-equivalence vs. actual rents vs. house prices, (ii) mortgage-contract structure — fixed-rate vs. floating, recourse vs. non-recourse, prepayment penalties, (iii) housing supply elasticity, and (iv) outright homeownership rate.

Using a harmonised panel of twenty advanced economies, I show that cross-country heterogeneity in inflation outcomes is driven primarily by these four institutional features rather than by shock composition. The framework recasts cross-country divergence as structural rather than contingent, identifies which channels can be reactivated by policy, and produces testable predictions for the next expansion. A companion SUERF Policy Brief summarises the implications for inflation-targeting regimes.

Reference Returns, Belief Contagion, and Monetary Policy

Single-authored
Work in progress

The first agent-based macro model in which monetary policy is endogenously shaped by wealth-weighted political-economy pressure. Capitalists anchor on subjective reference returns that adapt sluggishly to realised performance; when reality falls short, they reach for yield and propagate the response through social networks — producing endogenous regime shifts and a feedback loop from capitalist wealth to the central bank's loss function.

This paper develops an agent-based model of a three-population economy — workers, capitalists, and a central bank — in which capitalists hold heterogeneous reference returns: subjective anchors for what they believe they "should" be earning on financial wealth. Reference returns adapt sluggishly to realised performance, combining habit formation with psychological anchoring. The gap between reference and realised returns drives portfolio behaviour: when realised falls below reference, capitalists reach for yield, compressing risk premia and shifting demand toward riskier assets.

The novelty lies in two coupled mechanisms. First, reference returns are heterogeneous across the capitalist population, reflecting differences in experience and exposure. Second, beliefs about achievable returns spread through a social-network contagion process: as neighbours reach for yield and realise higher returns, peer reference returns drift upward, amplifying the aggregate response and generating endogenous feedback loops. The methodology builds on the Brock–Hommes tradition of heterogeneous-expectation models and the LeBaron line of agent-based financial modelling, extended with explicit network-contagion dynamics.

The central theoretical contribution is a political-economy channel from wealth-weighted capitalist preferences to the central bank's policy stance. Because capitalists hold the bulk of financial wealth and their reference returns drift with realised performance, prolonged periods of low realised returns generate persistent political pressure for accommodative policy — institutionalised through asset-price stability concerns, financial-stability mandates, and lobbying. The central bank's endogenous response further validates elevated reference returns, closing the loop.

Preliminary simulations reproduce several empirical regularities of low-rate environments: persistent demand for higher-yielding assets, narrowing risk premia across asset classes, endogenous regime shifts between low- and high-volatility states, and asymmetric responses to tightening vs. easing. The framework offers a mechanism for the observed secular drift in monetary policy stance across advanced economies that does not rely on exogenous shifts in natural rates or policy preferences.

Social learning in a network model of Covid-19

Davids, A., du Rand, G., Georg, C.-P., Koziol, T., & Schasfoort, J.
J. Econ. Behav. Organ. 213, 271–304 (2023) · doi:10.1016/j.jebo.2023.07.010

Embeds naive social learning (DeGroot, 1974) into a network SEIR agent-based model of Covid-19 transmission. Calibrated to Cape Town, the model shows that social learning both flattens and shortens the epidemic curve — qualitatively different from the flattening produced by non-pharmaceutical interventions that reduce contact rates or transmission probabilities.

We develop an agent-based epidemiological model in which agents are nodes in a network, their physical interactions are time-varying edges, and each agent carries an epidemiological state drawn from the Susceptible–Exposed–Infected–Recovered (SEIR) compartments. Into this setting we introduce naive social learning in the sense of DeGroot (1974): agents form subjective beliefs about the prevalence and severity of the disease by repeatedly averaging signals received from their neighbours, and adjust their protective behaviour accordingly.

The model is calibrated to detailed data for Cape Town, South Africa. Incorporating social learning materially improves fit: the best-fit squared difference between modelled and observed excess fatalities drops from 19.34 to 11.40. More importantly, the qualitative behaviour differs from standard compartmental models — social learning both flattens and shortens the curves for infections, hospitalisations, and excess fatalities, whereas contact-rate or transmission-probability reductions flatten the curve at the cost of lengthening the epidemic.

The paper contributes a tractable framework for embedding heterogeneous belief formation into epidemic models and shows that the behavioural response to information is a first-order determinant of epidemic dynamics — separate from, and sometimes substituting for, formal non-pharmaceutical interventions.

Other publications
Career Trajectory
2014 Consulting 2016 PhD Research 2020 Comp. Research 2021 – 2025 Sr. Data Scientist
12+
years in data
PhD
computational econ
2
continents
Skills

The tools, and how I use them.

Where I've spent my time
Python / Data10+ yrs LLMs / Agentic AI4 yrs Enterprise AI adoption4 yrs AI governance / safety3 yrs MLOps / Cloud4 yrs Research / agent-based models6 yrs Consulting / strategy2 yrs

Programming

PythonDaily driver for a decade. Data, web, automation, ML.
REconometrics, research, statistical modelling.
SQL / PostgresComplex queries, migrations, production schemas.
Django / WebTwo live products built end-to-end with Django.

ML & AI

Claude / Codex / OpenAIProduction pipelines, not just chat prompts.
RAG / NLP / Agentic AIDocument intelligence, extraction, verification loops.
Anomaly DetectionLSTM on real sensor data. Deployed, not just prototyped.
Agent-Based Models240k agents on HPC. Published in JEBO.

Cloud & MLOps

Azure AI / MLModel deployment, monitoring, governance.
Docker / CI/CDContainerised pipelines, reproducible builds.
GitHub Actions24 automated workflows running in production.
HPC ClustersLarge-scale simulations, parallel computing.

Governance & Leadership

Project ownershipEnd-to-end delivery, no hand-holding needed.
Cross-functional teamsEngineers, domain experts, leadership — kept aligned.
EU AI Act / NIST AI RMFLed compliance across 120+ staff. Not a checkbox exercise.
Data curation / guardrailsQuality in, quality out. Built the frameworks.

Builder

Dashboards, workflows, websites, internal systems,I make things and keep improving them.

Governance-minded

EU AI Act, NIST AI RMF, safety guardrails, data curation. I think about what goes into a system before asking what comes out.

Systems thinker

My economics PhD trained me to see interactions, incentives, and emergent behaviour. That still shapes everything.

Driver

I take ownership, make decisions, and move work forward. I don't wait to be told what to do next.

Selected work

Proof beats promises.

Enterprise AI

Document Intelligence Pipeline

Extracts structured information from messy corporate documents. Not a demo.ran in production, used by people who didn't know it was AI.

AzureOpenAIClaudeLangChainPython
Research

COVID-19 Network Model

Agent-based model with 240,000 agents simulating how social learning and contagion interact in complex networks. Published in JEBO.

PythonRHPC240k agents
Safety AI

AI for Well Integrity Monitoring

LSTM architecture on well sensor data to support detection of well integrity issues. Built on real operational data, presented at ADIPEC/SPE.

LSTMWell IntegrityMLOpsAzure
Venture

capetowndata.com

A multilingual information site for Cape Town.safety data, neighbourhood guides, local knowledge. Thousands of monthly readers.

DjangoPythonCloudflareStripeMultilingual
Venture

bubble-tracker.com

Tracking and visualising economic and market bubble indicators.turning scattered signals into something you can watch over time.

DjangoPythonData vizEconomics
Passion Project

timeline.music

An interactive music atlas.explore how genres, artists, and scenes evolved across cities and countries, from the origins to now.

Next.jsDjangoCloudflare R2Multilingual

Education

PhD in Economics
University of Cape Town,2016–2020
Computational economics, agent-based models, HPC simulations. Thesis on systemic risk and contagion.
Diplom-Kauffrau
Friedrich-Schiller-Universität Jena,2013
Equivalent to a master's in business administration.

Languages

GermanNative
EnglishFluent
FrenchIntermediate

Get in touch