EXPERIENCE

Senior Data Scientist

The Alan Turing Institute

2022 - Present

Data Scientist, 2021 - 2022

  • Safety testing pre-release frontier models.
  • Building infrastructure for evaluating LLMs and human-AI experimentation.
  • Delivered multiple applied research projects with government stakeholders.
  • Built large-scale cloud data and ML pipelines.

Data Scientist

Curve Analytics, 2021

  • Developed and deployed search traffic ranking solutions and recommender systems at a data science consultancy.

SKILLS

  • Python
  • SQL
  • Azure/AWS/GCP
  • Docker
  • Git
  • PyTorch
  • HF Transformers
  • Scikit-learn
  • Pandas
  • Flask
  • Gradio
  • FastAPI

EDUCATION

MEng (1st), Computer Science with Innovation

University of Bristol

2016 - 2020

  • Dissertation (1st): NLP classifiers for misleading headline detection.
  • Units taken include: Applied Data Science, Applied Deep Learning, Machine Learning, and Design & Systems Thinking for Innovation.

🎭 LLM Disinformation

Evaluating LLMs for malicious information operations
Measuring compliance and "humanness" of AI-generated election disinformation.

🐙 Prompto

Facilitating LLM experiments
Python library for asynchronous querying of LLM endpoints.

🦤 DoDo Learning

Cross-domain abuse classification
Exploring domain-demographic transfer performance in language models for abuse classification.

🔭 Online Harms Observatory

Tracking online abuse
Building pipelines and deploying language models to deliver real-time analytics on online harm.

⚽ Footballer Abuse

Analysing patterns in online abuse
Collecting data and training language models to detect abuse targeted at football players.