Hakim Jonas Ghoula

Software Engineer & Systems Architect

hakim@walkthisway.dk | +45 26335577 | Hothers Plads 11, 3, 2200 Cph. N

Profile

Software engineer with four years of professional experience in distributed data systems, plus independent work spanning library design, compiler development, and AI agent architectures. Primary languages are Scala, Rust, and Python. Professional work includes Apache Spark pipelines for identity resolution at scale, data governance frameworks, and developer productivity tooling. Independent work includes a published validation library (Maven Central) and systems built to understand domains from the ground up. Background in cognitive science (MSc) informs approach to system design, particularly in architectures combining deterministic validation with LLM reasoning.

Professional Experience

Data Engineer

LiveIntent (acquired by Zeta Global, November 2024) | Copenhagen, Denmark

2022-01 - Present

Technical owner of distributed data systems for AdTech/MarTech identity resolution. Four years focused on Apache Spark pipeline development and testing, with additional contributions spanning data governance, infrastructure automation, and AI agent systems. Code owner for multiple warehouse modules.

  • Architected AI agent system with dual-process cognitive architecture, combining deterministic validation with LLM reasoning for automated configuration workflows
  • Developed developer productivity CLI integrating issue tracking, version control, and MCP (Model Context Protocol) for AI assistant integration
  • Designed warehouse-wide data governance framework enabling configurable privacy controls and hash filtering across all delivery pipelines
  • Built compile-time JSON schema derivation using Scala metaprogramming for configuration validation across warehouse systems
  • Developed and maintained Apache Spark ETL pipelines with comprehensive Specs2 test coverage for identity resolution and audience segmentation
  • Integrated UID2 (Unified ID 2.0) protocol for privacy-compliant identity resolution across partner ecosystems
  • Built PersonID matching system with normalized address resolution for cross-system identity linkage
Apache Spark, Scala, Python, AWS (S3, Athena, Bedrock), Functional Programming, Specs2, Akka Streams, Parquet, Kubernetes, Helm, Docker, Terraform, LLM/AI Engineering, spaCy/NLP, MCP Protocol, FastAPI, Apache Airflow, CircleCI, GitHub Actions

Python Back End Developer

Ocean.io | Copenhagen, Denmark

2018 - 2019

Developed NLP components for B2B lead intelligence platform, including email prediction models and multilingual language detection.

  • Built email prediction pipeline using pattern matching and probabilistic models
  • Implemented language detection module for multilingual content classification
Python, NLP, REST APIs, Machine Learning

Front End Developer / Data Engineer

Købstædernes Forsikring | Copenhagen, Denmark

2017 - 2018

Contributed to the development of a new data model for the insurance customer database.

  • Implemented white-label solutions and managed web projects for external partners
Front End, Data Modeling

Education

MSc in IT and Cognition

University of Copenhagen | Copenhagen, Denmark

2014 - 2016

Bachelor in Philosophy

University of Copenhagen | Copenhagen, Denmark

2008 - 2011

Diploma in Music Management

The Royal Danish Conservatory | Copenhagen, Denmark

2009 - 2011

Skills

Programming Languages

  • Scala: Advanced FP, Spark applications, Scala 3 metaprogramming, compile-time derivation
  • Rust: Systems programming, compiler design, type systems
  • Python: FastAPI, Airflow DAGs, CLI tooling, NLP/ML pipelines

Big Data & Distributed Systems

  • Apache Spark: ETL pipelines, DataFrame/Dataset APIs, performance tuning, Specs2 testing
  • Apache Airflow: DAG orchestration, scheduling, monitoring
  • Akka Streams: Reactive stream processing, backpressure handling
  • Data Architecture: Pipeline design, Parquet/columnar storage, data lake patterns, schema evolution

AI & Machine Learning

  • LLM Engineering: AWS Bedrock, prompt engineering, ReAct patterns, MCP protocol
  • NLP: spaCy, intent classification, semantic search, embeddings
  • Cognitive Architectures: Dual-process systems (Kahneman), metacognitive agents

Cloud & Infrastructure

  • AWS: S3, Athena, Bedrock, IAM
  • Kubernetes: Deployments, Jobs, Helm charts, ArgoCD
  • Terraform: Modular IaC, reusable modules
  • CI/CD: CircleCI, GitHub Actions, automated pipelines

Personal Projects

valar

Type-safe validation library for Scala 3 with automatic case class derivation

Scala, data-validation, error-handling, scala-native, scala3, validation

fin

Finë is a simple GTK4-based session transition tool for Linux desktops, built in Rust with a functional, modular design.

Rust

cv

CV/portfolio generator producing HTML and PDF from structured JSON data

Rust

spore-print (Fungal)

A Rust library providing a SporePrint trait and derive macro for immutable string representations.

Rust

Languages

  • English: Fluent
  • Danish: Native