Hakim Jonas Ghoula
Software Engineer & Systems Architect
hakim@walkthisway.dk | +45 26335577 | Hothers Plads 11, 3, 2200 Cph. N
Profile
Software engineer with four years of professional experience in distributed data systems, plus independent work spanning library design, compiler development, and AI agent architectures. Primary languages are Scala, Rust, and Python. Professional work includes Apache Spark pipelines for identity resolution at scale, data governance frameworks, and developer productivity tooling. Independent work includes a published validation library (Maven Central) and systems built to understand domains from the ground up. Background in cognitive science (MSc) informs approach to system design, particularly in architectures combining deterministic validation with LLM reasoning.
Professional Experience
Data Engineer
LiveIntent (acquired by Zeta Global, November 2024) | Copenhagen, Denmark
Technical owner of distributed data systems for AdTech/MarTech identity resolution. Four years focused on Apache Spark pipeline development and testing, with additional contributions spanning data governance, infrastructure automation, and AI agent systems. Code owner for multiple warehouse modules.
- Architected AI agent system with dual-process cognitive architecture, combining deterministic validation with LLM reasoning for automated configuration workflows
- Developed developer productivity CLI integrating issue tracking, version control, and MCP (Model Context Protocol) for AI assistant integration
- Designed warehouse-wide data governance framework enabling configurable privacy controls and hash filtering across all delivery pipelines
- Built compile-time JSON schema derivation using Scala metaprogramming for configuration validation across warehouse systems
- Developed and maintained Apache Spark ETL pipelines with comprehensive Specs2 test coverage for identity resolution and audience segmentation
- Integrated UID2 (Unified ID 2.0) protocol for privacy-compliant identity resolution across partner ecosystems
- Built PersonID matching system with normalized address resolution for cross-system identity linkage
Python Back End Developer
Ocean.io | Copenhagen, Denmark
Developed NLP components for B2B lead intelligence platform, including email prediction models and multilingual language detection.
- Built email prediction pipeline using pattern matching and probabilistic models
- Implemented language detection module for multilingual content classification
Front End Developer / Data Engineer
Købstædernes Forsikring | Copenhagen, Denmark
Contributed to the development of a new data model for the insurance customer database.
- Implemented white-label solutions and managed web projects for external partners
Education
MSc in IT and Cognition
University of Copenhagen | Copenhagen, Denmark
Bachelor in Philosophy
University of Copenhagen | Copenhagen, Denmark
Diploma in Music Management
The Royal Danish Conservatory | Copenhagen, Denmark
Skills
Programming Languages
- Scala: Advanced FP, Spark applications, Scala 3 metaprogramming, compile-time derivation
- Rust: Systems programming, compiler design, type systems
- Python: FastAPI, Airflow DAGs, CLI tooling, NLP/ML pipelines
Big Data & Distributed Systems
- Apache Spark: ETL pipelines, DataFrame/Dataset APIs, performance tuning, Specs2 testing
- Apache Airflow: DAG orchestration, scheduling, monitoring
- Akka Streams: Reactive stream processing, backpressure handling
- Data Architecture: Pipeline design, Parquet/columnar storage, data lake patterns, schema evolution
AI & Machine Learning
- LLM Engineering: AWS Bedrock, prompt engineering, ReAct patterns, MCP protocol
- NLP: spaCy, intent classification, semantic search, embeddings
- Cognitive Architectures: Dual-process systems (Kahneman), metacognitive agents
Cloud & Infrastructure
- AWS: S3, Athena, Bedrock, IAM
- Kubernetes: Deployments, Jobs, Helm charts, ArgoCD
- Terraform: Modular IaC, reusable modules
- CI/CD: CircleCI, GitHub Actions, automated pipelines
Personal Projects
valar
Type-safe validation library for Scala 3 with automatic case class derivation
fin
Finë is a simple, GTK4-based session transition tool for Linux desktops—built in Rust with a functional, modular design.
cv
CV/portfolio generator producing HTML and PDF from structured JSON data
spore-print (Fungal)
A Rust library providing a SporePrint trait and derive macro for immutable string representations.
Languages
- English: Fluent
- Danish: Native