About Me
My guiding principle is to always go beyond the surface level. I believe that what separates "it works" from "it works beautifully" comes down to a deep grasp of the fundamentals and a relentless focus on the details.
It's a philosophy I apply to everything I build.
Some of the thinking that has deeply influenced my approach:
- • "Atomic Habits" by James Clear
- • "Deep Work" by Cal Newport
- • "Thinking, Fast and Slow" by Daniel Kahneman
Work Experience
Senior Software Engineer
Tinexta Innovation Hub
June 2025 - Present | Naples, IT
- Contributed as a core developer to Lextel AI, a hybrid search platform (Elasticsearch + GenAI) featured in an official Elastic case study for reducing legal research time from hours to minutes.
- Resolved a key driver of customer dissatisfaction by implementing a parallel reranking system leveraging Gemini Flash, achieving a 100% reduction in user-reported complaints regarding missing information.
- Engineered a large-scale document processing pipeline, reducing analysis costs by 90% (from an estimated €1,000 to €100) for a 50,000-page legal corpus by leveraging asynchronous Gemini Batch APIs.
- Architected a multi-agent patent analysis workflow, increasing retrieval accuracy by an estimated 15% by orchestrating LLM calls for seed analysis, parallel query generation, and final results reranking.
Machine Learning Engineer
NTT Data
Feb 2022 - June 2025 | Naples, IT
- Led the end-to-end design and deployment of a conversational AI platform, serving as the technical lead for a cross-functional team of 7 (PMs, analysts, developers). The solution automated 100% of after-hours claims and reduced contact center operational load by 50%.
- Mentored junior team members to achieve full project autonomy, architecting a scalable microservices solution (Dialogflow CX, IVR, backend APIs) and ensuring a successful handover upon departure.
- Recognized by the client as a key technology partner ("cornerstone of the project"), transforming the relationship from a vendor to an integrated team and solidifying the client's AI roadmap.
Technical Leadership & Projects
- Author of technical explorations into advanced AI concepts, including LLM inference optimization (Speculative Decoding), agentic architectures (Self-RAG), and cross-platform orchestration (Semantic Kernel).
- Developer of open-source projects to explain and implement state-of-the-art AI techniques, including a from-scratch implementation of speculative decoding that achieved a 2.4x inference speedup in local tests.
Technical Skills
🚀 Core Stack
- Python & FastAPI
- RAG Architecture
- LLM Integration (Gemini, Azure OpenAI)
- Docker
☁️ Cloud Platforms & MLOps
- Google Cloud Platform (GCP)
- Microsoft Azure
- Azure DevOps (Pipeline Authoring)
- Kubernetes (GKE) & OpenShift
- BigQuery & SQL
🤖 AI Frameworks & Orchestration
- Concepts: NLP, Deep Search, Agentic Workflows, Prompt Engineering
- Libraries: LangGraph, LangChain, Semantic Kernel, Hugging Face
⚙️ Backend & Data Stores
- API Development: FastAPI, Spring
- Databases: Elasticsearch, Azure Cosmos DB, Vector Search (Azure AI Search)
Education & Certifications
Master of Science in Computer Engineering
University of Naples Federico II
Sept 2019 - March 2022
Grade: 110/110 with honors
AWS Certified Machine Learning – Specialty
Amazon Web Services
🏆 Perfect Score: 1000/1000