My Expertise

Hi, I'm Leo—a software engineer and lecturer from Spain. I build reliable software, teach people how it works, and spend the rest of my time making music or tinkering with hardware.

My work moves between web products, data pipelines and applied AI. I like turning fuzzy ideas into useful prototypes, then refining them into clear, well-documented systems with as little accidental complexity as possible.

Curiosity ties it all together. I've taught AI and software engineering for more than nine years, researched language and music technologies, and still happily disappear into Game Boy hardware, homegrown PCBs, a 3D printer or a jazz gig.

Web & Products

I design and build friendly interfaces, practical APIs and prototypes that are simple to use and easy to maintain.

Software & AI

From compact tools to distributed data pipelines, I work across languages and use AI where it solves a real problem.

How I Work

Clear code, useful documentation, sensible security and honest collaboration—enough process to help a team, never to slow it down.

Featured Projects

CARAMEL

CARAMEL is a research and innovation project funded under Horizon Europe. It aims to develop an AI-driven, personalized prevention model tailored to the unique cardiovascular health needs of women aged 40-60, particularly during the menopausal transition, when CVD risk rises significantly.

Check it out

CRYPTOTRACK

CRYPTOTRACK is a system developed by Treelogic to monitor and detect illicit activities in cryptocurrency transactions. This project was part of the first call of the Strategic Initiative for Innovative Public Procurement (IECPI) by the National Institute of Cybersecurity (INCIBE).

Check it out

COMMUNITAS

COMMUNITAS principal goal is to pave the way for the empowerment and engagement of different types of consumers and prosumers, placing them at the heart of energy markets. It will do so by boosting the creation and exploiting the potentialities of ECs as hubs for innovative energy services, integrated with non-energy benefits, co-created together with citizens and other stakeholders.

Check it out

MODERATE

MODERATE aims to connect data providers with other building stakeholders by improving interoperability between datasets, by making use of data from different providers and aims to develop services based on data analytics that can transform raw data into knowledge for end-users.

Check it out

STREAM

The STREAM project aims to create an innovative and robust flexibility ecosystem on the low voltage grid side of existing power markets connecting data, technologies, stakeholders and markets, thus facilitating the flexibility provision.

Check it out

InEExS

InEExs proposes Innovative Energy (Efficiency) Service Models for Sector Integration. The project aims to acilitate the implementation of sector-integrating smart energy services and the deployment of a wide range of sustainable technologies, such as renewables, EV, heat pumps, IoT controls and other energy efficiency measures.

Check it out

MultiScore

MultiScore proposes the development of neural models that leverage large data sets to learn both OMR and AMT holistically (end-to-end) under a common framework for transcribing music.

Check it out

MaCoCu

MaCoCu focuses on collecting monolingual and parallel data from the Internet, specially for under-resourced languages and DSI-specific data.

Check it out

lpla/gb-pcbs

Delving into the world of gaming hardware, this project provides open-source designs for Game Boy-related hardware, making advanced DIY tech accessible to enthusiasts and creators worldwide through services like OSH Park or PCB Way.

Check it out

Bitextor

Bitextor generates translation memories from multilingual websites or WARC files. A complete pipeline optimized for high-performance and distributed environments.

Check it out

Paracrawl

Crawling thousands of websites, added to the Internet Archive data and processing all efficiently with open-source software to create a huge, powerful and heterogeneus parallel corpus for Machine Translation systems and Large Language Models.

Check it out

Reverso Context

The most advanced and fast parallel corpora search tool, finding aligned documents and sentences from many public resources at unprecedented speeds, with several million page-views every day and deeply integrated into Reverso ecosystem.

Check it out

Leopoldo Pla Sempere Lecturer & dev. Sometimes musician.

My Expertise

Web & Products

Software & AI

How I Work

Featured Projects

CARAMEL

CRYPTOTRACK

COMMUNITAS

MODERATE

STREAM

InEExS

MultiScore

MaCoCu

lpla/gb-pcbs

Bitextor

Paracrawl

Reverso Context

CV