Leopoldo Pla Sempere Lecturer & dev. Sometimes musician.

My Expertise

Hello! My name is Leo, a passionate software developer from Spain. Musician and hardware researcher by night, senior software dev and lecturer by day. Game Boy enthusiast either way.

Since 2015 I have been applying state-of-the-art techniques in Data Science, Natural Language Processing, Machine Learning and Music Information Retrieval to create useful new tools and technologies in agile environments.

I have built everything from simple scrapers and HTML text extractors to production-ready parallel corpora cleaning pipelines, all for a successful set of translation platforms.

I lecture on computer science subjects, such as programming fundamentals and project management and planning, in several degrees, including Mathematics and Computer Science.

In music, MIR on the technical side, with several projects on automatic dodecaphonic composition, classical composer detection from audio and neural music transcription; live jazz, wind-orchestra and rock music on the practical side. Some bleep-bloop chiptunes too.

When I have some spare time, I calibrate my 3D printer and design useful models. Also PCBs.

Design

Features are not everything in software development. Always staying up to date on code style, designing useful APIs and intuitive interfaces.

Code

Coding in a variety of languages and paradigms for modern times! From local companies' websites to wide HPCC deployments.

Tools

Constantly developing with and for humans. Working with the right collaborative tools in Scrum/agile-like groups makes the daily routine easier!

Featured Projects

COMMUNITAS

The principal goal of COMMUNITAS is to pave the way for the empowerment and engagement of different types of consumers and prosumers, placing them at the heart of energy markets. It will do so by boosting the creation of energy communities (ECs) and exploiting their potential as hubs for innovative energy services, integrated with non-energy benefits and co-created with citizens and other stakeholders.

Check it out

MODERATE

MODERATE aims to connect data providers with other building stakeholders by improving interoperability between datasets and making use of data from different providers. It also aims to develop services based on data analytics that can transform raw data into knowledge for end-users.

Check it out

STREAM

The STREAM project aims to create an innovative and robust flexibility ecosystem on the low-voltage grid side of existing power markets, connecting data, technologies, stakeholders and markets, and thus facilitating flexibility provision.

Check it out

InEExS

InEExS proposes Innovative Energy (Efficiency) Service Models for Sector Integration. The project aims to facilitate the implementation of sector-integrating smart energy services and the deployment of a wide range of sustainable technologies, such as renewables, EVs, heat pumps, IoT controls and other energy efficiency measures.

Check it out

MultiScore

MultiScore proposes the development of neural models that leverage large data sets to learn both OMR and AMT holistically (end-to-end) under a common framework for transcribing music.

Check it out

MaCoCu

MaCoCu focuses on collecting monolingual and parallel data from the Internet, especially for under-resourced languages and DSI-specific data.

Check it out

lpla/gb-pcbs

Game Boy-related custom hardware source files, easily reproducible with services like OSH Park or PCBWay.

Check it out

Bitextor

Bitextor generates translation memories from multilingual websites or WARC files. A complete pipeline, ready to be used in distributed production environments.

Check it out

ParaCrawl

Crawling thousands of websites, combining them with Internet Archive data and processing everything efficiently with open-source software to create a huge, powerful and heterogeneous parallel corpus for Machine Translation systems.

Check it out

Reverso Context

The most advanced and fastest parallel corpora search tool, finding aligned documents and sentences from many public resources, with several million page views every day and deep integration into the Reverso ecosystem.

Check it out

CV

If you want to know more about me or my work, take a look at my CV:

Vita 📃