Hello! My name is Leo, a passionate software developer from Spain. By night a musician and hardware researcher, by day a senior software dev and lecturer. Game Boy enthusiast either way.
Since 2015 I have worked with state-of-the-art techniques in Data Science, Natural Language Processing, Machine Learning and Music Information Retrieval to create useful new tools and technologies in agile environments.
I have developed everything from simple scrapers and HTML text extractors to production-ready parallel corpora cleaning pipelines, all for a successful set of translation platforms.
I lecture on computer science subjects, such as programming fundamentals and project management and planning, in several degrees including Mathematics and Computer Science.
In music, the technical side is MIR, with several projects related to automatic dodecaphonic composition, classical composer detection from audio and neural music transcription; the practical side is live jazz, wind-orchestra and rock music. Some bleep-bloop chiptunes too.
When I have some spare time, I calibrate my 3D printer and design useful models. Also PCBs.
Features are not everything in software development. Always staying up to date on code style, designing useful APIs and intuitive interfaces.
Coding in a variety of languages and paradigms for modern times! From local companies' websites to wide HPCC deployments.
Constantly developing with and for humans. Working with the right collaborative tools in SCRUM/agile-like groups makes the daily routine easier!
The principal goal of COMMUNITAS is to pave the way for the empowerment and engagement of different types of consumers and prosumers, placing them at the heart of energy markets. It will do so by boosting the creation of energy communities (ECs) and exploiting their potential as hubs for innovative energy services, integrated with non-energy benefits and co-created together with citizens and other stakeholders.
MODERATE aims to connect data providers with other building stakeholders by improving interoperability between datasets and making use of data from different providers. It also aims to develop services based on data analytics that can transform raw data into knowledge for end-users.
The STREAM project aims to create an innovative and robust flexibility ecosystem on the low-voltage grid side of existing power markets, connecting data, technologies, stakeholders and markets, thus facilitating the flexibility provision.
InEExs proposes Innovative Energy (Efficiency) Service Models for Sector Integration. The project aims to facilitate the implementation of sector-integrating smart energy services and the deployment of a wide range of sustainable technologies, such as renewables, EV, heat pumps, IoT controls and other energy efficiency measures.
MultiScore proposes the development of neural models that leverage large data sets to learn both Optical Music Recognition (OMR) and Automatic Music Transcription (AMT) holistically (end-to-end) under a common framework for transcribing music.
MaCoCu focuses on collecting monolingual and parallel data from the Internet, especially for under-resourced languages and DSI-specific data.
Game Boy-related custom hardware source files, easily reproducible with services like OSH Park or PCBWay.
Bitextor generates translation memories from multilingual websites or WARC files. A complete pipeline ready to be used in distributed production environments.
Crawling thousands of websites, combining them with Internet Archive data and processing everything efficiently with open-source software to create a huge, powerful and heterogeneous parallel corpus for Machine Translation systems.
The most advanced and fastest parallel corpora search tool, finding aligned documents and sentences from many public resources, with several million page views every day and deeply integrated into the Reverso ecosystem.