Miguel Vieira

Research Software Engineer

About

I am a Software Engineer with expertise in designing, developing, and deploying Web based projects, analysing and modelling structured and unstructured data, natural language processing, machine learning, and systems administration.

I have over ten years experience developing in Python, Java and XSLT; over five years experience with Django; knowledge of several metadata standards, including TEI, METS, EAD; experience in several Apache projects including Ant, Cocoon, Lucene, Solr, Tomcat and Jetty; also designing, implementing and integrating databases (PostgreSQL, MySQL, SQLite), and ontologies (RDF/OWL).

Technical Skills

Python (Django, Jupyter Notebooks, NLTK, Pandas, spaCy), Java, Octave, Ruby; HTML5, JavaScript, jQuery, d3js, Vue, CSS, SASS, Foundation, Bulma, Jekyll; Subversion, Mercurial, Git; XML, XSLT, XSD, RNG; Apache Cocoon, Apache Lucene, Apache Solr, ElasticSearch; Apache Tomcat, Apache Ant; RDF/OWL, Protégé; SQL, PostgreSQL, MySQL, SQLite, eXist DB, Redis, MongoDB, Firebase; Linux administration, AWS; Agile methodologies.

Experience

My work experience and training.

2018 —

Principal Research Software Engineer
King's Digital Lab, King’s College London

2015

Certification: Agile Project Management Practitioner
APMG

2015 — 2018

Software Engineer
King's Digital Lab, King’s College London

2013

Certification: Machine Learning
Coursera

2012 — 2015

Head of Research Development & Delivery
Department of Digital Humanitities, King’s College London

2005 — 2006

Project Engineer
Critical Software

2001 — 2005

Software Developer
BookMARC

2001

MSc: Computer Science
Universidade de Coimbra, Portugal

1998

BSc: Informatics and Systems Engineering
Instituto Superior de Engenharia de Coimbra, Portugal

Projects

Some of the projects I am currently working on.

African Rock Art

A Jekyll website for the British Museum African Rock Art Image project.

Contentful · Jekyll · Ruby

Data Portal Explorer

A set of tools/libraries to harvest and analyse data from public data portals.

Jupyter notebooks · Pandas · Python · REST

Georgian Papers Programme

A collaborative platform to archive, catalogue, and transcribe items in the Royal Archives and Royal Library relating to the Georgian period, 1714-1837.

Django · REST · Vue · Wagtail

Nanotoms

A prototype for exposing multimedia digital assets in a way that includes narrative metadata, both static and dynamically generated.

Django · Neo4J · NLP · NLTK · Python

spaCy NER

Jupyter notebooks with example workflows for training spaCy NER (name entity recognition) models from ancient languages.

Jupyter notebooks · Python · NLP · spaCy