Education

Skills

Some relevant skills I have:

Python
HuggingFace Transformers
Numpy
Pandas
Scikit-Learn
Git/GitHub/BitBucket/GitLab
Data Mining
Jupyter
MATLAB
Docker
Plot.ly
Seaborn
PyTorch
PostgreSQL/MySQL/SQL
Data Visualization
Linux
Amazon Web Services (AWS)
Google Cloud Platform (GCP)
C/C++
Google App Script
Javascript
BigQuery
Tensorflow + Keras
Leaflet
Matplotlib
Tableau
HTML
Node.JS
MongoDB
Google Cloud Compute
Heroku
SASS/SCSS/CSS
Tcl/Tk
Express.JS
Numba
Flask
Bash
React
Kubernetes

Experience

DrugBank - Machine Learning Engineer II

December 2022 - Current

  • Accelerated clinical data curation velocity by 33% by pre-annotating input text using a NER transformer (BERT) NLP model.
  • Normalized text entities to the DrugBank database using fine-tuned search/ ranking bi-encoder models with >88% F1.
  • Increased sales funnel conversion rate estimation F1 score by 40% using data mining/ analytics techniques.
  • Deployed several models to process PubMed abstracts using BentoML to enable high-throughput data mining.
  • Mitigated over 400 Depend Bot vulnerabilities by migrating tech debt from TensorFlow 1 to the newest HuggingFace/ PyTorch architecture.

AltaML - Associate Machine Learning Developer

October 2021 - December 2021

  • Developing models to provide personalized service to customers.
  • Building ETL data pipelines using BigQuery and Python.
  • Applying models such as random forests, XGBoost, and TabNet.
  • Communicating the abilities and limitations of ML systems to developers, product managers, and internal product users.

Tricca Technologies - Research and Development Co-op

June 2020 - August 2020

  • Tricca Technologies is a startup company focused on medical device development.
  • Designed C++ firmware, circuits, PCBs, and 3D printed parts for an electric arc ampoule sealer for independent laboratories to repackage expensive chemical standards into smaller ampoules, reducing cost and waste.

Microsemi/ Microchip Technology - Physical Design Intern

September 2019 - December 2019

  • Automated component metadata cross-validation process to improve operational flow, using RESTful API and Python.
  • Routed parts of the cutting-edge META-DX1 ethernet chip at the top level, which is capable of up to 1.2 Tbps capacity throughput in a single chip.

Binary Research Group - Data Analyst/ Research Assistant

January 2019 - August 2019

  • Worked on a smart Arduino IoT device to perform colorectal cancer screening.
  • funded by the National Institutes of Health (NIH) for patients in Nigeria.
  • Programmed the screening tests using colorimetric reactions based on metabolite concentrations in urine.
  • Established calibration algorithms to mitigate variations between sensors.

Ultrafast Spectroscopy and Nanotools Labs - Research Assistant

January 2019 - August 2019

  • Created a scheduler script for COMSOL in Python to increase the simulation capacity by 80%.
  • Oversaw 1400 hours of COMSOL simulations with semiconductor and electromagnetic modules.
  • Developed a graphical user interface (GUI) for MATLAB models created by colleagues.
  • Improved an electron diffusion finite element model in Python