Career Profile

An experienced researcher with a demonstrated history of working in research & industry. Skilled in Machine Learning, Deep Learning, NaturalLanguage Processing, Crowdsourcing, Intelligent Human‑Machine Collaboration and computer science. Strong research professional with a Doc‑tor of Philosophy (Ph.D) focused on developing simple and efficient machine learning algorithms that are broadly applicable across a range ofproblem domains including natural language processing and computer vision.

Experiences

Lead Data Scientist

2021 - 2023
Evana AG
  • Developed & demo an End‑to‑End Training Data Platform (TDP) in Azure Cloud Service to centralize all the documents & their corresponding labels in Azure Data‑lake
  • Design and implement document ingestion pipeline using Azure Data Factory
  • Design and implement document classification pipeline to train, deploy, and monitor machine learning models using Azure ML studio

Senior Data Scientist

2017 - 2021
Architrave GmbH
  • Developed & demoed an End‑to‑End pipeline prototype for extracting various data fields from Leased Contracts by down‑streaming & fine‑tuning BERT transformer language model (MLM) in question answering task. (Python, Pytorch, Flask, java script, HTML, MLFlow)
  • Collaborated with all the team members to build an End‑to‑End pipeline for classifying real‑estate assets and extracting data fields from PDF and Excel sheets. (SVM, Python, Flask, Airflow, Docker, AWS, S3, EC2)
  • Designed, implemented, and evaluated the Machine learning-based signature detection application for legal contracts using advanced object detection solutions. (e.g. Fast R‑CNN & Yolo v3)
  • Trained and deployed a machine learning model in production for extracting name entities from documents using bidirectional LSTM (BI‑LSTM) with a bi-directional Conditional Random Field (BI‑CRF) layer. (Keras, Python)
  • Created and presented models for Blueprint image classification using hybrid approaches. (Python, sklearn)

Researcher

2015 - 2017
University of Trento

CogNet Project

  • Building an Intelligent System of Insights and Action for 5G Network Management
  • Applying machine learning to predict the traffic of5G network based on social media content (e.g. Twitter)

Researcher

2014 - 2015
University of Milan

Dante a Teatro e Smart Milano Project

  • Design and implemented an internal crowdsourcing platform for collecting and generating training data
  • Implemented an End‑to‑End question classification pipeline for a closed domain task (food and dining habits in the Middle Ages) in Italian

Researcher

2011 - 2014
University of Trento

IBM jeopardy

  • The project was a research collaboration between IBM and the University of Trento for a deep understanding of question classification tasks and by encoding syntactic similarities of the questions using Tree Kernels

Software Engineer

2011 - 2014
Consolsys Sdn. Bhd
  • Implemented a core application for banking automation solutions.(C#, SQL, Windows, Visual Studio)

Publications

  • Autonomous Crowd‑sourcing through Human‑Machine Collaborative Learning
  • A. Abad, M. Nabi, A. Moschitti
    SigIR 2017, Tokyo Japan
  • Self‑Crowdsourcing for Relation Extraction
  • A. Abad, M.Nabi, A. Moschitti.
    in Proceedings of the Association for Computational Linguistics(ACL’17), Vancouver, Canada, August 2017.
  • Appetitoso: A Search Engine for Restaurant Retrieval based on Dishes
  • G. Barlacchi, A.Abad, E. Rossinelli, A. Moschitti.
    Third Italian Conference on Computational Linguistics, CLiC‑IT 2016 Napoli, Italy, December 2016
  • Taking the Best from the Crowd: Learning Question Passage Classification from Noisy Data
  • A. Abad, A. Moschitti.
    5th Joint Conference on Lexical and Computational Semantics, *SEM 2016, Berlin, Germany, August 2016
  • Distant Supervision for Relation Extraction Using Tree Kernels
  • A. Abad, A. Moschitti.
    6th Italian Information Retrieval Workshop, IIR 2014 Cagliari, Italy, May 2015
  • Creating a Standard for Evaluating Distant Supervision for Relation Extraction
  • A. Abad, A. Moschitti.
    First Italian Conference on Computational Linguistics, CLIC 2014, Pisa, Italy, December 2014
  • A Multi‑task Learning Framework for Time‑continuous Emotion Estimation from Crowd Annotations
  • M. Khomam Abadi, A. Abad, R. Subramanian, N. Rostamzadeh, E. Ricci, J. Varadarajan, N. Sebe
    International ACM Workshop on Crowdsourcing for Multimedia, CrowdMM 2014, Florida, USA, November 2014
  • A Resource for Investigating the Impact of Anaphora and Coreference on Inference
  • A. Abad, L. Bentivogli, I. Dagan, D. Giampiccolo, S. Mirkin, E. Pianta, A. Stern
    LREC, Malta, May 2010

    Skills & Proficiency

    Deep Learning, Transfer Learning, Question Answering, NLP

    AWS, Docker, Airflow

    Flask, REST API

    HTML5, Java script, Semantic UI, CSS

    Python, JAVA, Shell

    Jupyter, Sklearn, Gensim, BERT, Pytorch, NLTK, Pandas, Numpy

    Elastic‑Search, Mysql, MLflow, Git

    https://amzn.to/3LKKy9l