Fatemeh Rahimi

Programming Languages	Python
Python Libraries	PyTorch, Keras, MLflow, Hugging Face Transformers, spaCy, NLTK, NumPy, scikit-learn, pandas, Matplotlib, Jupyter notebooks
Databases	DBMS: MySQL, SQL
Version control	Git, GitHub, GitLab
Operating system	Windows, Linux
Writing Tools	Microsoft Office, LaTeX
Workflow	Agile Development & Scrum
Others	Slack, Trello

Programming Languages	Java, C, C++, PHP (CodeIgniter, Laravel), HTML, CSS, JavaScript (Bootstrap), MASM Assembly
Python Libraries	pytest, py2neo, Selenium, PyMongo
Cloud	Docker
Programming platforms	Android
Databases	DBMS: PostgreSQL, SQL Server (familiar); NoSQL: Neo4j, MongoDB
Graphic Design	Adobe Fireworks, Adobe Photoshop, Camtasia
Others	Jibble

Programming Languages	MATLAB, R
Ontology	SPARQL, Protégé
HDL	Verilog

MTLV: A Library for Building Multi-task Learning Architectures March 2020 - April 2021

Dalhousie University, Halifax, Canada

Master’s thesis
Developed a CLI tool for exploring four different multi-task learning architectures.
Used pre-trained language models such as BERT, BioBERT, and BlueBERT.
Proposed a novel approach to clustering tasks.
Published at DocEng-2021

News Multi-label classification with BERT-family models Jan 2021 – Feb 2021

Dalhousie University, Halifax, Canada

Developed a CLI tool to train different BERT-family models on real-world data.
Used: Hugging Face Transformers, PyTorch, pandas, scikit-learn, NumPy.

Learn Faster and Forget Slower via FAST and Stable Task Adaptation for pre-trained language models May 2020 - now

Dalhousie University, Halifax, Canada

Improving the fine-tuning algorithm for pre-trained language models.
Developing with Python, PyTorch, Hugging Face libraries, and MLflow.
Team project (Collaboration with a PhD student)

Information Retrieval of covid-19 academic papers using language models April 2020 - now

Dalhousie University, Halifax, Canada

Dataset is from the TREC-COVID competition
Team project (collaboration with 2 MCS students)
Using unsupervised fine-tuning and a Siamese network to train a T5 model to find papers related to a query.

Query-focused Extractive Summarization using pre-trained models January 2020 - April 2020

Course Project for Natural Language Processing (NLP) Course
Team Project (collaboration with a PhD student)
Used different scoring functions to extract the most important sentences of a document.
Download Project Report

Abstract: In the process of writing a research paper, researchers often spend a lot of time organizing and summarizing previous work related to the research. To help with this problem, the proposal of this project is to use Query-focused Extractive Summarization algorithms to produce relevant highlights in related research. For this project, the problem of insufficient labeled data was solved by using pre-trained models such as BERT and BioBERT to produce accurate representations of words. To measure the validity of the approaches, they were applied to the BioASQ dataset of medical articles and obtained results consistent with each other using Cosine similarity and Euclidean distance, each with several pre-trained models. One of the challenges of this project was that producing the embeddings of a lot of sentences with a pre-trained model is a very time-consuming task, so a scalable tool was developed to efficiently compute token embeddings for a variety of pre-trained models.
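
The scoring step described in the abstract can be illustrated with a minimal sketch. It assumes each sentence and the query have already been turned into fixed-length vectors; the toy three-dimensional vectors and the function names below are hypothetical stand-ins for real model embeddings, not the project's actual code.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def euclidean_distance(a, b):
    """Straight-line distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def top_sentences(query_vec, sentence_vecs, k=1, metric="cosine"):
    """Return the indices of the k sentences closest to the query."""
    if metric == "cosine":
        # Higher similarity is better, so negate it to sort ascending.
        scored = [(-cosine_similarity(query_vec, v), i)
                  for i, v in enumerate(sentence_vecs)]
    elif metric == "euclidean":
        # Lower distance is better.
        scored = [(euclidean_distance(query_vec, v), i)
                  for i, v in enumerate(sentence_vecs)]
    else:
        raise ValueError(f"unknown metric: {metric}")
    scored.sort()
    return [i for _, i in scored[:k]]


# Hypothetical toy vectors standing in for embeddings from a pre-trained model.
query = [1.0, 0.0, 1.0]
sentences = [[0.9, 0.1, 1.1], [0.0, 1.0, 0.0], [1.0, 1.0, 1.0]]

by_cosine = top_sentences(query, sentences, k=2, metric="cosine")
by_euclidean = top_sentences(query, sentences, k=2, metric="euclidean")
```

On this toy input both metrics select the same two sentences, which mirrors the kind of agreement between scoring functions that the abstract reports.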

Classification Of Imbalanced Dataset Using BERT Embeddings May 2019 - Aug 2019

Course Project for Deep Learning Course
Team Project (collaboration with 2 MCS students)
Used BERT embeddings to classify the type of harassment in each tweet of an imbalanced Twitter dataset.
Download Project Report

Abstract: Online harassment is becoming prevalent as a specific type of communication on Twitter. Considering the huge amount of user-generated tweets each day, the problem of detecting and possibly limiting these contents automatically in real-time is becoming a fundamental problem. But often real-world datasets are imbalanced, consisting predominantly of “normal” examples and fewer “abnormal” ones, which causes the learning algorithm to simply generate a trivial classifier that classifies every example as the majority class. To tackle this problem, we use SMOTE to oversample the embeddings of the minority classes, where the embeddings are obtained from the BERT pretrained language model. Finally, we use these oversampled embeddings to train our bi-directional LSTM classifier model to categorize the tweets into four classes: non-harassment, sexual harassment, physical harassment and indirect harassment. Our experiments show that using SMOTE on the top layer representations of BERT improves the F1 score significantly more than merely adjusting the class weights.
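
The core idea of SMOTE, interpolating between a minority-class point and one of its nearest minority-class neighbours, can be sketched in a few lines. This is a simplified illustration in pure Python, not the implementation used in the project; the function name, parameters, and the toy two-dimensional points (standing in for BERT embeddings) are assumptions.

```python
import math
import random


def smote_oversample(minority, n_new, k=2, seed=0):
    """Create n_new synthetic points from a list of minority-class vectors.

    Each synthetic point lies on the line segment between a randomly chosen
    minority point and one of its k nearest minority neighbours.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Nearest minority neighbours of the base point, excluding the base itself.
        others = [p for p in minority if p is not base]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        # Random position between the base (gap = 0) and the neighbour (gap = 1).
        gap = rng.random()
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Hypothetical toy points standing in for minority-class embeddings.
minority_points = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
new_points = smote_oversample(minority_points, n_new=5, k=2, seed=42)
```

Because every synthetic point is a convex combination of two real minority points, the new samples stay inside the region the minority class already occupies rather than being arbitrary noise.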

Visual Analysis Of Harassment Classification In Twitter May 2019 - Aug 2019

Course Project for Deep Learning Course
Team Project (Collaboration with 2 MCS students)
Developed a visualization system using D3 to demonstrate the semantic similarity of words.
Download Project Report
Although we used the same dataset for this project and the deep learning course project, the methodologies and purposes of the two projects are completely different.

Abstract: In our project, we develop a Deep Neural Network model to classify the tweets based on four classes: non-harassment, sexual harassment, physical harassment and indirect harassment. We then use Deep SHAP, a unified approach that explains the output of any machine learning model, to interpret the predictions of our classifier. We demonstrate the semantic similarity of words in the tweet by visualizing the embeddings in low dimensions using t-SNE. To provide quick, high-level information about the similarities and anomalies between the categories, the final predictions of the model are summarized into different styles of TreeMaps.

Kitchen safety IOT project Dec 2017

Senior Project for Undergrad at Shiraz University
Web server: Laravel framework
Website: CodeIgniter framework, Bootstrap
Download Project Report
GitHub

Description: This project assists people who work in large kitchens, such as those of grand hotels, where temperature and heat are critical. Such kitchens have giant refrigerators, freezers, and several storage areas to look after. To make sure that each section stays in an appropriate condition and that the systems are functioning correctly, the project monitors the environment to prevent harm to the workers. A temperature and humidity sensor is located in the kitchen. The readings are sent through a Wi-Fi module to a web server, which stores the data. Users can then view this information on the project website. The website also lets the user set a threshold range that the kitchen's temperature should stay within. If a reading falls outside the given threshold, the user is immediately notified by email. The history of the kitchen's temperature is also available to view at any time on the website.
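
The threshold check described above can be sketched as follows. The original system was built with Laravel and CodeIgniter; this Python version is only an illustration, and the function names, timestamps, and temperature values are hypothetical. Sending the email itself is left out.

```python
def out_of_range(value, low, high):
    """True when a reading falls outside the user-defined threshold range."""
    return value < low or value > high


def readings_to_alert(readings, low, high):
    """Return the (timestamp, value) pairs that should trigger a notification."""
    return [(t, v) for t, v in readings if out_of_range(v, low, high)]


# Hypothetical stored history of (time, temperature in degrees Celsius).
history = [("12:00", 4.0), ("12:05", 9.5), ("12:10", 3.8), ("12:15", 1.2)]
alerts = readings_to_alert(history, low=2.0, high=8.0)
```

With the range set to 2.0 to 8.0, the readings at 12:05 and 12:15 are flagged, and each would result in one email to the user.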

Client and server project Jul 2017

Course project for Microprocessor Lab
Introduced to serial programming and to sending packets from client to server and vice versa.
Developed using Python (server) and C (client).

Compiler for python Jan 2017

Course project for Compiler
Developed using Yacc and Lex.

Opportunistic network basic simulation project Jan 2017

Course project for Concurrency
Introduced to monitoring moving nodes that send messages to one another at random.
Developed using Python.

Degree overview project Jun 2016

Course project for Software Engineering
Introduced to modeling and designing ERD, DFD, and FHD.
Developed using Python.
Download prototype presentation (in Persian)
Download project documentation (in Persian)
Download final presentation (in Persian)

File manipulation simulation Jul 2015

Course project for Assembly.
Designed to work with system calls.
Developed using assembly 8086 - 32 bit.

CMD simulator Jan 2013

Course project for Fundamentals of Computer Programming.
Got familiar with advanced programming skills in Python.
Developed a simulation of the commands of CMD.

Hi! I'm Fatemeh

I am currently working as a senior applied research scientist (NLP) at Pythonic AI.

Who Am I?

Here are some of my overall skills

Research

Data

Skills

My Technical Skills

Professional

Intermediate

Familiar

Education

M.S. Computer Science May 2019 – April 2021

B.S. Software Engineering Sep 2013 – Feb 2018

Work Experience

Senior NLP Scientist Feb 2022 – Present

Applied Research Scientist - NLP May 2021 – Jan 2022

Applied Research Scientist - NLP Nov 2020 – Apr 2021

NLP Research Assistant Mar 2019 – Apr 2021

Data Analyst Aug 2018 – Feb 2019

Linux Instructor Jun 2015 – Aug 2016

Teaching Experience

Practical Data Science Teaching Assistant - CSCI 1109

Computer C Programming - ENGM1081

Teaching Assistant at Learning Centre May 2019 – Aug 2020

Databases Lab Assistant Feb 2017 – May 2017

Databases Course Teaching Assistant Sep 2016 – Jan 2017

Projects

MTLV: A Library for Building Multi-task Learning Architectures March 2020 - April 2021

News Multi-label classification with BERT-family models Jan 2021 – Feb 2021

Learn Faster and Forget Slower via FAST and Stable Task Adaptation for pre-trained language models May 2020 - now

Information Retrieval of covid-19 academic papers using language models April 2020 - now

Query-focused Extractive Summarization using pre-trained models January 2020 - April 2020

Classification Of Imbalanced Dataset Using BERT Embeddings May 2019 - Aug 2019

Visual Analysis Of Harassment Classification In Twitter May 2019 - Aug 2019

Kitchen safety IOT project Dec 2017

Client and server project Jul 2017

Compiler for python Jan 2017

Opportunistic network basic simulation project Jan 2017

Degree overview project Jun 2016

File manipulation simulation Jul 2015

CMD simulator Jan 2013

Presentations

Women in Machine Learning and Data Science (WiMLDS), Halifax Chapter Feb 2020

Poster presentation Nov 2019

Awards

Awarded the support for attending virtual Grace Hopper Celebration (vGHC) 2020 Sep 2020

2nd place in Diversity and inclusion hackathon Feb 2020

Awarded the support for attending CAN-CWiC 2019 Nov 2019

Parya Scholarship Nov 2019

