Hate Speech Diffusion Visualization: IR Coursework
An interactive visualization of the diffusion of hate comments on a sample Twitter network, covering inter-user diffusion (static) and topic-wise temporal diffusion (real-time). Project Link
Reviewer: Journal of Open Source Software (February 2020 - Present) Packages Reviewed
Mentor - Student Code In 2020, osDFS (June 2020 - August 2020) Project Link
Mentor: Google Code In 2019, Systers (November 2019 - February 2020) Project Link
Mentor - Google Summer of Code, Systers (December 2017 - August 2018) Project Link
Maintainer - First Contributors (January 2018 - Present) Project Link
Mentor - Girls Script Summer of Code, Project BENJI (May 2018 - August 2018) Project Link
Mentor - Learn IT Girl 3rd Edition (October 2017 - January 2018) Project Link
Topic Aware Hate Diffusion on Twitter (PhD Student, LCS2):
Worked on modelling hate speech genesis and diffusion over Twitter, combining signals from the user's profile and external influences. Performed a detailed comparison of the proposed approach against a suite of baselines. Conducted ablation studies on various hate signals, as well as hyper-parameter tuning. Contributed to data collection and literature review. (Paper under review for ICDE 2021). Code Link
Zero-shot Virality Prediction (RA, LCS2):
Worked on three baselines for virality prediction on Reddit datasets, comparing them against the proposed model across various metrics. Full paper accepted in the Research Track at KDD'20 (listed under publications). Code Link
Collusive User Detection (RA, LCS2):
Explored the detection of fraudulent collusive user groups that provide fake online reviews for products, investigating community detection algorithms and matrix-factorization-based approaches for unsupervised/semi-supervised collusion detection.
Datasets on Women Empowerment (Policy Intern, WERP India):
Learned the ropes of social policy research and review through a gender analysis of women's and girls' empowerment in India. Collected and streamed data on the union territories for the same.
Fuzzy Taxonomy Extraction on Big Data (Undergraduate Thesis, Jamia Millia Islamia):
Set up a Hadoop cluster, implemented an unsupervised topic modelling algorithm to obtain concepts, and compared performance on a single system vs. the cluster. The project aimed to extend the current unsupervised fuzzy taxonomy algorithm to work on a distributed system. A part of the work was accepted as a short paper at the 9th IC3 2016 (listed under publications). Code Link
Worked on an analytics project to help developers with better tooling. The work included data collection, cleaning, and modelling using a graph-based approach. We used various recommendation-system techniques to improve developer analytics. I contributed to the work on Graph Databases, License Conflict Resolution, Package Recommendations using Probabilistic Graphical Models, and Matrix Factorization.
Initially, as an intern, I completed PoCs for License and CVE analysis of NPM data and implemented baseline code for data entity models for ingestion into the graph DB.
In addition, I contributed to data engineering and testing jobs for the project and fixed issues across the analytics platform's backend. When I got the opportunity, I represented the product at meetups and conferences. See Project
Snakes & Ladders: InterProcess Communication Coursework
A two-player Snakes and Ladders game, developed to understand the concepts of inter-process communication using the shared memory paradigm. The graphics were developed using ncurses. Code Link
Hostel Information Management System: DBMS Coursework
A dynamic web portal for information management of the Jamia Girls Hostel, developed as part of a DBMS project. It had three levels of views, with varying degrees of administration for students and wardens. Code Link
Terminal Based Client-Server Chat System: Computer Networking Coursework
Developed a client-server chat system that uses TCP/IP to communicate within a given network and on the same system. It supported unicast, multicast, and broadcast messaging, using socket and thread programming in C.
Diamond Pricing Feature Visualization: DecisionStats Internship
A project on pre-model visualization. Wrote scripts to obtain the data, clean it, and then visualise it to understand the relationships among the potential factors affecting a diamond's price.
In addition, I developed code in Python, SAS, and R for the course material offered at the summer school on analytics. I also helped curate the initial material for the company website and provided inputs for the UI designs. Code Link
ZAP Scripted Plugins: Mozilla WoS 2014
Contributed to the initial development of the OWASP ZAP scripted plugin to allow commands from other JVMs to be run in ZAP. Learned the ropes of open source, version control, and internet security. (Java)
Added as part of the events section of the official Jamia website. The portal provided detailed information about the Faculty Development Program on Robotics, Mechanical Department, JMI. Code Link
A catalog for Jamia's female students, listing the PGs and hotels around the campus. Developed using Jekyll and Google Maps, and hosted on GitHub. Code Link
An application designed to keep track of your success at tasks over a given period of time, built as part of the International Women Hackathon 2015. Code Link
A solution for Water Management for Policy Hackathon 2015. Code Link
NGO Management Application:
Developed an online application for an NGO to manage its projects all over the country, as part of the GHCI Hackathon 2015. Code Link
License analysis on the BigQuery GitHub Archive
Using Google BigQuery, I tried to find the most commonly used licenses across GitHub repositories and how they fare for the top GitHub languages. Code Link