I am a Data Scientist working under Dr. Brenda Curtis at the National Institute on Drug Abuse (NIDA). Formerly, I was a Senior Data Scientist for the World Well-Being Project at the Positive Psychology Center in the University of Pennsylvania. I am also a PhD student in Computer Science at the University of Pennsylvania working under H. Andrew Schwartz and Lyle Ungar. Research interests include big data analysis of language on social media and how this can be used to predict and gain insight into health and risky behavior.

Publications

Tweet Classification without the Tweet: An Empirical Examination of User versus Document Attributes. Veronica Lynn, Salvatore Giorgi, Niranjan Balasubramanian and H. Andrew Schwartz. NLP+CSS 2019. PDF Poster Bib
Suicide Risk Assessment with Multi-level Dual-Context Language and BERT. Matthew Matero, Akash Idnani, Youngseo Son, Salvatore Giorgi, Huy Vu, Mohammad Zamani, Parth Limbachiya, Sharath Chandra Guntuku and H. Andrew Schwartz. CLPsych 2019. PDF Poster Bib
The Remarkable Benefit of User-Level Aggregation for Lexical-based Population-Level Predictions. Salvatore Giorgi, Daniel Preotiuc-Pietro, Anneke Buffone, Daniel Rieman, Lyle H. Ungar and H. Andrew Schwartz. EMNLP 2018. PDF Supplement Data Poster Bib
Residualized Factor Adaptation for Community Social Media Prediction Tasks. Mohammadzaman Zamani, H. Andrew Schwartz, Veronica Lynn, Salvatore Giorgi and Niranjan Balasubramanian. EMNLP 2018. PDF Data Bib
Primal World Beliefs. Jeremy Clifton, Joshua D. Baker, Crystal L. Park, David B. Yaden, Alicia Clifton, Paolo Terni, Jessica L. Miller, Guang Zeng, Salvatore Giorgi, H. Andrew Schwartz and Martin E. P. Seligman. Psychological Assessment 2018. PDF Supplement Bib
Current and Future Psychological Health Prediction using Language and Socio-Demographics of Children for the CLPysch 2018 Shared Task. Sharath Chandra Guntuku, Salvatore Giorgi and Lyle H. Ungar. CLPSYCH 2018. PDF Bib
More Evidence that Twitter Language Predicts Heart Disease: A Response and Replication. Johannes Eichstaedt, H. Andrew Schwartz, Salvatore Giorgi, Margaret L. Kern, Gregory Park , Maarten Sap, Darwin R. Labarthe, Emily E. Larson, Martin Seligman, and Lyle H. Ungar. PsyArXiv 2018. PDF Data Bib
Can Twitter be used to predict county excessive alcohol consumption rates? Brenda Curtis, Salvatore Giorgi, Anneke E. K. Buffone, Lyle H. Ungar, Robert D. Ashford, Jessie Hemmons, Dan Summers, Casey Hamilton, H. Andrew Schwartz. PLOSONE 2018. PDF Data Bib
Modeling and Visualizing Locus of Control with Facebook Language. Kokil Jaidka, Anneke Buffone, Salvatore Giorgi, Johannes Eichstaedt, Masoud Rouhizadeh, and Lyle Ungar. Proceedings of the International AAAI Conference on Web and Social Media 2018. PDF Bib
DLATK: Differential Language Analysis ToolKit. H. Andrew Schwartz, Salvatore Giorgi, Maarten Sap, Patrick Crutchley, Johannes C. Eichstaedt, and Lyle Ungar. EMNLP 2017. PDF Code Poster Bib
On the Distribution of Lexical Features at Multiple Levels of Analysis. Fatemeh Almodaresi, Lyle Ungar, Vivek Kulkarni, M. Zakeri, Salvatore Giorgi. and H. Andrew Schwartz. ACL 2017. PDF Bib
Recognizing Pathogenic Empathy in Social Media. Muhammad Abdul-Mageed, Anneke Buffone, Hao Peng, Salvatore Giorgi, Johannes Eichstaedt and Lyle Ungar. ICWSM 2017. PDF Bib
Does well-being translate on Twitter? A comparative evaluation of English and Spanish well-being lexica. Laura Smith, Salvatore Giorgi, Rishi Solanki, Johannes Eichstaedt, H. Andrew, Schwartz, Muhammad Abdul-Mageed, Anneke Buffone and Lyle Ungar. EMNLP 2016. PDF Data Poster Bib
Real men don't say "cute": Using automatic language analysis to isolate inaccurate aspects of stereotypes. Jordan Carpenter, Daniel Preotiuc-Pietro, Lucie Flekova, Salvatore Giorgi, Courtney Hagan, Margaret Kern, Anneke Buffone, Lyle Ungar and Martin Seligman. SPSS 2016. PDF Supplement Bib
Studying the Dark Triad of Personality using Twitter Behavior. Daniel Preotiuc-Pietro, Jordan Carpenter, Salvatore Giorgi and Lyle Ungar. CIKM 2016. PDF Bib
Analyzing Biases in Human Perception of User Age and Gender from Text. Lucie Flekova, Jordan Carpenter, Salvatore Giorgi, Lyle Ungar, and Daniel Preotiuc-Pietro. ACL 2016. PDF Poster Bib
Analyzing crowdsourced assessment of user traits through Twitter posts. Lucie Flekova, Daniel Preotiuc-Pietro, Jordan Carpenter, Salvatore Giorgi, and Lyle Ungar. HCOMP 2015. PDF Supplement Poster Bib
Design and Evaluation of a Web-based Virtual Open Laboratory Teaching Assistant (VOLTA) for Circuits Laboratory Firdous Saleheen, Salvatore Giorgi, Zachary Smith, Joseph Picone and Chang-Hee Won. ASEE Annual Conference and Exposition 2015. PDF Bib
Adaptive Neural Replication and Resilient Control Despite Malicious Attacks. Salvatore Giorgi, Firdous Saleheen, Frank Ferrese and Chang-Hee Won. 5th International Symposium on Resilient Control Systems 2012. PDF Bib



Unigrams with the highest Pearson correlation to each of the dark triad traits.

Software and Data

Differential Language Analysis ToolKit (DLATK)
DLATK is an end to end human text analysis package, specifically suited for social media and social scientific applications. It is written in Python 3 and developed by the World Well-Being Project at the University of Pennsylvania and Stony Brook University.
County Tweet Lexical Bank
County level word and topic loading derived from a 10% Twitter sample from 2009-2015. Anonymized linguistic features extracted from over 1.5 billion English U.S County mapped tweets.
TwitterMySQL
TwitterMySQL is a Python library developed by the World Well-Being Project to pull tweets from the Twitter API and insert them into MySQL.
reddit-crawler-mysql
Reddit crawler with MySQL backend
flask-twitter-predictions
Flask app for running age, gender and (fake) personality predictions from Twitter data.
Map of Twitter Hashtags in Pennsylvania
Community structure of Twitter in Pennsylvania based on users' hashtag use.



Teaching

ENGR 1101: Intro to Engineering
The purpose of ENGR 1101 is to provide you with an understanding of the study and practice associated with civil, electrical and mechanical engineering and technology disciplines. For the electrical section, you will learn several key concepts such as programming in C/C++, hardware design using breadboards and electrical components, interfacing software with hardware using microcontrollers (Arduino), and an introduction to circuit analysis. The last part of this course will involve the Hovercraft design in which you will learn the techniques of soldering and how to efficiently design a prototype in which it is fully functional using an iPad application as the remote controller.

ENGR 4296: Senior Design Project II
Team-oriented engineering system design problems of various types. Topics proposed and orally presented by students in the initial stage of the course sequence. At completion, the project is demonstrated during an oral presentation and a final written report.

ECE 3613: Microprocessor Systems Laboratory
This course provides hands-on experience in assembly language programming for Intel i186EX 16-bit microprocessor and its hardware system implementation. The laboratory assignments utilize 80X86 microprocessor simulations using Emu8086 (www.emu8086.com) and hardware experiments with the FlashLite186 microcomputer by JK Microsystems (www.jkmicro.com) with processor bus logic and output signal measurements using the TechTools DigiView logic analyzer.