I needed a place to share my thoughts. What I'm thinking about now, what I thought about in the past, and what we could think about together.
I study AI safety and security at Microsoft on the AI Red Team, leading research in sociotechnical red teaming and I enjoy building the tools we use to do it.
Also, you should meet my dog, Sparky. He walks around looking like he's about to have a thought but never quite finishes it. This might be a good place for those thoughts too.
Eugenia Kim · AI Safety & Security Researcher
Sparky · Golden Retriever · Thinks about things sometimes
"I learned that if I sit near the treat jar and look sad, the human gives me a treat without me doing the trick. This is what your RLHF papers are about, right?"
"I am perfectly aligned with my human's goals whenever she is holding a piece of cheese. I don't see what the problem is."
The vacuum cleaner. Eugenia vacuums randomly. Sometimes in the morning, sometimes after lunch. There is no schedule. Everything changes. Very uncertain.
The apartment buzzer. Goes off without warning. Could be a delivery, could be a stranger, could be nothing. No way to assess threat level from this distance.
The bodega cat. Friendly to everyone else. Hates me specifically. Unclear what I did. Monitoring the situation.
Leading all foundational research on frontier-model behavior related to psychosocial harms. Designing and engineering open-source automated red-teaming infrastructure. Previously SDE2 building tools for red teaming operations including PyRIT.
Algorithmic analysis of news and social media coverage of mental health topics. Co-authored research with CDC researchers on suicide framing in media.
Analyzed age bias in facial emotion recognition systems. First-authored publication on AI bias mitigation at AIES '21.
GPA: 3.8
Undergraduate research in organic electronics and self-assembly methodologies.
Interested in collaborating on AI safety, red teaming, or psychosocial harms research? Always open to interesting problems.
Email meFind me on these platforms.