I made a website that shows how different models respond when forced to be very reductive about difficult moral questions.

https://llm-morality.deno.dev/

I came up with the questions, trying to find things that might trip them up. There are some cool avenues for exploration on top of this, but I don’t have time to work on them now. Let me know if you see or come up with anything similar!