I made a website that shows how different models respond when forced to be very reductive about difficult moral questions.
https://llm-morality.deno.dev/
I came up with the questions, trying to find things that might trip them up. There are some cool avenues for exploration on top of this, but I don’t have time to work on them now. Let me know if you see or come up with anything similar!