Debugging misaligned completions with sparse-autoencoder latent attribution alignment.openai.com 1 points by rd 4 hours ago