Design the ML system that detects harmful content on YouTube — videos that violate policy (violence, hate speech, CSAM, misinformation). YouTube receives 500 hours of video per minute. Walk me through how you'd detect violations at that scale, how you'd handle the human review pipeline, and how you'd measure whether the system is working.
You mentioned misinformation as a violation category — but misinformation is much harder to classify than graphic violence. A video claiming a vaccine causes harm might be false, true, contextually misleading, or legitimate medical debate. How do you build an ML system for something that requires nuanced factual judgment?
Design the human review and appeals system. How do reviewers interact with ML, and how do appeal outcomes improve the system?
How do you handle adversarial creators who adapt once they learn what the system catches?
tldr
Harmful-content detection is a policy-routing system, not a single classifier: multimodal extraction, policy-specific risk scores, action bands, human review, appeals, post-publish monitoring, and adversarial response. Calibrate the precision/recall tradeoff by policy severity. For misinformation, ML routes content and matches it against known claims; humans and domain experts handle novel or contested judgments. Reviewer operations, appeals, and evasion response are part of the core ML system.
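To make the mapping from policy-specific risk scores to action bands concrete, here is a minimal Python sketch; the policy names, threshold values, and the `route` function are illustrative assumptions, not part of the original answer. The design choice it shows is that severity calibrates the bands: the highest-severity policies auto-action at lower scores (favoring recall), while lower-severity, judgment-heavy policies such as misinformation send more traffic to human review (favoring precision).

```python
from enum import Enum

class Action(Enum):
    ALLOW = 0
    LIMIT_DISTRIBUTION = 1
    HUMAN_REVIEW = 2
    REMOVE = 3

# Hypothetical per-policy thresholds: (auto_remove, human_review, limit_distribution).
# Higher-severity policies act at lower scores; judgment-heavy policies lean on review.
POLICY_BANDS = {
    "csam":           (0.70, 0.30, 0.10),
    "violence":       (0.90, 0.60, 0.30),
    "hate_speech":    (0.92, 0.65, 0.35),
    "misinformation": (0.98, 0.70, 0.40),
}

def route(policy_scores: dict[str, float]) -> tuple[Action, str | None]:
    """Map calibrated per-policy risk scores to the most severe applicable action."""
    decision, trigger = Action.ALLOW, None
    for policy, score in policy_scores.items():
        remove_t, review_t, limit_t = POLICY_BANDS[policy]
        if score >= remove_t:
            action = Action.REMOVE
        elif score >= review_t:
            action = Action.HUMAN_REVIEW
        elif score >= limit_t:
            action = Action.LIMIT_DISTRIBUTION
        else:
            action = Action.ALLOW
        # Keep the strictest action triggered by any single policy.
        if action.value > decision.value:
            decision, trigger = action, policy
    return decision, trigger

# Example: high violence score but low scores elsewhere -> queued for human review.
print(route({"csam": 0.01, "violence": 0.72, "hate_speech": 0.10, "misinformation": 0.05}))
```

In practice the thresholds would be set from calibrated score distributions and reviewer capacity rather than hand-picked constants, but the band structure is what makes severity-specific precision/recall tradeoffs explicit.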
follow-up
- How would you design the appeals system — when a creator disputes a removal, how does ML assist the human reviewer, and how do you use appeal outcomes to improve the classifier?
- Coordinated inauthentic behavior (bot networks artificially boosting content) is often harder to detect than the content itself. How would you approach detecting it?
- How do you handle the cold start problem for a brand new channel — you have no history to assess, and bad actors specifically create new channels to avoid history-based signals?