Spotting Temporally Precise, Fine-Grained Events in Video