How to Fix Common Audio/Video Sync Problems

January 3rd, 2013 by Justin Colletti

In the early days of motion picture, audio sync was easy: There wasn’t any. When you’re dealing with silent films, you have plenty of room to play fast and loose with frame rates.

The first hand-cranked cameras used in the industry could shoot footage at rates anywhere from 16 to 18 frames per second; there was no standardization. When the finished silent movies were screened for audiences, they were often played back considerably faster than that, at rates over 20 frames per second.

This system allowed the studios to save money on film stock, and let the movie theaters earn more money by turning audiences over at a healthy clip.

But with the birth of the “talkies”, we quickly started to standardize our frame rates to make accommodations for audio. Throw sound into the picture, and all of a sudden people start to notice when Charlie Chaplin starts sounding like Mickey Mouse.

Video Frame Rates for Audio People

Even when sound was first added to picture, workflow remained fairly straightforward for a little while.

Photo courtesy of University of Houston Libraries.

In the U.S., we began to standardize the speed of film at 24 frames per second in the mid 1920s. This allowed for smooth motion capture and reliable audio sync, and it worked nicely with the 60Hz AC frequency coming out of our power outlets.

6 Comments on How to Fix Common Audio/Video Sync Problems

Jonathan S. Abrams
January 5, 2013 at 7:29 pm (11 years ago)

Regarding The Coming of Color:

The difference between the Black and White 30 fps and Color 29.97 fps (ignoring non-drop and drop frame for the moment) was deemed necessary to maintain compatibility with black and white televisions when color broadcasts were transmitted.

The argument when color was developed was that the frequency of the color
subcarrier would create beating with the sound subcarrier that would be visible on some black and white television sets. The sound carrier, however, is frequency modulated. Therefore, beating would have only occurred at a specific frequency. A GE engineer determined that if the frame rate was dropped by .1% (from 30 to 29.97), that the beating would be reduced, and compatibility would be maintained.

As a result of this change, 60Hz AC cannot leak into a video signal, or bars appear to roll through the picture every 17 seconds. Technicians I have worked with over the years describe this phenomenon as a video ground hum.

The equation that has driven audio and video engineers mad by creating this
non-whole number for video sync is: [(number of scanning lines per frame•frames per second)/2]•455=color subcarrier frequency.

When the appropriate numbers are inserted, it becomes: [(525•29.97)/2]•455 –> (15,734.25/2)•455 –> 7,867.125•455 –> 3,579,542

The NTSC adopted this equation, and could not change the lines per frame (or all TV sets would be obsolete), so they changed the frame rate. The idea behind the number 455 is frequency interleaving of the video and color signals, which would minimize interference between brightness (luminance) and color (chrominance) data. The number 455 produces a result that is an even number of half the line rate.

Maintaining compatibility with some black and white sets when audio at a specific frequency was transmitted has created synchronization headaches ever since color video was introduced.

Regarding Sample Rates for CDs:

The sampling rate of 44.1kHz was chosen for CDs because the number of used lines in an NTSC picture frame will divide evenly into 44,100. The total line count in NTSC is 525, and 35 of them are blank. That leaves 490 lines for the picture. 44,100/30 yields 1470 samples per frame. With 490 lines per frame, the samples per line is 1470/490, or 3.

Regarding Timecode:

In NTSC black and white timecode (30fps), the total number of frames per hour is 108,000 (30fps•60sec•60min). When the frame rate is reduced to 29.97 for NTSC color, there are .03 fewer fps. This causes the time being displayed on a timecode reader to be slightly slower than realtime. The math is (30-29.97)•60sec•60min=108 frames.

To make the 29.97 fps timecode match elapsed time, two (2) frames are dropped at every minute that does not contain a zero (00,10,20,30,40,50). The remaining number of minutes (54) are each missing two frames from the count, and 54•2=108, which compensates for the difference. Many readers and generators indicate drop frame timecode by using semicolons instead of colons to separate the hours, minutes, seconds, and frames numbers.

Most of this information is part of a larger paper I wrote, which is available at https://files.nyu.edu/jsa226/public/timecode.pdf.
mp4guy
February 25, 2014 at 10:27 am (10 years ago)

Great post of audio video sync problems..
If you wanna fix it after encocoding
Here are some ways to fix it

http://newbrotricks.blogspot.in/2014/02/blog-post_3092.html or

http://lifehacker.com/5910943/fix-out+of+sync-audio-in-vlc-with-a-keyboard-shortcut
Ryan Petrus
October 7, 2014 at 3:32 pm (10 years ago)

For sync up multiple camera angles, I’d suggest trying out PluralEyes (http://pluraleyes.com).

And if you’re just shooting on 1 camera, check out DreamSync (http://dreamsyncapp.com). It’s not as cumbersome or expensive as PluralEyes and gets the job done for smaller quick projects.

There’s an app called DreamSync, a standalone application that’s built for the novice user as well as professionals. It syncs your footage and audio into one single clip so that it can then be imported into applications like iMovie, Windows Movie Maker, Adobe Premiere, Final Cut X, or any other editing suite.

http://dreamsyncapp.com

Both apps are effective depending on your editing workflow and how much (or little) time you want to dedicate to learning another interface for syncing audio/video footage.
Scritti Politti
February 5, 2015 at 4:42 am (9 years ago)

Interlacing was not done to reduce flicker. It was because they didn’t think a raster could draw an entire frame fast enough. That turned out to be wrong, but here we are with our new digital “advanced” TV system still dealing with this pathetic hack.

Not to mention the bullshit non-integer frame rates.
keyboardes
January 20, 2016 at 7:11 am (8 years ago)

Avdshare
Video Converter will take
change MP4 file frame rate as an example and it can also serve to change AVCHD,
MTS, M2TS, MXF, XAVC, ProRes, MPG, AVI, FLV, MOV, WMV, MKV and almost all video
format frame rates
Gerald Ncube
April 8, 2017 at 5:18 pm (7 years ago)

Hi

I am having a very stressful problem, I am making a music video and my problem is when I shoot the video I use a cd playing it with a cd player then capture that sound together with the video footage, And when I am editing I then use the camera sound and sync it to the original cd sound but my problem is it sync and match in the begging of the song but as it goes the video becomes faster than the original cd sound, how do I fix that please help. I did other songs they all fine but now I cant just get it right.

Gerald

Related posts:

6 Comments on How to Fix Common Audio/Video Sync Problems

Jonathan S. Abrams

mp4guy

Ryan Petrus

Scritti Politti

keyboardes

Gerald Ncube