In the process of working on my TiVo to Video Podcast stuff I finally got to the point where I wanted to do something about commercials. The ideal solution would be callable from ruby or at least the command line so that it can be used in an automated manner. After a few weeks or research and playing around I found a workable solution that required me writing some code.
Early on I made the design decision to put chapter markers in around the commercials rather than cut them out. The main reason for this is “What if the detection is wrong?” For a 30 minute show, you’d end up missing more than half it in a worse case scenario. Since its easy to jump ahead to the next chapter on my iPhone and iPad this seemed like a good decision.
Here’s the three most important facts I learned in my research…
- Commercial detection: No question, comskip is the right tool to use here. The downside is that by default its a windows-only command line tool. It works perfectly under wine, so that mitigates it. Right now I use it via wine, but the source is available, so in the long run it would be good to have a native linux binary to call. Comskip creates a variety of output formats, so I picked one that seemed to be the easiest to work with.
- There is no good command line tool to add chapters to an existing mp4/m4v file: I dug around and found a lot of potential solutions, but the all were either not on linux, couldn’t take the resultant files that comskip spit out, or just not a good fit for what I was doing.
- The MP4v2 library had primitives for adding the chapters: From this point forward, it was just writing some code that did exactly what I wanted.
The code below expects three arguments: 1) The video file to work on, 2) The chapter file output from comskip in ZoomPlayer chapter format, 3) and the total length in seconds of the video file. The last one I might be able to remove once I have more brain time to devote to this.
#include <fstream> #include <string> #include <boost/lexical_cast.hpp> #include <boost/regex.hpp> #include <mp4v2/mp4v2.h> // Compile with something like: g++ AddChapterInfo.cpp -o AddChapterInfo -lmp4v2 -lboost_regex int main(int argc, char *argv[]) { char *m4vfilename = argv[1]; char *chapfilename = argv[2]; uint32_t total_length = boost::lexical_cast<uint32_t>(argv[3]); std::ifstream chapfile(chapfilename); MP4FileHandle m4vfile = MP4Modify(m4vfilename); // Add the chapter track, have it reference the first track // (should be the video) and set the "clock ticks per second" to 1. // (We may want to set that to 1000 to go into milliseconds.) MP4TrackId chapter_track = MP4AddChapterTextTrack(m4vfile, 1, 1000); boost::regex chpre("^AddChapterBySecond\\((\\d+),"); boost::smatch rem; std::string s; uint32_t last_time = 0; while (getline(chapfile, s)) { if (boost::regex_search(s, rem, chpre)) { uint32_t t = boost::lexical_cast<int>(rem[1]) * 1000; if (t > 0) { MP4AddChapter(m4vfile, chapter_track, t - last_time); last_time = t; } } } if (total_length - last_time > 0) { MP4AddChapter(m4vfile, chapter_track, total_length - last_time); } MP4Close(m4vfile); MP4Optimize(m4vfilename); return 0; } |
I’ve been using this code for over two weeks straight and has been operating perfectly, but obviously this code could be made a lot more robust, especially in the areas of error handling. I’ve only run into issues when comskip guesses commercials wrong, which is only payoff for putting chapters in instead of nuking the commercials all together.
In the long run, I should either write and release a generic tool that helps the next poor sap like me or work on using swig bindings to mp4v2 so I could just do the calls in ruby.
One thought on “Addings chapters to an existing mp4/m4v file”