Convert WebVTT to a Transcript using Python

by @edent | 1 comment | Read ~926 times.

I want to convert YouTube's auto-generated subtitles into a plain transcript. Why is this so hard? This blog post gives a more detailed explanation than my answer to this StackOverflow question. Here's what the subtitles look like when you view a video: And here's what the code which generates those subtitles looks like: 00:00:00.930 -->… Continue reading →
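The excerpt stops before the actual method, so what follows is only a minimal sketch of one way to do it, not the code from the post. It assumes the file opens with `WEBVTT`, `Kind:` and `Language:` header lines, that cue text can carry inline tags such as `<c>` or `<00:00:01.199>`, and that rolling captions repeat the previous line in the next cue. The function name `vtt_to_transcript` is made up for illustration.

```python
import re

# A cue timing line, e.g. "00:00:00.930 --> 00:00:03.000 align:start".
# The hours field is optional in WebVTT.
TIMING = re.compile(r"^(?:\d{2,}:)?\d{2}:\d{2}\.\d{3}\s+-->")
# Inline markup such as <c>, </c> or <00:00:01.199>.
TAG = re.compile(r"<[^>]+>")
# Header lines to ignore (assumed to match what YouTube writes).
HEADER_PREFIXES = ("WEBVTT", "Kind:", "Language:")


def vtt_to_transcript(vtt_text):
    """Return the caption text of a WebVTT file as one plain-text string."""
    kept = []
    for raw in vtt_text.splitlines():
        line = raw.strip()
        # Skip blank lines, header lines and cue timings.
        if not line or line.startswith(HEADER_PREFIXES) or TIMING.match(line):
            continue
        text = TAG.sub("", line).strip()
        # Drop a line that only repeats the one before it
        # (assumption: rolling captions show each line twice).
        if text and (not kept or kept[-1] != text):
            kept.append(text)
    return " ".join(kept)
```

Read the `.vtt` file into a string and pass it to `vtt_to_transcript`. Only consecutive duplicates are dropped, so a phrase that is genuinely spoken again later in the video still appears in the transcript.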

Using YouTube to Transcode Videos to DASH on the Command Line

by @edent | 9 comments | Read ~1,584 times.

This is part of my redecentralisation efforts to liberate my videos from YouTube. MPEG-DASH is a simple method of streaming videos which doesn't require any specialised server software. You convert a high resolution video into a series of smaller resolution videos. You chop each of the videos up into several chunks. As the video plays,… Continue reading →
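To make the "several resolutions, each chopped into chunks" idea concrete, here is a rough sketch that only works out the chunk boundaries. It does not transcode anything, and the function name, the resolution heights and the 4-second chunk length are all illustrative assumptions rather than values from the post.

```python
import math


def plan_segments(duration_s, heights, segment_s=4.0):
    """List (height, index, start, end) for every chunk of every rendition.

    duration_s: length of the source video in seconds.
    heights:    hypothetical output resolutions, e.g. [720, 360].
    segment_s:  hypothetical chunk length in seconds.
    """
    count = math.ceil(duration_s / segment_s)
    plan = []
    for height in heights:
        for index in range(count):
            start = index * segment_s
            # The last chunk may be shorter than the others.
            end = min(start + segment_s, duration_s)
            plan.append((height, index, start, end))
    return plan
```

For a 10-second video at two resolutions, `plan_segments(10, [720, 360])` yields six chunks, three per resolution, with the final chunk of each covering seconds 8 to 10.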

Liberate Your YouTube Videos

by @edent | Read ~390 times.

If you've been following this blog, you'll know that Google unjustly shut down my YouTube channel. They've now reinstated it - but I can no longer trust them as custodians of my data. So, here's a quick tutorial on how to download all your videos - and metadata - from YouTube. The Official Way: Google… Continue reading →

Context Specific Content Surfacing

by @edent | 1 comment | Read ~157 times.

In a mixed paradigm environment, how do you ensure content is surfaced which is context specific? By which I mean - how do you make your content serve the user's time-bound constraints? What I'm trying to say is - serendipitous discovery must be restricted based on temporal imperatives. Or, to break it down further, a… Continue reading →

Sky News Infringed My Copyright

by @edent | 22 comments | Read ~7,254 times.

UPDATE! I have reached a settlement with Sky. Update: 16 March, 2011. They have finally paid up! tl;dr: Sky News stole my copyrighted work and distributed it without credit or payment. I asked them to pay £1,500. They refused. Full Story: During the recent O2 brouhaha I recorded a video showing how the issue could… Continue reading →

Mobile Badvertising: Sony Pictures - Easy A

by @edent | 1 comment | Read ~266 times.

Sometimes a mobile advert is just so bad that you wonder what drugs the people who commissioned it were on. Easy A is a new film aimed squarely at the youth market. Perfect fodder for mobile advertising. Cute little advert, doesn't say what it will link to, but other than that, not bad. So let's click through… Continue reading →

Embedding Mobile YouTube in Dabr / Twitter

by @edent | 3 comments | Read ~376 times.

I've decided to add a preview of YouTube videos into Dabr, the mobile Twitter service. The code is fairly simple - although I still suck at regular expressions.

    // The "." and "?" must be escaped to match literally; [\w-]+ captures the video ID.
    if (preg_match_all('#youtube\.com/watch\?v=([\w-]+)#', $tmp, $matches, PREG_PATTERN_ORDER) > 0) {
        // $matches[1] holds every captured video ID.
        foreach ($matches[1] as $match) {
            $images[] = theme('external_link', "http://m.youtube.com/watch?v={$match}", "<img src='http://i.ytimg.com/vi/{$match}/1.jpg' class='twitpic' />");
        }
    }

Here's how it looks. … Continue reading →
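For anyone who wants to try the matching logic outside Dabr, here is a rough Python equivalent of the PHP above. It is a sketch, not Dabr's code: the function name `youtube_previews` is made up, a plain `<a>` tag stands in for Dabr's `theme('external_link', ...)` helper, and video IDs are assumed to contain only ASCII letters, digits, `_` and `-`.

```python
import re

# Escaped "." and "?" match literally; the group captures the video ID.
# re.ASCII keeps \w to letters, digits and "_" (an assumption about ID format).
YOUTUBE_ID = re.compile(r"youtube\.com/watch\?v=([\w-]+)", re.ASCII)


def youtube_previews(text):
    """Return one linked thumbnail snippet per YouTube URL found in text."""
    previews = []
    for video_id in YOUTUBE_ID.findall(text):
        link = f"http://m.youtube.com/watch?v={video_id}"
        thumb = f"http://i.ytimg.com/vi/{video_id}/1.jpg"
        # Plain <a> tag used here in place of Dabr's theme() helper.
        previews.append(
            f"<a href='{link}'><img src='{thumb}' class='twitpic' /></a>"
        )
    return previews
```

Calling `youtube_previews` on a tweet's text returns a list with one snippet per matched URL, or an empty list when there is no YouTube link.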

YouTube and QR Codes

by @edent | 5 comments | Read ~888 times.

I haven't seen this mentioned anywhere else. It would appear that the mobile version of YouTube now includes links to QR codes, so you can share the video with your friends. I've only seen this using the Android browser; I'm not sure if it's available on other devices. Browsing http://m.youtube.com/ at the bottom is a… Continue reading →

WordPress Plugin - YouTube for XHTML and Mobile

by @edent | 11 comments | Read ~820 times.

This blog is XHTML 1.0 Strict. Unfortunately, the code produced by YouTube for embedding their videos isn't. Nor is it compatible with the dotMobi WordPress Mobile Pack. This scrap of code is a WordPress Plugin. It allows me to write something like [youtube id="p5ACl4iBGd0"] The code then outputs either the XHTML 1.0 embedding code or… Continue reading →