
TubeKit is a toolkit for creating YouTube crawlers. It lets you build your own crawler that crawls YouTube based on a set of seed queries and collects up to 17 different attributes per video. TubeKit assists in every phase of this process, from database creation to providing access to the collected data through browsing and searching interfaces.

The toolkit is implemented primarily in PHP and is available for download.


Steps to create your YouTube crawler with TubeKit:
  1. Provide basic information (project name, directory to store the crawler, etc.).
  2. Set up the database.
  3. Select up to 17 different attributes to collect for a YouTube video.
  4. Set up various schedules for crawling.
  5. Access your crawler and enter seed queries.
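
The steps above can be pictured as filling in one configuration record. The following is a minimal sketch in Python, not TubeKit's actual PHP code or file format: the class name, field names, attribute names, and the hours-based schedule are all hypothetical, chosen only to mirror the five steps. The one value taken from this page is the limit of 17 attributes.

```python
from dataclasses import dataclass, field

MAX_ATTRIBUTES = 17  # limit stated in step 3


@dataclass
class CrawlerConfig:
    """Hypothetical record mirroring the five setup steps (not TubeKit's real format)."""
    # Step 1: basic information
    project_name: str
    directory: str
    # Step 2: database settings (hypothetical field)
    database_name: str
    # Step 3: attributes to collect per video (names are placeholders)
    attributes: list = field(default_factory=list)
    # Step 4: crawl schedule, modeled here as hours between runs (an assumption)
    crawl_interval_hours: int = 24
    # Step 5: seed queries entered once the crawler is set up
    seed_queries: list = field(default_factory=list)

    def validate(self):
        """Return a list of problems; an empty list means the config is usable."""
        problems = []
        if not self.project_name:
            problems.append("project name is required")
        if len(self.attributes) > MAX_ATTRIBUTES:
            problems.append(f"at most {MAX_ATTRIBUTES} attributes can be selected")
        if len(set(self.attributes)) != len(self.attributes):
            problems.append("attributes must be unique")
        if self.crawl_interval_hours <= 0:
            problems.append("crawl interval must be positive")
        if not self.seed_queries:
            problems.append("at least one seed query is needed to start crawling")
        return problems


config = CrawlerConfig(
    project_name="demo",
    directory="/tmp/demo_crawler",
    database_name="demo_db",
    attributes=["title", "view_count"],  # placeholder attribute names
    seed_queries=["open source"],
)
print(config.validate())
```

Collecting problems in a list instead of raising on the first one lets a setup form report every issue at once, which suits a step-by-step wizard like the one described here.
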
New: Want to crawl YouTube to collect data and contextual information without downloading TubeKit or installing anything on your side? Try ContextMiner!
New: Check out the new tools, which support various crawling operations on YouTube, including harvesting videos and user profiles.

TubeKit by Chirag Shah is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License. TubeKit is supported by NSF grant #IIS 0455970.

© 2008 Chirag Shah | Last update: October 5, 2008