Tightrope Media Systems PEG Experts Series Archiving your Digital Video - 9/3/15

Join John Hauser, "The Archive Guy", the founder of the Community Media
Archive, and recipient of the ACM's 2015 Buske Leadership Award as he
talks about how to upload your digital video to the Internet Archive,
change metadata for your items and use the archive as a means of sharing
videos with other access centers. There will be plenty of time for
questions in the session, so come prepared to "Ask the Archivist!"

Internet Archive
it's a digital public library

How to get Info/Video into the Archive

minimum needed to get started
step 1 register email account on IA
       https://archive.org/account/login.createaccount.php

step 2: Upload via the form
        https://archive.org/upload/

step 3: how to get your own collection setup
        email Jeff Kaplan (collections-service@archive.org) with the following
information (Jeff will use this information to establish your collection
under the Community Media Archive):

* the name you want for your collection
* request to be a "sub collection" of the Community Media Archive (collection:"community_media")
* the email address(es) you used to register for an archive account that should have admin access to the collection.
* a descriptive text blurb for your collection's detail page.
* a logo for your collection

What are the issues you'll run into?
upload to the wrong collection
	check that your collection is listed and *not* "Community Video"
	solved by presetting collection in upload url
	https://archive.org/upload/?collection=access_humboldt

metadata
what can i put in metadata fields?

examples of some pretty hairy metadata
Field[1]:collection
Field[2]:series
Field[3]:mediatype
Field[4]:date
Field[5]:year
Field[6]:contributor
Field[7]:creator
Field[8]:item
Field[9]:talk-id
Field[10]:original-html
Field[11]:speaker
Field[12]:slug
Field[13]:event
Field[14]:filmed
Field[15]:published
Field[16]:original-download
Field[17]:duration
Field[18]:runtime
Field[19]:title
Field[20]:file
Field[21]:description
Field[22]:licenseurl
Field[23]:subject
File: /mnt1/tedtalks_480p_mp4/eLSeed_2015U.mp4 -> /eLSeed_2015U/eLSeed_2015U.mp4
retrieving existing metadata for eLSeed_2015U...
item eLSeed_2015U does not exist yet.
1 file(s) to upload in eLSeed_2015U
PUT http://s3.us.archive.org/eLSeed_2015U/eLSeed_2015U.mp4
x-amz-auto-make-bucket:1
x-archive-meta-original-html:http://www.ted.com/talks/el_seed_street_art_with_a_message_of_hope_and_peace
x-archive-meta-creator:TED.com
x-archive-meta-speaker:eL Seed
x-archive-meta-date:2015
x-archive-meta-collection:tedtalks
x-archive-meta-series:tedtalks
x-archive-meta-talk-id:2304
x-archive-meta-filmed:1426464000
x-archive-meta-original-download:http://download.ted.com/talks/eLSeed_2015U-480p.mp4?apikey=TEDDOWNLOAD
x-archive-meta-slug:el_seed_street_art_with_a_message_of_hope_and_peace
x-archive-meta-contributor:Access Humboldt - Eureka, CA
x-archive-meta-subject:Tedtalks;TED;Talks;TED Fellows;art;arts;culture;language;poetry;TED2015;2015
x-archive-meta-licenseurl:http://creativecommons.org/licenses/by-nc-nd/3.0/
x-archive-meta-duration:339
x-archive-meta-description:Born in France to Tunisian parents, eL Seed delights in juggling multiple cultures, languages and identities. Not least in his artwork, which sets Arabic poetry in a style inspired by street art and graffiti. In this quietly passionate talk, the artist and TED Fellow describes his central ambition: to create art so beautiful it needs no translation.
x-archive-meta-event:TED2015
x-archive-meta-published:1437663497
x-archive-meta-runtime:00:05:39
x-archive-meta-mediatype:movies
x-archive-meta-title:eL Seed: Street art with a message of hope and peace
x-archive-meta-year:2015
x-archive-ignore-preexisting-bucket:1

Recommended Metadata Practices
push all metadata you have to archive

consider including a "presenter" or "submitter" element, so you can later search by this
include a "series" element
"Runtime" in HH:MM:SS format
"Date"
use multiple "subject" elements
put year in a separate subject
put station name, initials, and state in separate "subject" elements
 
how do i add custom fields using their upload form?
"More Options Add aditional metadata" link at the bottom of the form
name of the field goes into "key"
value of the field goes into the "value" text entry box

example:
to get a NTSC SD MPEG2 version derived from an MPEG4 original, you'd put
"collection" in the key field and
"broadcast_ready" in the value field

how do i preset metadata with their form?
https://blog.archive.org/2013/02/08/presetting-metadata-with-the-new-beta-uploader/
http://archive.org/upload/?collection=access_humboldt,broadcast_ready

how do i edit/add metadata for items I've already uploaded?
https://archive.org/edit/AH-ecc_7-7-15

Why am I bothering to do all this work with metadata?
Browse by Subject/Keywords
https://archive.org/browse.php?field=subject&mediatype=movies&collection=access_humboldt

can i upload 1080p video to IA?
what are the naming requirements (.HD.mov suffix)?

how do i get an SD version derived from my HD original
see above answer for adding custom fields field=collection, value=broadcast_ready

How do i check on the status of my upload?
history page; compact view - derive step - log
https://catalogd.archive.org/history/KA_Crash_Course_Big_History_8_-_The_Modern_Revolution
https://catalogd.archive.org/log/392668997

How long will it take before my item appears in the search results and database?
3x4 times the runtime of your item

What's happening to my item?
https://catalogd.archive.org/catalog.php?justme=1

How can i tell if there's a backlog of items waiting to be processed?
from item detail page got to "history", click on graphs, check "Waiting Derives", "Derives per Hour"
https://archive.org/~tracey/stats/

How to Get information out of Internet Archive

Individual Items
single formats for single items, use "Download Options" link to down the format you want
or
https://archive.org/compress/MediaEdge536/formats=MPEG2,ARCHIVE%20BITTORRENT,METADATA

Search and "Advanced Search"
test output with HTML table output
when ready, switch to CSV (or RSS) output
https://archive.org/details/big_history_project
https://archive.org/details/mediaedge?sort=-publicdate
 
RSS feeds per collection
https://archive.org/services/collection-rss.php?collection=big_history_project
https://archive.org/services/collection-rss.php?collection=mediaedge

Series hosted on the Internet Archive
Big History Project
Media Edge
PopTech
Peak Moment
Rompeviento TV
TedTalks
The Royal Society for the Arts
Veterans Helping Veterans

Bittorent Client
last 50 items in a collection's or advanced search result's RSS feed
torrent client must support RSS Feeds (utorrent and qBittorrent do)
copy archive's RSS feed into "Add Feed" or "New Subscription"

Alternatives to Bittorrent RSS Feeds
Example script to download torrent files of last 50 items to a "watch" directory for a Bittorrent client
http://accesshumboldt.net/Community_Media_Archive/ACM_2015_Presentation/mkget_last_50_ah_items.sh.txt

Example script to download MPEG2 and metadata files of last 50 items to a destination directory
http://accesshumboldt.net/Community_Media_Archive/ACM_2015_Presentation/mkget_last_50_MPEG2_metadata_ah_items.sh.txt

Statistics on use of Community Media Archive 
http://accesshumboldt.net/Community_Media_Archive/ACM_2015_Presentation/cma_stats_2015-07-31.pdf

Community Media Archive Wiki
http://goo.gl/H9OoPQ
http://accesshumboldt.net/wiki/index.php?title=Community_media_archive#ACM_2014_Western_Regional_Conference_Presentation_Session_Links