87
National Archives and Records Administration
Best Practices for the Capture of Social Media Records
13
Provider
Paid Service
Product Description & Use Cases
Platforms Able
to Capture
Method of Capture
Notes
Erado
Offers email, social media, and instant
message archiving to comply with
FINRA, SEC, Sarbanes-Oxley, Gramm-
Leach-Bliley, FERC, NERC, and HIPAA.
LinkedIn, Face-
-
book, Twitter,
and blogs
Captures content either directly from social
media platforms or using Erado’s platform-
specific tools. Content is converted into Erado
format and either hosted or delivered to a
customer-preferred platform.
Facebook Download
service
Facebook provides a backup of a us-
-
er’s profile, including content posted
to timeline, photos and videos up-
loaded to account, friend lists, user-
created Notes, RSVP’d events, sent
and received messages, any com-
ments made on timeline posts, pho-
tos, and other timeline content. Users
can also request an enhanced archive
that contains additional information.
Facebook
Facebook sends a ZIP file to the email address
associated with a particular Facebook ac-
count.
Only content associated
with user’s account can be
backed up and accessible.
Flickr API
Flickr provides an open Application
Programming Interface (API) so Flickr
can communicate with other software
or tools. Flickr also provides RSS feeds
of updates to content.
Flickr
FlickrEdit
Previously FlickrBackup, this open-
-
source, Java-based desktop app al-
lows users to download, edit, or up-
load photos to and from Flickr.
Flickr
Free YouTube Down-
-
load
Software allows download of single
YouTube videos or a batch of all of
the videos of a selected YouTube user
or channel.
YouTube
Saves content in original YouTube format or
r
converts to AVI, MP4, and WMV formats.
freezePAGE
Service preserves web snapshots and
automatically logs the date page was
saved, IP address of the person who
saved it, page size and more. Requires
login every 3 days for unregistered
Web sites
Captures a snapshot of webpages and saves
on freezePAGE server when a user enters URL
(manual capture). Includes main web page
and embedded elements such as images,
stylesheets, and script files.
Webpage must be less
than 3MB (or 10 MB for
premium user accounts)
with less than 500 embed-
ded elements and retriev-
-
88
National Archives and Records Administration
Best Practices for the Capture of Social Media Records
14
Provider
Paid Service
Product Description & Use Cases
Platforms Able
to Capture
Method of Capture
Notes
users and 31 days for registered users
or account and pages may be deleted.
able within 120 seconds.
Global Relay
Global Relay Archive captures elec-
-
tronic messages in real time and cre-
ates a copy of each message, which is
then indexed, serialized and
time/date stamped. Users can access
and search archived content.
Facebook, Twit-
-
ter, LinkedIn
Hanzo Archives
Offers commercial web archiving ser-
-
vices for regulatory compliance, litiga-
tion-support, and e-discovery. Sub-
scribers can manage their web ar-
chives according to their RM policies
with associated metadata.
Web sites, so-
-
cial media
Uses proprietary tools to capture content
from complex websites, including rich-media
and interactive content. Content can be
searched, reviewed and exported. Subscrib-
ers can define their capture policy and have it
captured and organized by time.
Hearsay Social
Hearsay Social’s compliance module
provides workflow management,
monitoring, and capture of social me-
dia from a central dashboard.
Facebook, Twit-
-
ter, LinkedIn,
Google+, Four-
square
Using APIs, data is archived within context,
catalogued, and searchable. Users can export
data to existing enterprise systems, including
SiteMinder, Websphere, Autonomy, and Sy-
mantec Enterprise Vault.
Hootsuite
Provides a social media dashboard for
managing multiple accounts. Users
can spread messages across net-
works, monitor keyword mentions in
streams, and track results with built-
in click-through stats and integrated
Google Analytics.
Twitter
Archived Messages is an optional add-on for
r
the HootSuite Pro Plan. Twapperkeeper, now
part of Hootsuite, archives tweets.
First 100 messages are
archived for free and start-
ing at $10/month for addi-
tional levels. Available
from GSA's Apps.gov
If this, then that
Users create ittt tasks by putting one
ne
channel's trigger together with an-
other channel's action. Tasks are exe-
cuted every 15 minutes and can be
turned on or off and shared with oth-
ers.
Twitter, Face-
-
book, weather
forecasts,
email, etc.
Limited channels
103
National Archives and Records Administration
Best Practices for the Capture of Social Media Records
15
Provider
Paid Service
Product Description & Use Cases
Platforms Able
to Capture
Method of Capture
Notes
Iterasi
Subscription service to create web
archives for the corporate, legal and
government industries. It includes
data available requiring authentica-
tion such as direct messages on Twit-
ter and messages on Facebook.
Twitter, Face-
-
book, LinkedIn
Web crawlers capture entire sites or individu-
-
al pages on-demand or on a regular schedule.
Can also capture contents of RSS feeds (e.g.,
blog feeds, Twitter). The “Page Notary Tool”
captures any webpage, even those password
or firewall protected.
LiveOffice
Social Archives service offered by Sy-
-
mantec is part of the larger LiveOffice
software package.
Twitter, Face-
-
book, LinkedIn
Captures social media content in a centralized
repository
Only available with
LiveOffice AdvisorMail.
Memento
Memento, an LC-funded project run
n
by Los Alamos National Laboratory
and Old Dominion University, propos-
es a technical framework for integrat-
ing current and past Web.
Web sites
Firefox plug-in retrieves web captures from
m
the Internet Archive from a specified date
and time.
Cloud Preservation
by Nextpoint
Cloud Preservation is a subscription
service that provides automated,
cloud-based capture of web content
for marketing, compliance, and litiga-
tion-related needs.
Websites, blogs
Twitter, Face-
book
Uses Amazon’s Web Services to crawl the
Web and archive sites, blogs and social media
posts. Web crawler set to capture HTMP
source code and images at pre-determined
intervals.
Ohmygov
Social media monitoring and metrics
service that allows users to track so-
cial media accounts and compare
their news mentions and rankings
against their peers.
Twitter, Face-
-
book
Provides account tracking which captures the
full content of tweets.
User’s comparisons are
limited by which agencies
the service tracks.
Ownbackup
Service that provides daily automated
backups of social media with unlim-
ited storage.
Facebook, Twit-
-
ter, LinkedIn,
Salesforce,
Gmail
Provides daily snapshots of cloud data, en-
-
crypts data via AES 256-bit, and stores on
Amazon's EBS.
PageFreezer
A subscription service to archive,
browse and search dynamic web con-
tent in compliance with records man-
agement laws and as legal evidence.
Uses web crawling software to take daily
snapshots of websites. Only new web pages
and changes to web pages are archived to
save on storage. Subscribers can request a
local copy of all their web content in their
86
National Archives and Records Administration
Best Practices for the Capture of Social Media Records
16
Provider
Paid Service
Product Description & Use Cases
Platforms Able
to Capture
Method of Capture
Notes
native formats (HTML, PDF, TXT, MS Office,
OpenOffice, XML, CSS, Flash)
Parallel-Flickr
kr
Open-source tool for backing up Flickr
kr
photos and generating a database-
backed website that matches the
viewing permissions the user has cho-
sen on Flickr.
Flickr
Downloads and stores a local copy of original
photos and their "640x" versions along with
the information retrieved via the API as a
JSON file. It stores enough data about each
photo in a database so that it can reconstruct
your photostream and with a webpage for
each photo. It uses the Flickr API as a single
sign-on and validation service, which means
that the site can retrieve and store your con-
tact list and the relationship which each per-
son in it.
Described as a work in
progress.
Patrina
Captures, indexes, and consolidates
social media feeds into a hosted
archive as WORM optical, format.
Facebook,
Twitter,
LinkedIn,
blogs
Recollect
Backs up users pictures, tweets, and
check-ins from multiple social media
accounts.
Twitter, Flickr,
Instagram,
Foursquare
Users have the ability to
download their data.
Reed Archives
Captures social media content on
demand or on a schedule through
social media APIs. Items can be
tagged and searched. Users can also
implement retention schedules on
archived content.
Facebook, Twit-
-
ter and
LinkedIn, Web
sites, RSS feeds
Users can export archives
individually as PDFs or cre-
ate bulk exports of entire
websites and social media
accounts into native for-
mat, eDRM XML and PDF.
Site Replay
Subscription service provides daily
captures of screenshots with digital
watermarks and signatures.
Websites
Captures screenshots of webpages and pro-
-
vides access to them on their secure server.
Stored screenshots can be viewed online or
downloaded monthly.
SiteSucker for Mac
OS X
SiteSucker is a Macintosh application
that automatically downloads web
Web sites
Asynchronously copies a site's webpages,
images, backgrounds, movies, and other files
81
National Archives and Records Administration
Best Practices for the Capture of Social Media Records
17
Provider
Paid Service
Product Description & Use Cases
Platforms Able
to Capture
Method of Capture
Notes
sites from the Internet.
to a local hard drive. Users enter a URL, press
return, and SiteSucker downloads the web
site.
Smarsh
Subscription service provides cloud-
-
hosted social media archiving with no
installation. Content is captured, pre-
served and indexed in a Smarsh ar-
chive where posts can be searched
and retrieved.
LinkedIn, Face-
-
book, Twitter
and Chatter
Social media activity is captured via proxy or
through APIs in real-time and each archived
page and object is time stamped, hashed and
stored in native format. Service package in-
cludes a monthly copy of client data via en-
crypted DVD. Users control updates and dele-
tion schedules and can export content in its
native format.
Each archive is read-only
y
with 3 copies stored across
two physical locations.
Content retains its original
usability and live links.
SMC4 by Integritie
SMC4 enables automated capture,
control, communication and compli-
ance of social media. SMC4 workflow
has all the features of standard with
advanced case management.
Facebook, Twit-
-
ter, LinkedIn,
Google+, email
SocialSafe
A downloadable application which
will automatically download content
when requested and store it locally
on the user’s hard drive. No addition-
al storage of content is available
elsewhere.
Facebook, Twit-
-
ter, Instagram,
LinkedIn,
Google+,
Viadeo
Saves all tweets (not just latest 3200) as a ZIP
file for photos and CSV for Twitter.
Socialware
The software platform provides a cen-
n-
tralized access point for managing
and capturing social media in compli-
ance with organizational and legal
policies.
Facebook,
Twitter,
LinkedIn
Sonian
IM Archive claims that it is easy to
deploy, requires no maintenance, and
adapts to the evolving IM, SMS, and
social media technology landscape.
Social media
91
National Archives and Records Administration
Best Practices for the Capture of Social Media Records
18
Provider
Paid Service
Product Description & Use Cases
Platforms Able
to Capture
Method of Capture
Notes
Spredfast
Customers use the Social Media Man-
-
agement System to monitor, coordi-
nate, and measure social media con-
tent.
Twitter, Face-
-
book, Facebook
applications,
LinkedIn,
YouTube,
Flickr,
SlideShare,
blogs
Captures detailed records in the Enterprise
Repository of every post made across each
social media platform. It also captures an
audit trail, internal comments and classifica-
tions, and all public engagement from a post.
ThinkUp
An open source web application that
captures activity on social networks.
Twitter, Face-
-
book, Google+
Captures posts made to selected social net-
-
works (at the time of this writing, Twitter,
Facebook and Google+). Provides a metrics
dashboard and the ability to export posts
made to those networks as a CSV files and a
set of associated metrics.
Total Discovery
Software supporting litigation, digital
investigations and electronic policy
consulting now includes data collec-
tion capabilities for social media.
Twitter, Face-
-
book
Archives services are tied
to e-Discovery services,
not stand alone.
Tweet Archivist
Windows application that helps users
archive tweets for later data-mining
and analysis.
Twitter
Tweetbook
Creates a PDF ebook of most recent
tweets, replies, and favorites at user’s
request.
Twitter
Up to 3200 most recent tweets included due
to Twitter API limitation. Allows an option for
a backup file in XML.
Tweet Library (Mac)
Creates a local and searchable archive
of tweets, favorites, and retweets.
Creates collections so that users can
create timelines.
Twitter
Exports the archive, timeline, or collections to
a text file for saving to your Mac or PC.
Downloads up to 3200
tweets on the first launch.
Users can upload the .zip
archive from Twitter for
their complete tweet his-
tory.
Tweet Nest
t
Installs on web server to provide a
backup of tweets that users can store,
browse, and search. Users can also
Twitter
Users can follow creator
Andy Graulund
(@graulund) for script up-
-
92
National Archives and Records Administration
Best Practices for the Capture of Social Media Records
19
Provider
Paid Service
Product Description & Use Cases
Platforms Able
to Capture
Method of Capture
Notes
customize the display.
dates.
Tweetstream
TweetStream provides simple Ruby
access to Twitter's Streaming API us-
ing open authorization.
Twitter
Uses em-twitter, an EventMachine-based
sed
ruby client for the Twitter Streaming API, to
connect to Twitter.
TwInbox
Twitter add-on for Outlook email with
h
searching and grouping capabilities as
well as graphs of usage statistics.
Twitter
Downloaded to Outlook.
Twitter API through
h
manual backup
By submitting queries to the Twitter
API, users can backup their data man-
ually, including the data of all the
people followed and user tweets.
Twitter
Users can save their tweets, the people they
follow and their followers as XML files by
manually saving each page.
With an active Twitter ac-
-
count, the manual backup
can be a lengthy process.
Twitter Archive
Download
Users can request a copy of their
Twitter Archive from Twitter. The ZIP
file includes user tweets and
retweets.
Twitter
Twitter Archiving
Google Spreadsheet
(TAGS)
Script that allows users to pull data
from Twitter’s API and save it in a
Google Spreadsheet.
Twitter
The updated version of
Twitter Archiving Google
Spreadsheet (TAGS) works
with the new Twitter API
and authenticates access.
Twitter Backup
Downloadable software that captures
all tweets and provides them in XML.
Uses a document type identical to
Twitter's API.
Twitter
Twitterscribe
Provides a daily backup of user’s last
200 tweets along with some metada-
ta in the Twitterscribe database. Us-
ers can browse tweet by month and
search by keyword, username,
hashtag, or URL.
Twitter
When user’s first sign up, they capture up to
3200 tweets (the limit of the Twitter API).
Requests user’s last 200 tweets from Twitter
daily. Users can then login and export their
tweets from Twitterscribe to CSV or PDF files.
Only captures tweets and
retweets.
WARCreate
Google Chrome extension that allows
users to create a Web ARChive
(WARC) file from any browseable
Websites
WARCreate allows multi-
-
ple archiving sessions to
exist in a single WARC file
62
National Archives and Records Administration
Best Practices for the Capture of Social Media Records
20
Provider
Paid Service
Product Description & Use Cases
Platforms Able
to Capture
Method of Capture
Notes
webpage.
automatically and inte-
grate with Memento
o
.
WP-DB Manager
er
WordPress plug-in allows users to
o
optimize, repair, backup, restore, or
delete a backup database,
drop/empty tables and run selected
queries. Supports automatic schedul-
ing.
WordPress
WP-DB Manager uses mysqldump application
n
to generate the backup and mysql application
to restore them via shell while WP-DB-Backup
uses PHP to generate the backup.
X1 Social Discovery
X1 Social Discovery collects, authenti-
-
cates, searches, reviews and produces
content. MD5 hash values are calcu-
lated upon capture and maintained in
native format. Content can be tagged,
sorted and exported.
Facebook, Twit-
-
ter, YouTube,
LinkedIn
Data is collected and indexed from social me-
-
dia streams, linked content and websites
through APIs and direct web navigation, ag-
gregating data in real time. Metadata is cap-
tured through APIs provided by the sites.
YTD Video Down-
-
loader
Software allows download of videos
from YouTube, including HD and HQ
videos.
Video from
YouTube, Face-
book, Google
Video, Yahoo
Video
Downloads and converts videos to MOV,
MP4, 3GP, WMV, AVI, or MP3 files. Videos
are played in Flash.
Documents you may be interested
Documents you may be interested