Web Archiving

Website Archiving
Overview
How It Works
Features
Sample Applications

Archiving Solutions for the Modern Web

Today's Web is where companies do business, and how they do business. It's how people connect, share and collaborate. It's always on, and it's always evolving.

All of the Web pages, blog posts, images, and interactive elements that your company has invested in shouldn't disappear with each update. As the Web becomes increasingly real-time, it's important not to lose our long-term memory.

Today's Web needs to be preserved, so that your company can tell its story. The modern Web, however, is not intended to be printed, filed or faxed. It should be experienced.

Smarsh Web archiving is designed to capture, preserve and re-create the Web experience as it existed in time. Whether to prepare and protect your company for litigation and e-discovery, to comply with regulatory obligations, or to keep a historical record of the websites that affect your business, Smarsh can help you bring dormant Web experiences back to life.

Web ArchivingSmarsh Web Archiving crawls any website and captures each Web page and its contents in their original format, providing the precise record of what was published online at any specific point in time. Archived Web pages are preserved and rendered in your browser with their original look and feel. Interactive elements remain functional, and links between pages are preserved, pointing to the destination Web page or document as it existed.

Social collaboration, shared documents in the cloud and authenticated, individual Web user experiences are now a corporate reality. As a result, the need for a verifiable, electronic record of what was said, and when, is increasing rapidly. Each archived page (and every archived object) is time-stamped and stored unaltered in native format, preserving forensic integrity.

Because of dynamic or database-driven content, third-party tools and publishing systems, traditional Web backup solutions can give a picture of what was published to the Web, but cannot re-create what your customers experienced. Files can be restored from backups, but Web pages are comprised of dozens of individually linked components.

Website ArchivingAnd rather than investing your staff's time in taking screenshots, printing out volumes of individual pages and archaically filing them away, all you need to do is turn it on. Decide how often and how much content is to be archived, and then search your Web archive, share with collaborators and easily monitor changes to the content over time.

Fortune 100 companies trust Smarsh with their online history. Web archive content is preserved in the secure, geographically-dispersed Smarsh data centers. Smarsh brings the best-in-class search, supervision and on-demand production capabilities that highlight its suite of market-leading solutions for archiving electronic communications to the Web.

How It Works

Smarsh Web Archiving crawls any website and captures each Web page and its contents in their original format, providing the precise record of what was published online at any specific point in time. Archived Web pages are preserved and rendered in your browser with their original look and feel. Interactive elements remain functional, and links between pages are preserved, pointing to the destination Web page or document as it existed at the time of archival.

Organizations define an archive policy that captures exactly what pages or objects they need on an automated, recurring schedule. To assure forensic integrity, each archived page (and every archived object) is time-stamped and stored in its native format. Once stored, each archived asset is completely read-only.

Full-text, historical search and browsing make accessing archived pages instantaneous. Archived Web pages and their audit trails can be reviewed by any authorized user. Archived Web pages can be exported for download where they retain their original functionality.

Smarsh Web Archiving

Capture  |  Preserve  |  Search  |  Supervise  |  Produce

Capture 

Archive a complete web site and all of its components, a single Web page, entire blogs, wikis or RSS feeds. Smarsh will help you develop and configure an archive policy that fits your needs.

  • Capture any website | Web archives are only truly authentic if they capture the whole picture. Archive a complete website and its components, a single Web page, entire blogs, wikis or RSS feeds. Complex content – like slide shows, social media feeds, videos, ads, and more – is captured and preserved.
  • Control what's captured | Capture or exclude specific website components or objects, and control how much or little content to archive. Granularly define how far to extend the crawl and capture of link content. Leverage flexible configuration options to suit your specific requirements.
  • Repeat on any schedule | Capture sites every day, each month, once a year, or anything in-between. Smarsh specialists configure a Web archiving policy to automatically archive as often as your site changes.
  • Re-create the personal Web experience | Dynamic or database-driven content, third-party tools and publishing systems create unique experiences for individuals. Rather than simply capturing what was published to the Web, capture and re-create the Web experience as it may have been rendered to specific individuals at specific points in time.

Back to top

Preserve 

Smarsh stores every archived Web object in its original, native form. Every file is time-stamped and stored in redundant, geographically-dispersed data centers.

  • Maintain original usability | Smarsh delivers the original Web page on demand as it existed when archived. Replay slide shows, video and Flash components from the archive. Explore interactive elements powered by JavaScript, and submit Ajax requests.
  • Back up hosted documents | If it's linked to from the Web, it can likely be archived with Smarsh. PDFs and other documents are preserved exactly as they were published.
  • Redundant storage and data centers | Messages are preserved in secure, geographically-dispersed data centers. Everything is saved to WORM (write once, read many) optical storage.

Back to top

Search 

Full-text, historical search and browsing make accessing archived pages instantaneous. Authorized users can search across their Web archives based on virtually any criteria, either ad-hoc or on a consistent, systematic basis.

  • Instant overview of the Web over time | Instantly see what has changed with full-length screenshots for every archived page on the Web archiving dashboard. Focus on a narrow date range, or view years at a time.
  • Reference history with a single click | Use the bookmarklet feature to recall the history of any page from your archive. Then, select any date in a single click.
  • Saved searches | Save search criteria and repeat searches for convenience, consistency and evidence of policy enforcement.

Back to top

Supervise 

Your supervision procedures are not at the mercy of inflexible technology. Customize your firm's supervision experience for optimal efficiency in review and effectiveness in identifying and mitigating risk.

  • Immersive review experience | A film strip of archived content enables fast and simple navigation within your archive, while the ability to render each page in its original, native format creates an immersive review experience.
  • Change alerts | Email updates automatically summarize changes on any page being archived, including third-party sites. Each email includes the full text of what was added and deleted, and a screenshot preview provides a visual reference.
  • Active links | Click on archived links and navigate to external pages rendered as they were at the specific point in time. Archive your entire website and compare the content and context over time.
  • Full audit review | Document every administrator session and action taken within your Web archive. Reviewers have the ability to annotate and flag content. Actions are logged and subsequent metadata is indexed and searchable.
  • Reporting center | Produce customizable reports on the Web archive usage, system audit history and Web archive data. Demonstrate policy enforcement and ensure accountability among multiple managers responsible for Web page review.

Back to top

Produce 

Authorized users can retrieve, export and produce entire websites in original form, on demand.

  • Self-sufficient export | Web archives can be securely downloaded to a PC, encrypted and saved to a portable media device, or imported into a third-party legal review platform. This can be utilized for real-time access to data during an investigation or examination, or to restore data for disaster recovery purposes.
  • Export in native format | Send fully functional archives to anyone. Treat Web objects like any other document by importing it into your existing legal review platform. Exported archives are completely file-based and don't even need an internet connection. The included load file identifies every page and its metadata.
  • Export with hashes | Retain forensic integrity during the review process. Cryptographic hashes allow you to verify that no content has been altered throughout the EDRM process.
  • Collaborate and share | Create a private library with whomever you choose. Every archive is completely private and you control who has access. Sharing grants instant access to the history of your website. Or use export to share archives with anyone else.

Back to top

Sample Applications

Companies have been reluctant to adopt new Web-based technologies because of a lack of infrastructure to monitor and archive content. Smarsh Web Archiving provides a fundamentally new approach to capturing content that opens new opportunities for publishing, archiving and supervision. The list of nearly 200 content types supported by Smarsh is growing daily.

At a glance, Smarsh Web Archiving makes anything you can view in your Web browser searchable and retrievable, with no installation. Alternative applications include:

  • Wiki Archiving: Wikis often present special challenges, especially when dealing with legacy non-Sharepoint installations that teams heavily depend on.
  • Enterprise Social: Content from corporate social-media training, sharing and collaboration networks can be archived.
  • Custom intranets: The vast majority of corporate information is stored on custom intranets and shared across teams. This data was not designed to be archived for compliance, but often must be.

 

Use the filters below to find regulations and laws relevant to you and your company.