Terapot

Terpot is a high-capacity email archive solution which uses distributed storage and distributed computing for archiving, searching/indexing, analysis of massive amount of emails. It also reports information about social networks and domain networks by analyzing individual email data.

Features

  • Email Journaling / Crawling
    • Push - Archives email data via SMART-HOST function of mail servers or DNS
    • Polling - Archives email data from external storages via SFTP / FTP / NAS
  • Archive
    • Importing existing emails
    • Live-saving outgoing/incoming emails
    • Comes with compression (at most 50% of storage saved)
  • Storage
    • Scalable email archiving using distributed storage
    • Easily extensible capacity using distributed storage
    • Safer and reliable archiving using replication
  • Distributed Indexing
    • Live-search for outgoing and incoming emails with real-time indexing
    • Fast search for massive amount of emails with distributed indexing
  • E-Discovery
    • Fast search with distributed indexing and archiving
    • Non-stop service uing distributed search systems
    • Consistent searching speed regardless of data amount
    • Fast analysis via On-fly Discovery
    • Distributed downloads for huge search results
  • E-Mining
    • Various analysis on individual email archives
    • Social Network, Domain Network
  • Web Admin Interface
    • Monitoring of overall storage usage
    • Export archived emails
    • Management for individual email archives
  • Open-API
    • Provides Open APIs (REST, JSON) for integration with other systems

System Architecture

Usages

  • Mail Storage
    • Archives massive email data for long periods (over 10 years)
    • Increased quota effect
    • Reduces load on primary mail servers by decreasing the size of mail boxes
    • Convenient search for old emails
    • Restores deleted emails
  • Email Compliance
    • Fast delivery of huge email data in case of lawsuits within limited time
  • Retrieve Information
    • Various analysis of email data that have increasing potential values