首页 > 其他 > 详细

Tachyon 概述

时间:2015-10-15 18:57:04      阅读:431      评论:0      收藏:0      [点我收藏+]

Tachyon 概述

    Tachyon是一个以内存为中心的分布式存储系统,能够可靠的在跨集群框架之间以内存速度进行共享,比如 Spark 和 MapReduce。它通过利用 lineage 信息和使用内存实现了高性能,Tachyon 在内存中缓存工作文件集,从而避免频繁读取的数据从磁盘去加载数据,这使得不同的 jobs/queries 和框架 以内存的速度 去访问缓存的文件集。


    Tachyon与 Hadoop 是可以兼容的,现有的Spark 和 MapReduce 程序能够在它上面运行而不用修改任何代码。这个项目是开源的(Apache License 2.0) ,被部署在多个公司。已经有超过过来自30多个机构的80余贡献者,包括:Yahoo,IntelRed Hat, 和 Tachyon Nexus. 该项目是Berkeley 数据分析栈(BDAS)的存储层,并还有部分Fedora发行版的(Fedora distribution.)


GIt仓库(Github Repository | 发布和下载(Releases and Downloads | 用户文档(User Documentation | (开发者文档)Developer Documentation | Meetup 组(Meetup Group | JIRA | User Mailing List


现阶段特性

  • 类似Java 文件API:Tachyon’s 本地化API类似于 java.io.File 类, 提供了 InputStream and OutputStream 接口, 高效的支持内存I/O映射. 我们推荐使用Tacyon这些API 获得最佳效能。 

  • 兼容性: Tachyon 实现了Hadoop FileSystem 接口.因此,Hadoop MapReduce 和 Spark 能够不做修改进行运行。然后,紧密结合才能充分利用Tachyon的优势,我们正在努力实现这个目标。点到点延迟加速依赖于工作负载和所述框架,由于各种框架有不同的执行开销。

  • Pluggable underlayer file system: To provide fault-tolerance, Tachyon checkpoints in-memory data to the underlayer file system. It has a generic interface to make plugging different underlayer file systems easy. We currently support HDFS, S3, GlusterFS, and single-node local file systems, and support for many other file systems is coming.

  • Native support for raw tables: Table data with over hundreds of columns is common in data warehouses. Tachyon provides native support for multi-columned data, with the option to put only hot columns in memory to save space.

  • Web UI: Users can browse the file system easily through the web UI. Under debug mode, administrators can view detailed information of each file, including locations, checkpoint path, etc.

  • Command line interaction: Users can use ./bin/tachyon tfs to interact with Tachyon, e.g. copy data in and out of the file system.

User Documentation

Deployment Guide:

Configuration:

Frameworks on Tachyon:

Others:

Tachyon Presentations:

  • Strata and Hadoop World 2014 (October, 2014) pdf pptx

  • Spark Summit 2014 (July, 2014) pdf

  • Strata and Hadoop World 2013 (October, 2013) pdf

Developer Documentation

Contributing to Tachyon

Building Tachyon Master Branch

External resources

Tachyon Mini Courses:

Hot Rod Hadoop With Tachyon on Fedora 21

Support or Contact

You are welcome to join our mailing list to discuss questions and make suggestions. We use JIRA to track development and issues. If you are interested in trying out Tachyon in your cluster, please contact Haoyuan.

Acknowledgement

Tachyon is an open source project started in the UC Berkeley AMP Lab. This research is supported in part by NSF CISE Expeditions Award CCF-1139158, LBNL Award 7076018, and DARPA XData Award FA8750-12-2-0331, and gifts from Amazon Web Services, Google, SAP, The Thomas and Stacey Siebel Foundation, Adatao, Adobe, Apple, Inc., Blue Goji, Bosch, C3Energy, Cisco, Cray, Cloudera, EMC, Ericsson, Facebook, Guavus, Huawei, Informatica, Intel, Microsoft, NetApp, Pivotal, Samsung, Splunk, Virdata, VMware, and Yahoo!.

We would also like to thank to our project contributors.

Related Projects

Berkeley Data Analysis Stack (BDAS) from AMPLab at Berkeley


Tachyon 概述

原文:http://my.oschina.net/Rayn/blog/517690

(0)
(0)
   
举报
评论 一句话评论(0
关于我们 - 联系我们 - 留言反馈 - 联系我们:wmxa8@hotmail.com
© 2014 bubuko.com 版权所有
打开技术之扣,分享程序人生!