TR-3969 discusses a brief overview of the Apache Hadoop project, along with some best practices for and performance testing with the NetApp E-Series storage system.
Normal 0 false false false EN-US X-NONE X-NONE table.MsoNormalTable {mso-style-name:"Table Normal"; mso-tstyle-rowband-size:0; mso-tstyle-colband-size:0; mso-style-noshow:yes; mso-style-priority:99; mso-style-parent:""; mso-padding-alt:0in 5.4pt 0in 5.4pt; mso-para-margin:0in; mso-para-margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:10.0pt; font-family:"Times New Roman",serif;}This report briefly discusses the various components ofthe Hadoop ecosystem. It also presents an overview of the E-Series solution byNetApp, why you should choose NetApp for Hadoop, and how the E-Series storage systemperforms in terms of throughput and time. It also includes best practices forconfiguring a Hadoop cluster and the kernel-level tuning to extract optimalperformance.
For more info, please check here