

Apache Hadoop
A software framework for distributed storage and processing that uses a network of many computers to solve problems involving massive amounts of data and computation, based on the MapReduce programming model.
+/- | Aspect | Description |
---|---|---|
+ | Distributed Computing | Hadoop enables the processing of large data sets across clusters of computers. |
+ | Scalability | It can scale from single servers to thousands of machines. |
+ | Storage | Each machine offers local computation and storage. |
+ | Programming Model | Uses simple programming models for distributed data processing. |
+ | Fault Tolerance | Designed to detect and handle failures at the application layer. |
+ | HDFS | The Hadoop Distributed File System provides high-throughput access to application data. |
+ | YARN | Short for "Yet Another Resource Negotiator"; manages resources and job scheduling across the cluster. |
+ | MapReduce | A system for parallel processing of large data sets within the YARN framework. |
- | Complex Setup | Intricate setup process, particularly challenging for beginners. |
- | Batch Processing | Slower batch-processing model compared to alternatives like Apache Spark. |
- | No Real-Time Processing | No support for real-time data processing. |
- | High Latency | Elevated latency stemming from batch processing. |
- | Single Point of Failure | Vulnerability due to reliance on a single master node. |
- | Data Locality Constraints | Difficulties in ensuring data locality. |
- | Scalability Challenges | Complications associated with scaling Hadoop clusters. |
- | Resource-Intensive | Significant hardware resources required for operation. |
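
The MapReduce programming model named in the table can be illustrated with a small single-process sketch. This is not Hadoop's API: the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical and chosen only for illustration, and everything runs in memory on one machine. It shows only the shape of the model: map emits key/value pairs, a shuffle groups them by key, and reduce aggregates each group.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Map step: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle step: group all emitted values by their key."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(grouped)


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Reduce step: collapse the values for one key into a single count."""
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle, and reduce in sequence over the input lines."""
    pairs = (pair for line in lines for pair in map_phase(line))
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    sample = ["big data big clusters", "data locality matters"]
    print(word_count(sample))
```

In a real cluster the map and reduce steps would run in parallel on different machines and the shuffle would move data over the network; the sketch keeps all three phases in one process so the data flow is easy to follow.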
System Requirements
# | Minimum |
---|---|
1 | |
2 | SSH must be installed and sshd must be running to use the Hadoop scripts that manage remote Hadoop daemons. |
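
The SSH requirement above can be checked roughly before installing. The sketch below is a heuristic, not an official Hadoop tool: the helper names are hypothetical, and it assumes sshd listens on `localhost` at the default port 22. An open port only suggests that an SSH daemon is running; it does not prove it.

```python
import shutil
import socket
from typing import Dict


def ssh_client_installed() -> bool:
    """Return True if an `ssh` executable is found on the PATH."""
    return shutil.which("ssh") is not None


def sshd_reachable(host: str = "localhost", port: int = 22, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds.

    Assumption: sshd uses the default port 22 unless another port is passed.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def check_prerequisites(host: str = "localhost", port: int = 22) -> Dict[str, bool]:
    """Collect both checks into one report."""
    return {
        "ssh_installed": ssh_client_installed(),
        "sshd_reachable": sshd_reachable(host, port),
    }


if __name__ == "__main__":
    for name, ok in check_prerequisites().items():
        print(f"{name}: {'ok' if ok else 'missing'}")
```

If sshd is configured on a non-default port, pass that port to `check_prerequisites` instead of relying on the default.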
Ratings
4.15/5
Source | Rating |
---|---|
G2CROWD | 4.3/5 based on 81 reviews |
TrustRadius | 8.0/10 based on 214 reviews |
License
Categories
Alternatives
Distributed File System
No alternative software available under 'Distributed File System' category.
Cloud Computing
No alternative software available under 'Cloud Computing' category.
Notes
- Apache, the Apache Hadoop name, and the logo are trademarks of the Apache Software Foundation.
- The hardware system requirements (optimal) are not taken from the official website.