Manado, Indonesia. 95252
(+62) 823-9602-9583
bayudwiyansatria@gmail.com

Tag: Technology

Software Engineer | DevOps Engineer

What Are the Differences Between Apache Hadoop and Apache Spark

What is Big Data? How large does a data set have to be before it counts as Big Data? The term is relative. For example, 50 terabytes may be Big Data for a startup but not for companies like Google and Facebook, because they have the infrastructure to store and process that much data. Apache Hadoop and Apache Spark are both Big Data analytics frameworks; they provide some of the most popular tools used to carry out common Big Data-related tasks.

Apache Hadoop Introduction

The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.

Rather than rely on hardware to deliver high availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly available service on top of a cluster of computers, each of which may be prone to failures.

Apache Spark Parallel Program Flows

Apache Spark Flows: Apache Spark consists of several purpose-built components, as discussed in the introduction to Apache Spark. Let’s see what a typical Spark program looks like. Imagine that a 300 MB log file is stored in a three-node HDFS cluster. The Hadoop Distributed File System (HDFS) automatically splits the file into 128 MB parts and places each part on a separate node of the cluster.
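The arithmetic behind that split can be sketched in a few lines of Python. This is only an illustration of the block sizes, not HDFS code: the function name `split_into_blocks` is hypothetical, the 128 MB block size is taken from the text above, and replication is ignored.

```python
def split_into_blocks(file_size_mb, block_size_mb=128):
    """Return the sizes (in MB) of the parts a file would be split into.

    Hypothetical helper for illustration; it only models the arithmetic,
    not how HDFS actually places or replicates blocks.
    """
    full_blocks, remainder = divmod(file_size_mb, block_size_mb)
    sizes = [block_size_mb] * full_blocks
    if remainder:
        # The last part holds whatever is left over.
        sizes.append(remainder)
    return sizes


blocks = split_into_blocks(300)
print(blocks)       # [128, 128, 44]
print(len(blocks))  # 3
```

Under these assumptions the 300 MB file yields three parts (two full 128 MB parts and one 44 MB part), which matches the three nodes in the example.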

Setup And Configure Cluster Node Hadoop Installation

This describes how to set up and configure a cluster-node Hadoop installation so that you can quickly perform simple operations using Hadoop MapReduce and the Hadoop Distributed File System (HDFS).

Apache Spark Parallel Processing Introduction

Apache Spark is usually defined as a fast, general-purpose, distributed computing platform. Apache Spark provides high-level APIs in Java, Scala, Python and R, and an optimized engine that supports general execution graphs. It also supports a rich set of higher-level tools including Spark SQL for SQL and structured data processing, MLlib for machine learning, GraphX for graph processing, and Spark Streaming.

Continuous integration and continuous delivery (CI/CD) With DevOps Azure

Continuous integration and continuous delivery (CI/CD) are considered by most to be the backbone of DevOps. Things start to get really interesting when you combine these practices with programmable infrastructure and a suite of services that allow you to automate the entire lifecycle of an application.

Cloud Ways Platform Reviews

Introduction: Cloudways is one of the most significant web hosting providers in the industry. With the number of hosting providers piling up, it is always essential to choose one that is affordable and provides useful features. When it comes to hosting a WordPress site, no one does it better than Cloudways. Cloudways is managed cloud …

Things You Must Know Before Choosing GCP, AWS or Azure

Shifting to Cloud Computing: Why Do It at All? Things you must know about GCP, AWS and Azure! Cloud computing has taken the world by storm, and there are some significant reasons behind it. Everyone remembers the days when every bit of office work had to be documented in triplicate. Not only did it waste money, it also …

How Does Google Analytics Work?

Introduction: What is Google Analytics? How does Google Analytics work? Is Google Analytics a free service offered by Google? Yes, it is free. This web service analyzes the behavior of visitors to a website. Google Analytics is also used for Search Engine Optimization (SEO), and with it you can improve your business plan for marketing purposes. …

Bing Search Engine Review

Introduction to the Bing Search Engine: Have you ever tried asking your search engine something to no avail? With many of us facing this problem, Bing has been updated with four new features that give users more thorough answers and enable it to respond to broader search terms. Bing is a search engine owned and operated …