Skip to document

Information
AI Chat

Poster - Practical

Practical

Course

Data Science and Analytics

9 Documents

Students shared 9 documents in this course

University

Dr. Vishwanath Karad MIT World Peace University

Academic year: 2022/2023

Uploaded by:

Anonymous Student

This document has been uploaded by a student, just like you, who decided to remain anonymous.

Dr. Vishwanath Karad MIT World Peace University

Comments

Please sign in or register to post comments.

Students also viewed

Related documents

Preview text

Apache Hive

Apache Hive is an open-source data warehousing and SQL-like query language

system built on top of Hadoop for processing and analyzing large datasets. It

provides a high-level interface for managing and querying structured and semi-

structured data in a distributed storage environment, making it a valuable tool

for big data processing and analytics. Hive uses a language called HiveQL,

which is similar to SQL, to interact with data stored in Hadoop's HDFS or other

distributed file systems.

Map-Reduce

Hive's extensibility is achieved through the

integration of user-defined functions (UDFs)

and custom scripting to enable custom data

processing and transformations.

System Architecture

SQL vsHadoop

SQL: Ideal for structured data, offers a standard

query language for relational databases, enabling

efficient data retrieval and manipulation.

Hadoop: Suited for big data with structured, semi-

structured, or unstructured data, uses a distributed

file system and batch processing, enabling

scalable storage and processing of vast datasets.

SQL: Provides ease of use, well-defined schemas,

and ACID transactions for traditional databases.

Hadoop: Offers flexibility for diverse data types

and massive scalability, but requires complex data

transformations and lacks real-time capabilities,

making it suitable for different use cases.

1.

2.

3.

4.

KEY FEATURES

Extensibility

Apache Hive offers a SQL-like query

language for managing and analyzing large

datasets in Hadoop. Its schema-on-read

approach allows flexible handling of

structured and semi-structured data. With

seamless integration into the Hadoop

ecosystem, it supports scalability,

optimization, and robust security features,

making it a vital component for big data

processing and analytics.

Was this document helpful?

Poster - Practical

Course: Data Science and Analytics

9 Documents

Students shared 9 documents in this course

University: Dr. Vishwanath Karad MIT World Peace University

Was this document helpful?

Apache Hive

Apache Hive is an open-source data warehousing and SQL-like query language

system built on top of Hadoop for processing and analyzing large datasets. It

provides a high-level interface for managing and querying structured and semi-

structured data in a distributed storage environment, making it a valuable tool

for big data processing and analytics. Hive uses a language called HiveQL,

which is similar to SQL, to interact with data stored in Hadoop's HDFS or other

distributed file systems.

Map-Reduce

Hive's extensibility is achieved through the

integration of user-defined functions (UDFs)

and custom scripting to enable custom data

processing and transformations.

System Architecture

SQL vsHadoop

SQL: Ideal for structured data, offers a standard

query language for relational databases, enabling

efficient data retrieval and manipulation.

Hadoop: Suited for big data with structured, semi-

structured, or unstructured data, uses a distributed

file system and batch processing, enabling

scalable storage and processing of vast datasets.

SQL: Provides ease of use, well-defined schemas,

and ACID transactions for traditional databases.

Hadoop: Offers flexibility for diverse data types

and massive scalability, but requires complex data

transformations and lacks real-time capabilities,

making it suitable for different use cases.

1.

2.

3.

4.

KEY FEATURES

Extensibility

Apache Hive offers a SQL-like query

language for managing and analyzing large

datasets in Hadoop. Its schema-on-read

approach allows flexible handling of

structured and semi-structured data. With

seamless integration into the Hadoop

ecosystem, it supports scalability,

optimization, and robust security features,

making it a vital component for big data

processing and analytics.

Students also viewed

Related documents

@font-face { font-family: 'DM Sans'; font-style: normal; font-weight: 400; src: url(https://d20ohkaloyme4g.cloudfront.net/build/css/fonts/dm-sans-400-ext.woff2) format('woff2'); unicode-range: U+0100-024F, U+0259, U+1E00-1EFF, U+2020, U+20A0-20AB, U+20AD-20CF, U+2113, U+2C60-2C7F, U+A720-A7FF; font-display: swap;} @font-face { font-family: 'DM Sans'; font-style: normal; font-weight: 400; src: url(https://d20ohkaloyme4g.cloudfront.net/build/css/fonts/dm-sans-400.woff2) format('woff2'); unicode-range: U+0000-00FF, U+0131, U+0152-0153, U+02BB-02BC, U+02C6, U+02DA, U+02DC, U+2000-206F, U+2074, U+20AC, U+2122, U+2191, U+2193, U+2212, U+2215, U+FEFF, U+FFFD; font-display: swap;} @font-face { font-family: 'DM Sans'; font-style: normal; font-weight: 500; src: url(https://d20ohkaloyme4g.cloudfront.net/build/css/fonts/dm-sans-500-ext.woff2) format('woff2'); unicode-range: U+0100-024F, U+0259, U+1E00-1EFF, U+2020, U+20A0-20AB, U+20AD-20CF, U+2113, U+2C60-2C7F, U+A720-A7FF; font-display: swap;} @font-face { font-family: 'DM Sans'; font-style: normal; font-weight: 500; src: url(https://d20ohkaloyme4g.cloudfront.net/build/css/fonts/dm-sans-500.woff2) format('woff2'); unicode-range: U+0000-00FF, U+0131, U+0152-0153, U+02BB-02BC, U+02C6, U+02DA, U+02DC, U+2000-206F, U+2074, U+20AC, U+2122, U+2191, U+2193, U+2212, U+2215, U+FEFF, U+FFFD; font-display: swap;} @font-face { font-family: 'DM Sans'; font-style: normal; font-weight: 700; src: url(https://d20ohkaloyme4g.cloudfront.net/build/css/fonts/dm-sans-700-ext.woff2) format('woff2'); unicode-range: U+0100-024F, U+0259, U+1E00-1EFF, U+2020, U+20A0-20AB, U+20AD-20CF, U+2113, U+2C60-2C7F, U+A720-A7FF; font-display: swap;} @font-face { font-family: 'DM Sans'; font-style: normal; font-weight: 700; src: url(https://d20ohkaloyme4g.cloudfront.net/build/css/fonts/dm-sans-700.woff2) format('woff2'); unicode-range: U+0000-00FF, U+0131, U+0152-0153, U+02BB-02BC, U+02C6, U+02DA, U+02DC, U+2000-206F, U+2074, U+20AC, U+2122, U+2191, U+2193, U+2212, U+2215, U+FEFF, U+FFFD; font-display: swap;} @font-face { font-family: 'DM Sans'; font-style: italic; font-weight: 400; src: url(https://d20ohkaloyme4g.cloudfront.net/build/css/fonts/dm-sans-italic-400-ext.woff2) format('woff2'); unicode-range: U+0100-024F, U+0259, U+1E00-1EFF, U+2020, U+20A0-20AB, U+20AD-20CF, U+2113, U+2C60-2C7F, U+A720-A7FF; font-display: swap;} @font-face { font-family: 'DM Sans'; font-style: italic; font-weight: 400; src: url(https://d20ohkaloyme4g.cloudfront.net/build/css/fonts/dm-sans-italic-400.woff2) format('woff2'); unicode-range: U+0000-00FF, U+0131, U+0152-0153, U+02BB-02BC, U+02C6, U+02DA, U+02DC, U+2000-206F, U+2074, U+20AC, U+2122, U+2191, U+2193, U+2212, U+2215, U+FEFF, U+FFFD; font-display: swap;} @font-face { font-family: 'DM Sans'; font-style: italic; font-weight: 500; src: url(https://d20ohkaloyme4g.cloudfront.net/build/css/fonts/dm-sans-italic-500-ext.woff2) format('woff2'); unicode-range: U+0100-024F, U+0259, U+1E00-1EFF, U+2020, U+20A0-20AB, U+20AD-20CF, U+2113, U+2C60-2C7F, U+A720-A7FF; font-display: swap;} @font-face { font-family: 'DM Sans'; font-style: italic; font-weight: 500; src: url(https://d20ohkaloyme4g.cloudfront.net/build/css/fonts/dm-sans-italic-500.woff2) format('woff2'); unicode-range: U+0000-00FF, U+0131, U+0152-0153, U+02BB-02BC, U+02C6, U+02DA, U+02DC, U+2000-206F, U+2074, U+20AC, U+2122, U+2191, U+2193, U+2212, U+2215, U+FEFF, U+FFFD; font-display: swap;} @font-face { font-family: 'DM Sans'; font-style: italic; font-weight: 700; src: url(https://d20ohkaloyme4g.cloudfront.net/build/css/fonts/dm-sans-italic-700-ext.woff2) format('woff2'); unicode-range: U+0100-024F, U+0259, U+1E00-1EFF, U+2020, U+20A0-20AB, U+20AD-20CF, U+2113, U+2C60-2C7F, U+A720-A7FF; font-display: swap;} @font-face { font-family: 'DM Sans'; font-style: italic; font-weight: 700; src: url(https://d20ohkaloyme4g.cloudfront.net/build/css/fonts/dm-sans-italic-700.woff2) format('woff2'); unicode-range: U+0000-00FF, U+0131, U+0152-0153, U+02BB-02BC, U+02C6, U+02DA, U+02DC, U+2000-206F, U+2074, U+20AC, U+2122, U+2191, U+2193, U+2212, U+2215, U+FEFF, U+FFFD; font-display: swap;} @font-face { font-family: 'Lazzer'; font-style: normal; font-weight: 900; src: url(https://d20ohkaloyme4g.cloudfront.net/build/css/fonts/lazzer-900.woff2) format('woff2'); unicode-range: U+0000-00FF, U+0131, U+0152-0153, U+02BB-02BC, U+02C6, U+02DA, U+02DC, U+2000-206F, U+2074, U+20AC, U+2122, U+2191, U+2193, U+2212, U+2215, U+FEFF, U+FFFD; font-display: swap;}