Hive and Comma Separated Value Report

Hive lets you run SQL statements (with some differences from standard SQL) against data stored in Hadoop. It translates each data request into MapReduce code that executes across one or more nodes.
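
To make the idea of "SQL turned into MapReduce" concrete, here is a minimal conceptual sketch in Python of how a grouped aggregate (average price per genre) breaks down into map, shuffle, and reduce phases. It illustrates the general pattern only and is not the plan Hive actually generates; the sample rows and the function names `map_phase`, `shuffle`, and `reduce_phase` are hypothetical.

```python
from collections import defaultdict

# Hypothetical (genre, price) rows; NOT taken from the assignment's CSV files.
rows = [("Animation", 10.0), ("Animation", 20.0), ("Drama", 15.0), ("", 5.0)]

def map_phase(records):
    """Emit one (key, value) pair per input record, as a mapper would."""
    for genre, price in records:
        yield genre, price

def shuffle(pairs):
    """Group values by key, mimicking the shuffle step between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_phase(grouped):
    """Collapse each key's list of values into a single average."""
    return {key: sum(values) / len(values) for key, values in grouped.items()}

avg_price_by_genre = reduce_phase(shuffle(map_phase(rows)))
print(avg_price_by_genre)
```

On a real cluster the map and reduce functions run in parallel on different nodes, and the grouping step moves data between them; here everything runs in one process for readability.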

  • Requirements
  • Complete the following steps (all of the database activity must be in Hive):
    • 1. Review the three CSV (comma-separated value) files provided with this assignment.
    • 2. Write a script to create three tables for the CSV files.
      • a. One table should be called DVDxxx where xxx are your initials.
      • b. One table should be called Actorxxx where xxx are your initials.
      • c. One table should be called DVDActorxxx where xxx are your initials.
      • d. The script file should be called MakeTablesxxx, where xxx are your initials.
    • 3. Create the tables in Hive by running the script, and demonstrate its execution.
    • 4. Import the data from the three CSV files into the three tables.
      • a. (Screenshot) Show that the data is actually there with a simple SELECT for each table
    • 5. Create one script called Demoxxx (where xxx are your initials) to accomplish each of the following required results in Hive. Please comment your script indicating the particular required results you are addressing:
      • a. All DVD titles, prices and genres
      • b. All DVD titles and prices sorted by price in descending order
      • c. DVD titles and prices for the genre Animation
      • d. All the genres for the DVDs, but only displayed once (not duplicated)
      • e. The count of the number of genres, including the blank genres
      • f. The average price of DVDs for each genre
      • g. The list of DVD titles and their release dates, where the release date is in the year 2005.
    • 6. (Screenshot) Demonstrate each of the required results in question 5.
    • Submit: the two script files and the eight screenshots in a Word file.
    • Acknowledgement: The list of DVDs and Actors was retrieved from http://www.hometheaterinfo.com/dvdlist.htmss
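
The queries in step 5 can be prototyped locally before writing the Demoxxx script. The sketch below uses Python's built-in `sqlite3` module purely as a stand-in so the SQL can be tried without a cluster; the assignment itself requires every statement to run in Hive. The column names (`title`, `price`, `genre`, `released`), the ISO date format, and the placeholder rows are all assumptions, since the CSV files are not shown here. Blank genres are assumed to be stored as empty strings rather than NULL.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Assumed schema: the real CSV headers and types may differ.
cur.execute("CREATE TABLE DVDxxx (title TEXT, price REAL, genre TEXT, released TEXT)")

# Hypothetical placeholder rows, NOT data from the assignment's files.
cur.executemany(
    "INSERT INTO DVDxxx VALUES (?, ?, ?, ?)",
    [
        ("Title A", 19.99, "Animation", "2005-03-01"),
        ("Title B", 9.99, "Drama", "2004-11-15"),
        ("Title C", 14.99, "Animation", "2005-07-20"),
        ("Title D", 4.99, "", "2003-01-05"),
    ],
)

# One query per required result in step 5, keyed by the step label.
queries = {
    "5a": "SELECT title, price, genre FROM DVDxxx",
    "5b": "SELECT title, price FROM DVDxxx ORDER BY price DESC",
    "5c": "SELECT title, price FROM DVDxxx WHERE genre = 'Animation'",
    "5d": "SELECT DISTINCT genre FROM DVDxxx",
    # Counts distinct genres; an empty string counts, a NULL would not.
    "5e": "SELECT COUNT(DISTINCT genre) FROM DVDxxx",
    "5f": "SELECT genre, AVG(price) FROM DVDxxx GROUP BY genre",
    # Relies on the assumed 'YYYY-MM-DD' text format for release dates.
    "5g": "SELECT title, released FROM DVDxxx WHERE released LIKE '2005%'",
}

results = {label: cur.execute(sql).fetchall() for label, sql in queries.items()}

for label, result_rows in results.items():
    print(label, result_rows)
```

These statements use only common SQL features, so they should need little change for Hive, but each one still has to be checked in Hive itself. If the release date is stored as a date type in Hive, `year(released) = 2005` is a likely alternative to the `LIKE` filter. Step 5e can also be read as a plain row count of genre values, so confirm the intended meaning before settling on `COUNT(DISTINCT ...)`.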