Skip to content

This project involves a comprehensive analysis of Netflix's movies and TV shows data using SQL. The goal is to extract valuable insights and answer various business questions based on the dataset. The following README provides a detailed account of the project's objectives, business problems, solutions, findings, and conclusions.

Notifications You must be signed in to change notification settings

Venus-0107/NETFLIX_PROJECT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

NETFLIX_PROJECT

Overview

This project involves a comprehensive analysis of Netflix's movies and TV shows data using SQL. The goal is to extract valuable insights and answer various business questions based on the dataset. The following README provides a detailed account of the project's objectives, business problems, solutions, findings, and conclusions.

Objectives

  • Analyze the distribution of content types (movies vs TV shows).
  • Identify the most common ratings for movies and TV shows.
  • List and analyze content based on release years, countries, and durations.
  • Explore and categorize content based on specific criteria and keywords.

Dataset

The data for this project is sourced from the Kaggle dataset:

##importing the dataset

select * from dbo.netflix_titles

Business Problems and Solutions

1. Count the Number of Movies vs TV Shows

select 
type,
count(*) as totalcontent 
from dbo.netflix_titles
group by type

Objective: Determine the distribution of content types on Netflix.

2. Find the Most Common Rating for Movies and TV Shows

select rating,
count(*) as commonrating
from dbo.netflix_titles
group by rating

Objective: Identify the most frequently occurring rating for each type of content.

3. List All Movies Released in a Specific Year (e.g., 2020)

select title
from dbo.netflix_titles
where type = 'movie' and
release_year= '2020'

Objective: Retrieve all movies released in a specific year.

4. Find the Top 5 Countries with the Most Content on Netflix

select TOP 5 country,
count(title)
from dbo.netflix_titles 
group by country
order by count(title) desc 

Objective: Identify the top 5 countries with the highest number of content items.

5. Identify the Longest Movie

select * from dbo.netflix_titles
where type = 'movie' 
and duration = (select max(duration) from dbo.netflix_titles)

Objective: Find the movie with the longest duration.

6. Find All Movies/TV Shows by Director 'Rajiv Chilaka'

select 
*  from dbo.netflix_titles 
where director  like '%rajiv chilaka%'

Objective: List all content directed by 'Rajiv Chilaka'.

7. List All TV Shows with More Than 5 Seasons

select *,
trim(trailing ' season' from duration) as season
from dbo.netflix_titles 
where type= 'tv show' 
and trim(trailing ' season' from duration) > 5

Objective: Identify TV shows with more than 5 seasons.

8. Find All Content Without a Director

select * from dbo.netflix_titles where director is null

Objective: List content that does not have a director.

9. find the movies actor is 'salman khan'

select * from dbo.netflix_titles where cast like '%salman khan%'

Objective: find the movies actor is 'salman khan'

10. Categorize Content Based on the Presence of 'Kill' and 'Violence' Keywords

select *,
case 
when description like '%kills%' or 
description like '%violence%' then 'bad_content'
else 'good_content'
end category
from dbo.netflix_titles

Objective: Categorize content as 'Bad' if it contains 'kill' or 'violence' and 'Good' otherwise. Count the number of items in each category.

About

This project involves a comprehensive analysis of Netflix's movies and TV shows data using SQL. The goal is to extract valuable insights and answer various business questions based on the dataset. The following README provides a detailed account of the project's objectives, business problems, solutions, findings, and conclusions.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published