Syracuse University, Masters of Applied Data Science - IST 722 Data Warehouse
-
Updated
Mar 24, 2020 - TSQL
Syracuse University, Masters of Applied Data Science - IST 722 Data Warehouse
ETL and Dimensional Modeling with PostgreSQL
Design and develop a dimensional data model for University Student Degree Program Enrollment and Performance using data modeling tool Dataedo
Analysis of a dataset containing information collected on food inspection done at various restaurants in Dallas.
2022 SCC Data Science & Analytics Workshop on Databases
An examination of a dataset collecting data from food inspections conducted at several Boston restaurants.
Analysis of New York State Police Department Arrests dataset. Created Dimensional Model for the provided dataset. Using Alteryx and Talend, built ETL pipelines to process, clean the data and create dimensions and facts in the destination database. Further, visualized the necessary details of the database using Tableau and PowerBI.
dimensional modeling of AdventureWorks2017 for sales, creating a DataMart. It includes an ETL pipeline that loads the data from AdventureWorks2017 to AdventureWorksDM using SQL Server Integration Services (SSIS) and implements Slowly Changing Dimension (SCD) handling using the SCD wizard and Merge statement.
Starts with a conceptual model ends with a Tableau interactive dash board. In between there is building ER diagrams, forward engineering to build normalized databases, dimensional data modelling and visualizations in tableau.
Creating erd using MySQL Workbench, Forward engineering, Dimensional data model, Tableau visualisation
Case Study SQL Reporting
Coursera DWH for BI Capstone (Implementation)
Designed a multi-dimensional data model using LucidChart. Developed ELT pipeline using Python/Pandas. S&P 500 data obtained via yfinance, an open-source library. Output normalized data to excel. Performed analysis and generated reports with Power BI.
Business Intelligence FER labs
Analytics Engineering with dbt on Bigquery. This project implements the use of Analytics Engineering Best practices to build a dimensional data model, using dbt (data build tool) and BigQuery.
A Data Warehousing project for retail sales using dimension modelling best practices with SCD type 2 on AWS Redshift. Utilizing AWS Lambda, Glue Workflows and Python Shell jobs to create and automate an ELT pipeline where batch data coming into S3 is loaded onto Redshift and necessary transformations are performed to meet requirements.
A comprehensive dimensional model for COVID data, enabling insights for future vaccination campaigns through robust visual analytics.
creating a data warehouse for a football game management company and some SQL queries to analyse data.
Add a description, image, and links to the dimensional-modeling topic page so that developers can more easily learn about it.
To associate your repository with the dimensional-modeling topic, visit your repo's landing page and select "manage topics."