Azure Cosmos DB

Azure Cosmos DB supports multiple application programming interfaces (APIs) that enable developers to use the programming semantics of many common kinds of data store to work with data in a Cosmos DB database. The internal data structure is abstracted, enabling developers to use Cosmos DB to store and query data using APIs with which they’re already familiar. When to use Cosmos DB Cosmos DB is a highly scalable database management system. Cosmos DB automatically allocates space in a container for your partitions, and each partition can grow up to 10 GB in size. Indexes are created and maintained automatically. There’s… Read more

Azure BLOB Storage

Azure Blob Storage is a service that enables you to store massive amounts of unstructured data as binary large objects, or blobs, in the cloud. Blobs are an efficient way to store data files in a format that is optimized for cloud-based storage, and applications can read and write them by using the Azure blob storage API. Azure Blob Storage supports three different types of blob: Blob storage provides three access tiers, which help to balance access latency and storage cost:

Top 6 Business Intelligence Tools 2024

What are business intelligence tools? Business Intelligence (BI) tools are a diverse set of software applications, platforms, and solutions designed to extract, transform, and present data to support data analysis, trend identification, and strategic decision-making within an organization. Capabilities of BI tools: Data integration: BI tools can connect to and consolidate data from multiple different sources, such as databases, spreadsheets, and cloud-based applications. Data transformation: since data can come in many different formats, this aspect of BI tools is crucial for improving the compatibility and usability of the data. Users can transform data by cleaning, structuring, and aggregating it. Data visualization: the bread-and-butter… Read more

What is looker?

Looker is a cloud-based Business Intelligence (BI) tool that helps you explore, share, and visualize data that drive better business decisions. It is now a part of the Google Cloud Platform When to Use Looker If you’re looking for customized visuals, collaborative dashboards, and a single source of truth, plus top-of-the-line customer support, Looker might be the best BI platform for you. Being fully browser-based cuts down on potential confusion as your team gets up and running, and pricing customized to your company means you get exactly what you need to meet your company’s analytics goals. When Not to Use… Read more

Slowly Changing Dimensions (SCD) Types

Slowly Changing Dimensions in Data Warehouse is an important concept that is used to enable the historic aspect of data in an analytical system. As you know, the data warehouse is used to analyze historical data, it is essential to store the different states of data. In data warehousing, we have fact and dimension tables to store the data. Dimensional tables are used to analyze the measures in the fact tables. In a data environment, data is initiated at operational databases and data will be extracted-transformed-loaded (ETL) to the data warehouse to suit the analytical environment. Customer, Product are examples… Read more

Types of Dimensions in Data warehouse

What is Dimension? Dimension table contains the data about the business. The primary keys of the dimension tables are used in Fact tables with Foreign key relationship. And the remaining columns in the dimension is normal data which is the information about the Objects related to the business.Eg: Product,Customer,Orders,Company,Date etc. Slowly changing dimensions refer to how data in your data warehouse changes over time. Slowly changing dimensions have the same natural key but other data columns that may or may not change over time depending on the type of dimensions that it is.  Slowly changing dimensions are important in data… Read more

Star Schema vs Snowflake Schema: differences

What is a star schema? A star schema is a database schema used to store data in a star format. This schema consists of a central table, called the “fact table,” and a number of directly connected other tables, called “dimension tables.” The fact table contains information about metrics or measures, while the dimension tables contain information about descriptive attributes. The star schema is very simple and easy to understand, making it ideal for cloud data warehousing and business intelligence applications. What is a snowflake schema? A snowflake schema is a type of database schema that is used to store data in a more… Read more

Difference between PySpark and Python

PySpark is the Python API that is used for Spark. Basically, it is a collection of Apache Spark, written in Scala programming language and Python programming to deal with data. Spark is a big data computational engine, whereas Python is a programming language. To work with PySpark, one needs to have basic knowledge of Python and Spark. The market trends of PySpark and Python are expected to increase in the next few years. Both terms have their own features, limitations, and differences. So, let’s check what aspects they differ. PySpark PySpark is a python-based API used for the Spark implementation and… Read more

ACID Properties

In the context of database transaction processing, the acronym ACID refers to the four key properties of a transaction: • Atomicity • Consistency • Isolation • Durability Atomicity : All changes to data are performed as if they are a single operation. That is, all the changes are performed, or none of them are.For example, in an application that transfers funds from one account to another, the atomicity property ensures that, if a debit is made successfully from one account, the corresponding credit is made to the other account. Consistency :  data is in a consistent state when a transaction starts and when… Read more

SSIS – Non Blocking , Partially Blocking and Full Blocking

Data flow transformations in SSIS use memory/buffers in different ways.  The way transformation uses memory can impact the performance of your package.  Transformations memory/Buffer usage are classified into 3 categories: 1.Non Blocking  2.Semi Blocking  3. Full Blocking All the dataflow components can be categorized to be either Synchronous or Asynchronous. Synchronous vs Asynchronous :  Synchronous components The output of an synchronous component uses the same buffer as the input.  Reusing of the input buffer is possible because the output of an synchronous component always contain exactly the same number of records as the input.            … Read more

Join SQLChampion

Get practical SQL Server, Azure, Power BI, Data Engineering, and AI insights delivered directly to your inbox.

No spam. Unsubscribe anytime.