Big Data – Page 2 – NEO

Category: Big Data

neo_aksa February 13, 2021 Optimize concurrency for merge operation in delta table

Big Data

Concurrency control is normal in OLTP operations, but for OLAP, not really. So I didn’t take care of it until […]

neo_aksa January 25, 2021 Some features need to be improved in Azure Data Products

Big Data • Computer Science

Azure Storage Explorer/Data Lake Ghost file In some rare case, if you delete files in ASE, then you call APIs […]

neo_aksa October 9, 2020 Spark 3.0 new features – Learning from Dr.Kazuaki Ishizaki

Big Data • Computer Science

Dr.Kazuaki Ishizaki gives a great summary of spark 3.0 features in his presentation “SQL Performance Improvements at a Glance in […]

neo_aksa July 30, 2020 Columnstore index for MS SQL SERVER

Big Data • Computer Science

Columnstore is the most popular storage tech within big data. We must have already heard parquet, delta lake. They are […]

neo_aksa March 16, 2020 Data Factory CI/CD in Azure DevOps

Big Data • Computer Science • ETL&DW

Azure Pipeline is consist of two parts: pipeline and release. They represent CI and CD separately. Build Pipeline – to […]

neo_aksa February 18, 2020 Apache Kafka in Practice – 1

Big Data

First thing first, I should remind all visitors I am not a master in Kafka. Actually I am just a […]

neo_aksa January 21, 2020 Apache Kafka Concepts and Theory.

Big Data • Computer Science

It’s a little bit late to talk about Kafka, since this technology has been widely used for a long time. […]

neo_aksa December 17, 2019 Spy into metadata-driven ELT on Datafactory and Databricks

Big Data • Computer Science

Azure provides datafactory and azure databricks for handling with ELT pipeline on a scalable environment. Datafactory provides more integrated solution […]

neo_aksa September 23, 2019 Our new cloud architecture launched!

Big Data

After so many discussion, evaluation and testing, we finally launched a basic architecture for Azure cloud. I hid some key […]

neo_aksa September 20, 2019 Set up MySQL on Azure Ubuntu and compare with Azure SQL

Big Data • Computer Science

I will combine three parts: Create Ubuntu VM & attach data disk, Install and configure MySQL, Performance comparison with Azure […]