Big Data • Computer Science • ETL&DW
Since I started to play with cluster, I thought there was no mission which was not able to be completed […]
Big Data • Computer Science • ETL&DW
Since I started to play with cluster, I thought there was no mission which was not able to be completed […]
Big Data • Computer Science • ETL&DW
SCD II is widely used to process dimensional data with all historical information. Each change in dimensions will be recorded […]
1990, engineers were fighting for optimizing code performance and increasing CPU speeds. 1994, MPI started to be the dominant model […]
Concurrency control is normal in OLTP operations, but for OLAP, not really. So I didn’t take care of it until […]
Azure Storage Explorer/Data Lake Ghost file In some rare case, if you delete files in ASE, then you call APIs […]
Computer Science • Linux • Others
Since I got a RP4 8GB from microcenter with a fan case, I tried to use it to replace pc […]
Since Microsoft moves from windows to cloud in the last 10 years, he is more welcome to opensource especially Linux. […]
Dr.Kazuaki Ishizaki gives a great summary of spark 3.0 features in his presentation “SQL Performance Improvements at a Glance in […]
Columnstore is the most popular storage tech within big data. We must have already heard parquet, delta lake. They are […]
Computer Science • HPC • Linux • Machine Learning
Since Microsoft upgraded WSL to version 2, it introduced full Linux kernel and full VM manage features. Except the performance […]