55 tags in total
Papers (31)
- 2023/05/24 SQL Server Column Store Indexes
- 2023/05/21 Advanced Database Systems: History of Databases
- 2023/05/07 Column-Stores vs. Row-Stores: How Different Are They Really?
- 2023/05/06 Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics
- 2023/05/04 Building An Elastic Query Engine on Disaggregated Storage
- 2023/05/03 What Goes Around Comes Around
- 2023/02/05 Efficiently Compiling Efficient Query Plans for Modern Hardware
- 2023/02/04 Generating code for holistic query evaluation
- 2023/02/01 Implementing Database Operations Using SIMD Instructions
- 2023/01/19 SIMD-Scan: Ultra Fast in-Memory Table Scan using on-Chip Vector Processing Units
- 2023/01/13 Accelerating Analytics with Dynamic In-Memory Expressions
- 2023/01/07 Materialization Strategies in the Vertica Analytic Database: Lessons Learned
- 2023/01/05 MonetDB/X100: Hyper-Pipelining Query Execution
- 2023/01/03 Access Path Selection in Main-Memory Optimized Data Systems: Should I Scan or Should I Probe?
- 2023/01/02 Photon: A Fast Query Engine for Lakehouse Systems
- 2022/11/18 Papers on Multi-Tenancy in Cloud Environments
- 2022/11/17 How to Read a Paper
- 2022/09/03 Architecture of a Database System: Paper Translation
- 2022/08/11 Are You Sure You Want to Use mmap in a DBMS?
- 2022/07/03 What's Really New with NewSQL Paper
- 2022/07/02 Snowflake Paper
- 2022/07/01 Spark SQL Paper
- 2022/06/24 Delta Lake Paper
- 2022/06/16 Chubby Paper
- 2022/05/16 Raft Paper
- 2022/05/01 Paxos Made Simple Paper
- 2022/04/20 BigTable Paper
- 2022/04/20 MapReduce Paper
- 2022/04/13 GFS Paper
- 2022/03/20 Hive Paper
- 2021/11/06 Spark Paper
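Work Log (17)
- 2024/10/24 Personal Work Overview
- 2023/11/09 TPCx-HS Optimization Summary
- 2023/05/14 Resource Isolation: Dynamic Loading of Configuration Changes
- 2023/05/13 Resource Isolation Design
- 2023/05/12 Image Merging & Configuration File Synchronization
- 2022/09/18 High Availability Design
- 2022/08/01 Kyuubi Design Survey
- 2022/07/30 Disaster Recovery Deployment Survey
- 2022/06/05 Custom HiveServer Design
- 2022/01/25 Metadata Management in Several Open-Source Databases
- 2021/12/30 DB-bridge, a Data Migration Tool
- 2021/12/29 HANA Survey
- 2021/12/28 Teradata Survey
- 2021/12/27 Integrating Calcite into the Unified Query Project
- 2021/12/26 Introduction to the Unified Query Project
- 2021/12/26 QuickSQL Execution Process
- 2021/11/26 HANA and Teradata Database Migration Survey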
cmu-database (7)
- 2023/05/21 Advanced Database Systems: History of Databases
- 2023/01/22 Advanced Database Systems: Query Execution & Processing
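- 2022/07/22 Carnegie Mellon Database Course-1
- 2022/07/04 Carnegie Mellon Database Course-2
- 2022/07/04 Carnegie Mellon Database Course-3
- 2022/07/04 Carnegie Mellon Database Course-4
- 2022/07/04 Carnegie Mellon Database Course-5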
leveldb (7)
- 2023/04/14 LevelDB Multi-Versioning and Compaction
- 2023/04/11 LevelDB Utility Classes
- 2023/04/09 LevelDB SSTable Module
- 2023/04/06 LevelDB MemTable Module
- 2023/04/03 LevelDB Log Module
- 2023/03/30 LevelDB Public Interfaces
- 2023/03/27 LevelDB Basic Concepts
spark (7)
- 2024/09/09 Spark-Streaming Internals
- 2024/03/02 Spark Internals: Parsing Process and Catalog
- 2023/01/02 Photon: A Fast Query Engine for Lakehouse Systems
- 2022/07/01 Spark SQL Paper
- 2022/03/06 Parsing Spark Logical Plans
- 2022/03/04 Spark Injection Rules
- 2021/11/06 Spark Paper
mysql (6)
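- 2023/01/01 MySQL Concurrency
- 2022/12/22 MySQL Recovery
- 2022/12/19 MySQL Caching
- 2022/12/15 MySQL Query Analysis
- 2022/12/13 Analyzing MySQL Storage Files with Tools
- 2022/12/02 MySQL File Storage Structure
Databases (6)
- 2022/11/20 Debugging MySQL
- 2022/09/03 Architecture of a Database System: Paper Translation
- 2022/08/11 Are You Sure You Want to Use mmap in a DBMS?
- 2022/08/08 Why Uber Migrated from PostgreSQL to MySQL
- 2022/07/02 Database Learning Resources from PingCAP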
- 2021/11/26 TPC-DS
Query Optimization (4)
- 2023/01/13 Accelerating Analytics with Dynamic In-Memory Expressions
- 2023/01/07 Materialization Strategies in the Vertica Analytic Database: Lessons Learned
- 2023/01/05 MonetDB/X100: Hyper-Pipelining Query Execution
- 2023/01/03 Access Path Selection in Main-Memory Optimized Data Systems: Should I Scan or Should I Probe?
openlogreplicator (3)
- 2024/08/25 Some Changes to OpenLogReplicator
- 2024/07/15 Internals of OpenLogReplicator, a CDC Tool for Oracle
- 2024/06/15 Compiling OpenLogReplicator, a CDC Tool for Oracle
calcite (2)
- 2021/12/27 Integrating Calcite into the Unified Query Project
- 2021/12/26 Introduction to the Unified Query Project
hive (2)
- 2024/01/14 Hive MetaStore Implementation and Optimization
- 2022/03/20 Hive Paper
iceberg (2)
- 2024/02/29 Compaction in Apache Iceberg
- 2024/02/26 The Life of a Read/Write Query for Apache Iceberg Tables
k8s (2)
- 2024/11/04 k8s Networking
- 2022/09/18 k8s Pod Usage Summary
lakehouse (2)
- 2024/01/10 Analyzing and Comparing Lakehouse Storage Systems
- 2023/05/06 Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics
paxos (2)
- 2022/06/16 Chubby Paper
- 2022/05/14 Basic Paxos Summary
raft (2)
- 2022/06/05 Notes on Raft: A Consensus Algorithm for Replicated Logs
- 2022/05/16 Raft Paper
scala (2)
- 2024/10/15 Scala Summary
- 2022/02/24 Some Features of Scala
simd (2)
- 2023/02/01 Implementing Database Operations Using SIMD Instructions
- 2023/01/19 SIMD-Scan: Ultra Fast in-Memory Table Scan using on-Chip Vector Processing Units
snowflake (2)
- 2023/05/04 Building An Elastic Query Engine on Disaggregated Storage
- 2022/07/02 Snowflake Paper
Vectorization (2)
- 2023/02/01 Implementing Database Operations Using SIMD Instructions
- 2023/01/19 SIMD-Scan: Ultra Fast in-Memory Table Scan using on-Chip Vector Processing Units
Query Compilation (2)
- 2023/02/05 Efficiently Compiling Efficient Query Plans for Modern Hardware
- 2023/02/04 Generating code for holistic query evaluation
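Compute Frameworks (2)
- 2022/04/20 MapReduce Paper
- 2021/11/06 Spark Paper
Reading Notes (2)
- 2021/12/19 Papers from the Distributed Database Course
- 2021/12/02 Reading Notes on Designing Data-Intensive Applications
B-tree (1)
- 2022/10/15 B+ Tree Execution Process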
data_ingestion (1)
- 2024/01/15 Data Ingestion: Architectural Patterns
deltalake (1)
- 2022/06/24 Delta Lake Paper
facebook (1)
- 2024/02/02 Data engineering at Meta
hana (1)
- 2021/12/29 HANA Survey
janino (1)
- 2024/05/05 Basic Usage of Janino
kudu (1)
- 2022/02/13 Kudu Model Design
kyuubi (1)
- 2022/08/01 Kyuubi Design Survey
llvm (1)
mapreduce (1)
- 2022/03/26 MapReduce Is a Major Step Backwards
newsql (1)
- 2022/07/03 What's Really New with NewSQL Paper
oceanbase (1)
- 2023/03/26 Talks from the OceanBase Developer Conference
parquet (1)
- 2024/10/02 Parquet for Spark
presto (1)
- 2023/07/12 Presto Usage at Major Companies
quick-sql (1)
- 2021/12/26 QuickSQL Execution Process
teradata (1)
- 2021/12/28 Teradata Survey
tpcx-hs (1)
- 2023/11/09 TPCx-HS Optimization Summary
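Distributed Systems (1)
- 2022/04/13 GFS Paper
Column Store (1)
Dynamic Injection (1)
- 2022/05/02 How Java APM Tools Work
Multi-Tenancy (1)
- 2022/11/18 Papers on Multi-Tenancy in Cloud Environments
Big Data (1)
- 2023/11/19 The History of Big Data
Storage (1)
- 2022/04/13 GFS Paper
Microservices (1)
- 2021/09/27 The IDEALS of Microservice Design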
Data Models (1)
- 2023/05/03 What Goes Around Comes Around
Data Migration (1)
- 2021/12/30 DB-bridge, a Data Migration Tool
Assembly (1)
- 2021/10/10 Program Execution from an Assembly Perspective
Testing (1)
- 2021/11/26 TPC-DS
Indexes (1)
- 2023/05/24 SQL Server Column Store Indexes
Compiler Principles (1)
- 2022/02/26 Implementing Four-Function Arithmetic with javacc
Networking (1)
- 2022/10/16 Container Networking
Random Musings (1)
- 2021/09/22 First Post