DDIA/tools

tools

dz / DDIA / tools

Subgraphs

Node Tree

Nodes

SQL
content SQL
parents DDIA/glossary/query_language, DDIA/toc/1_foundations_of_data_systems/02_data_models_query_languages, DDIA/tools
location knowledge/ddia.dz:357

monogdb_aggregration_pipeline
content MongoDB aggregation pipeline
parents DDIA/glossary/query_language, DDIA/toc/1_foundations_of_data_systems/02_data_models_query_languages, DDIA/tools
location knowledge/ddia.dz:363

cypher
content Cypher
parents DDIA/glossary/query_language, DDIA/toc/1_foundations_of_data_systems/02_data_models_query_languages, DDIA/tools
location knowledge/ddia.dz:369

SPARQL
content SPARQL
parents DDIA/glossary/query_language, DDIA/toc/1_foundations_of_data_systems/02_data_models_query_languages, DDIA/tools
location knowledge/ddia.dz:375

datalog
content Datalog
parents DDIA/glossary/query_language, DDIA/toc/1_foundations_of_data_systems/02_data_models_query_languages, DDIA/tools
location knowledge/ddia.dz:381

CSS
content CSS
parents DDIA/glossary/query_language, DDIA/toc/1_foundations_of_data_systems/02_data_models_query_languages, DDIA/tools
location knowledge/ddia.dz:388

XSL_XPath
content XSL/XPath
parents DDIA/glossary/query_language, DDIA/toc/1_foundations_of_data_systems/02_data_models_query_languages, DDIA/tools
location knowledge/ddia.dz:395

bitcask
content Bitcask
parents DDIA/toc/1_foundations_of_data_systems/03_storage_and_retrieval, DDIA/glossary/log_structured, DDIA/tools
location knowledge/ddia.dz:453

levelDB
content LevelDB
parents DDIA/toc/1_foundations_of_data_systems/03_storage_and_retrieval, DDIA/glossary/log_structured, DDIA/tools
location knowledge/ddia.dz:471

cassandra
content Cassandra
children DDIA/glossary/vnode, system_design_interview/tools/cassandra
parents DDIA/toc/1_foundations_of_data_systems/03_storage_and_retrieval, DDIA/glossary/log_structured, DDIA/tools
location knowledge/ddia.dz:477

hbase
content HBase
children DDIA/glossary/region_hbase (A region is a partition in hbase)
parents DDIA/toc/1_foundations_of_data_systems/03_storage_and_retrieval, DDIA/glossary/log_structured, DDIA/tools
location knowledge/ddia.dz:483

lucene
content lucene
parents DDIA/toc/1_foundations_of_data_systems/03_storage_and_retrieval, DDIA/glossary/log_structured, DDIA/tools
location knowledge/ddia.dz:489

JSON
content JSON
parents DDIA/toc/1_foundations_of_data_systems/04_encoding_evolution, DDIA/tools
location knowledge/ddia.dz:529

XML
content XML
parents DDIA/toc/1_foundations_of_data_systems/04_encoding_evolution, DDIA/tools
location knowledge/ddia.dz:534

CSV
content CSV
parents DDIA/toc/1_foundations_of_data_systems/04_encoding_evolution, DDIA/tools
location knowledge/ddia.dz:539

thrift
content thrift
parents DDIA/toc/1_foundations_of_data_systems/04_encoding_evolution, DDIA/glossary/binary_schema, DDIA/tools
location knowledge/ddia.dz:551

protocol_buffers
content protocol_buffers
parents DDIA/toc/1_foundations_of_data_systems/04_encoding_evolution, DDIA/glossary/binary_schema, DDIA/tools
location knowledge/ddia.dz:557

avro
content avro
parents DDIA/toc/1_foundations_of_data_systems/04_encoding_evolution, DDIA/glossary/binary_schema, DDIA/tools
location knowledge/ddia.dz:563

bigtable
content Bigtable is a fully managed wide-column and key-value NoSQL database service for large analytical and operational workloads as part of the Google Cloud portfolio.
children DDIA/glossary/tablet_bigtable, DDIA/glossary/wide_column_store (Bigtable is a prototypical example of a wide-column,store.)
parents DDIA/tools
hyperlink https://en.wikipedia.org/wiki/Bigtable
location knowledge/ddia.dz:708

riak
content Riak is a distributed NoSQL key-value data store that offers high availability, fault tolerance, simplicity, and scalability. It follows Amazon's Dynamo principles and has Erlang's fault-tolerant data replication and automatic distribution for performance and resilience. Riak moved to an open-source project in August 2017, with many Enterprise Edition features incorporated.
children DDIA/glossary/vnode
parents DDIA/tools
hyperlink https://en.wikipedia.org/wiki/Riak
location knowledge/ddia.dz:728

couchbase
content Couchbase Server is a source-available, distributed NoSQL database software that optimizes interactive applications for multiple concurrent users. It provides easy scalability, low latency, and high throughput for key-value access, JSON document manipulation, and presentation. Designed to scale from a single machine to large-scale deployments, Couchbase Server supports clustering.
children DDIA/glossary/vbucket
parents DDIA/tools
hyperlink https://en.wikipedia.org/wiki/Couchbase_Server
location knowledge/ddia.dz:746

sqlite
content SQLite is an open-source database engine written in C that is not a standalone app, but rather a library embedded in software. It is widely used in various embedded systems and web browsers, mobile phones, and other devices. The SQLite library follows PostgreSQL syntax and does not enforce type checking by default, allowing for flexible data insertion.
children sqlite/create_index
hyperlink https://en.wikipedia.org/wiki/SQLite
location knowledge/ddia.dz:833

XA_JTA_transaction
content Jakarta Transactions, formerly, Java Transactions API, enables distributed transactions to be done across multiple X/Open XA resources in a Java environment
children DDIA/toc/2_distributed_data/09_consistency_consensus/if_leader_fails/wait_for_recovery (Many XA/JTA coordinators choose this option)
parents DDIA/tools
hyperlink https://en.wikipedia.org/wiki/Jakarta_Transactions
location knowledge/ddia.dz:1088

ZooKeeper
content Apache ZooKeeper is an open-source server for coordinating distributed systems. It provides a hierarchical key-value store for distributed configuration, synchronization, and naming registry services. Originally part of the Hadoop project but now a top-level Apache project.
children DDIA/toc/2_distributed_data/09_consistency_consensus (Zookeeper plays an important part in providing,an "outsourced" consesus, failure detection,,and membership service that applicatoins can use.)
parents DDIA/tools
hyperlink https://en.wikipedia.org/wiki/Apache_ZooKeeper
location knowledge/ddia.dz:1095

MapReduce
content MapReduce is a programming model for processing and generating big data sets, combining a map procedure that filters and sorts data with a reduce method that performs summaries, orchestrated by a framework for parallel and distributed processing on a cluster. It is inspired by functional programming concepts, but adapted for big data processing scalability and fault tolerance. MapReduce can be implemented in multiple programming languages with varying optimizations and is part of open-source systems like Apache Hadoop.
children DDIA/toc/3_derived_data/10_batch_processing, DDIA/toc/3_derived_data/10_batch_processing/unix tools (the design philosophy of unix tools carried over,into MapReduce), DDIA/toc/3_derived_data/mapreduce_join_algos
parents DDIA/glossary/query_language, DDIA/toc/1_foundations_of_data_systems/02_data_models_query_languages, DDIA/tools
hyperlink https://en.wikipedia.org/wiki/MapReduce
location knowledge/ddia.dz:1164