Add minimal documentation for hypothetical partitioning.

2026-05-24 09:38:21 +00:00 · 2018-12-03 15:23:31 +01:00 · 2018-12-03 15:23:31 +01:00 · 54665eeb1b
commit 54665eeb1b
parent 12bc46965e
4 changed files with 136 additions and 13 deletions
--- a/docs/hypothetical_indexes.rst
+++ b/docs/hypothetical_indexes.rst
@ -1,10 +0,0 @@
-.. _hypothetical_indexes:
-
-Hypothetical Indexes
-====================
-
-An hypothetical, or virtual, index is an index that doesn't really exists, and
-thus doesn't cost CPU, disk or any resource to create.  They're useful to know
-if specific indexes can increase performance for problematic queries, since
-you can know if PostgreSQL will use these indexes or not without having to
-spend resources to create them.
--- a/docs/hypothetical_objects.rst
+++ b/docs/hypothetical_objects.rst
@ -0,0 +1,28 @@
+.. _hypothetical_objects:
+
+Hypothetical objects
+====================
+
+HypoPG supports two kinds of hypothetical objects: hypothetical indexes and
+hypothetical partitioning.
+
+.. _hypothetical_indexes:
+
+Hypothetical indexes
+--------------------
+A hypothetical, or virtual, index is an index that doesn't really exist, and
+thus doesn't cost CPU, disk or any other resource to create.  Such indexes are
+useful to find out whether specific indexes can improve performance for
+problematic queries, since you can check whether PostgreSQL would use them
+without having to spend resources creating them.
+
+.. _hypothetical_partitioning:
+
+Hypothetical partitioning
+-------------------------
+Hypothetical partitioning, available for PostgreSQL version 10 and above,
+lets you hypothetically apply a partitioning scheme to a real table, as you
+would do with declarative partitioning.  PostgreSQL will act as if this table
+were really partitioned, so you can quickly test multiple partitioning
+schemes, see how each of them changes the behavior of your queries, and check
+which one is best for your specific application.
--- a/docs/index.rst
+++ b/docs/index.rst
@ -5,8 +5,10 @@ HypoPG

 `HypoPG <https://github.com/HypoPG/hypopg>`_ is a `PostgreSQL
 <https://postgresql.org>`_ extension, adding support for
-:ref:`hypothetical_indexes`.
-It's compatible with **PostgreSQL 9.2 and above**.
+:ref:`hypothetical_objects`: :ref:`hypothetical_indexes` and
+:ref:`hypothetical_partitioning` (only for PostgreSQL 10 and above).
+
+This extension is compatible with **PostgreSQL 9.2 and above**.

 .. note::

@ -20,7 +22,7 @@ It's compatible with **PostgreSQL 9.2 and above**.
   :maxdepth: 1
   :caption: Contents:

-   hypothetical_indexes
+   hypothetical_objects
   installation
   usage
   contributing
--- a/docs/usage.rst
+++ b/docs/usage.rst
@ -169,3 +169,106 @@ Some other convenience functions are available:

 - **hypopg_drop_index(oid)**: remove the given hypothetical index
 - **hypopg_reset()**: remove all hypothetical indexes
+
+Hypothetical partitioning
+-------------------------
+
+.. note::
+
+   This is only possible for PostgreSQL 10 and above.  The partitioning
+   possibilities depend on the PostgreSQL version.  For instance, you can't
+   create a hypothetical hash partition when using PostgreSQL 10.
+
+
+For clarity, let's see how it works with a very simple test case:
+
+.. code-block:: psql
+
+  CREATE TABLE hypo_part_range (id integer, val text);
+  INSERT INTO hypo_part_range SELECT i, 'line ' || i FROM generate_series(1, 29999) i;
+
+This is a simple table, containing some rows and without indexes.  As
+expected, retrieving a single row requires a sequential scan:
+
+.. code-block:: psql
+
+  EXPLAIN SELECT * FROM hypo_part_range WHERE id = 2;
+                              QUERY PLAN                            
+  ------------------------------------------------------------------
+   Seq Scan on hypo_part_range  (cost=0.00..537.99 rows=1 width=14)
+     Filter: (id = 2)
+  (2 rows)
+
+Now, let's try to hypothetically partition this table with a range partitioning
+scheme.  For that, we have two functions:
+
+- **hypopg_partition_table**: it has two mandatory arguments.  The first
+  argument is the table to be hypothetically partitioned, and the second is the
+  `PARTITION BY` clause, as you would use for declarative partitioning.
+- **hypopg_add_partition**: it has two mandatory arguments, and one optional.
+  The first mandatory argument is the partition name, the second is the
+  `PARTITION OF` clause, and the optional argument is a `PARTITION BY` clause,
+  if you want to declare multiple levels of partitioning.
+
+For instance:
+
+.. code-block:: psql
+
+  SELECT hypopg_partition_table('hypo_part_range', 'PARTITION BY RANGE(id)');
+  SELECT tablename FROM hypopg_add_partition('hypo_part_range_1_10000', 'PARTITION OF hypo_part_range FOR VALUES FROM (1) TO (10000)');
+  SELECT tablename FROM hypopg_add_partition('hypo_part_range_10000_20000', 'PARTITION OF hypo_part_range FOR VALUES FROM (10000) TO (20000)');
+  SELECT tablename FROM hypopg_add_partition('hypo_part_range_20000_30000', 'PARTITION OF hypo_part_range FOR VALUES FROM (20000) TO (30000)');
+
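+A minimal sketch of the optional third argument, which declares a second
+level of partitioning.  The table `hypo_part_multi` and its columns are
+hypothetical names used only for illustration, and the exact behavior may
+depend on your HypoPG and PostgreSQL versions:
+
+.. code-block:: psql
+
+  -- Hypothetical table, used only for this illustration
+  CREATE TABLE hypo_part_multi (id integer, dpt integer, val text);
+  SELECT hypopg_partition_table('hypo_part_multi', 'PARTITION BY RANGE(id)');
+  -- The third argument makes this partition itself partitioned by list
+  SELECT tablename FROM hypopg_add_partition('hypo_part_multi_1_10000', 'PARTITION OF hypo_part_multi FOR VALUES FROM (1) TO (10000)', 'PARTITION BY LIST(dpt)');
+  -- Sub-partition attached to the hypothetical partition declared above
+  SELECT tablename FROM hypopg_add_partition('hypo_part_multi_1_10000_dpt1', 'PARTITION OF hypo_part_multi_1_10000 FOR VALUES IN (1)');
+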
+.. note::
+
+  If you need to declare bounds on a textual column, the dollar-quoting
+  notation will be helpful.  For instance:
+
+  .. code-block:: psql
+
+    SELECT hypopg_add_partition('p_name', $$PARTITION OF tbl FOR VALUES FROM ('aaa') TO ('aab')$$);
+
+Now, let's see what happens if we try to retrieve a row of the hypothetically
+partitioned table:
+
+.. code-block:: psql
+
+  EXPLAIN SELECT * FROM hypo_part_range WHERE id = 2;
+                                             QUERY PLAN                                           
+  ------------------------------------------------------------------------------------------------
+   Append  (cost=0.00..179.95 rows=1 width=14)
+     ->  Seq Scan on hypo_part_range hypo_part_range_1_10000  (cost=0.00..179.95 rows=1 width=14)
+
+We can see that, since there's an Append node, PostgreSQL acted as if the table
+was partitioned, and that all partitions but one were pruned.
+
+It's also possible to create a hypothetical index on the hypothetical
+partitions:
+
+
+.. code-block:: psql
+
+  SELECT hypopg_create_index('CREATE INDEX on hypo_part_range_1_10000 (id)');
+  EXPLAIN SELECT * FROM hypo_part_range WHERE id = 2;
+                                                                    QUERY PLAN                                                                   
+  -----------------------------------------------------------------------------------------------------------------------------------------------
+   Append  (cost=0.04..8.06 rows=1 width=14)
+     ->  Index Scan using <258199>btree_hypo_part_range_1_10000_id on hypo_part_range hypo_part_range_1_10000  (cost=0.04..8.05 rows=1 width=14)
+           Index Cond: (id = 2)
+  (3 rows)
+
+Manipulate hypothetical partitions
+----------------------------------
+
+Some other convenience functions are available:
+
+- **hypopg_table()**: list all hypothetical partitions that have been created
+- **hypopg_analyze(regclass, fraction)**: perform an operation similar to
+  ANALYZE on a hypothetically partitioned table, to get better estimates
+- **hypopg_statistic()**: return the list of statistics gathered by
+  previous runs of **hypopg_analyze**, in the same format as `pg_statistic`.
+  For easier reading, the view **hypopg_stats** exists, which returns the
+  data in the same format as `pg_stats`
+- **hypopg_drop_table(oid)**: delete a previously created partition, or unpartition
+  a hypothetically partitioned table (including the stored statistics if any)
+- **hypopg_reset_table()**: remove all previously created hypothetical partitions
+  (including the stored statistics if any)
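+
+As a minimal sketch, these functions could be used as follows on the
+`hypo_part_range` table declared above.  The sampling value passed to
+**hypopg_analyze** is an assumption chosen for illustration:
+
+.. code-block:: psql
+
+  -- List the hypothetical partitions currently declared
+  SELECT * FROM hypopg_table();
+  -- Gather statistics (the fraction value 10 is an assumed example)
+  SELECT hypopg_analyze('hypo_part_range', 10);
+  -- Inspect the gathered statistics
+  SELECT * FROM hypopg_stats;
+  -- Remove all hypothetical partitions and their statistics
+  SELECT hypopg_reset_table();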