In this PostgreSQL post explains how to use the INTERSECT operator with syntax and examples.
The PostgreSQL INTERSECT operator returns the intersection of 2 or more datasets. Each dataset is defined by a SELECT statement. If a record exists in both data sets, it will be included in the INTERSECT results. However, if a record exists in one data set and not in the other, it will be omitted from the INTERSECT results.
Explanation: The INTERSECT query will return the records in the blue shaded area. These are the records that exist in both Dataset1 and Dataset2.
Each SELECT statement within the INTERSECT must have the same number of fields in the result sets with similar data types.
The syntax for the INTERSECT operator in PostgreSQL is:
The columns or calculations that you wish to retrieve.
The tables that you wish to retrieve records from. There must be at least one table listed in the FROM clause.
Optional. These are conditions that must be met for the records to be selected.
The following is an INTERSECT operator example that has one field with the same data type:
In this INTERSECT example, if a category_id appeared in both the products and inventory table, it would appear in your result set.
Now, let's complicate our example further by adding WHERE conditions to the INTERSECT query.
In this example, the WHERE clauses have been added to each of the datasets. The first dataset has been filtered so that only records from the products table where the category_id is less than 800 are returned. The second dataset has been filtered so that only records from the inventory table are returned where the quantity is greater than 5.
Next, let's look at an example of how to use the INTERSECT operator in PostgreSQL to return more than one column.
For example:
In this INTERSECT example, the query will return the records from the contacts table where the contact_id, last_name, and first_name values match the customer_id, last_name, and first_name value from the customers table.
There are WHERE conditions on each data set to further filter the results so that only records from the contacts are returned where the last_name is not Smith. The records from the customers table are returned where the customer_id is less than 100.
The following is an INTERSECT example that uses a ORDER BY clause:
Since the column names are different between the two SELECT statements, it is more advantageous to reference the columns in the ORDER BY clause by their position in the result set. In this example, we've sorted the results by contact_id / company_id in ascending order, as denoted by the ORDER BY 1.
The contact_id / company_id fields are in position #1 in the result set.