Skip to main content

MONGODB INDEXES FOR BEGINNERS

What is Indexing?

What indexing does is sort our mongoDB collection in a particular order based on the value of one field(or more than one field). Assume that I have a collection called customers and I have kept customerName as the field for indexing, then what MongoDB does is that it’ll create a list of all the names in an alphabetical order in the document.
The list will be just the names and each item in the list will contain a pointer to the real document in the collection. What this does is the next time you run a find query with the customerName as the filter parameter, Mongo will directly look into that list and easily and quickly find your required document(s).

Pros and cons of indexing

The biggest advantage of indexing is that it speeds up your findupdate and delete queries. Quite naturally because it is easier to search for the elements based on the indexed field.
The disadvantages of indexing is that 1. It takes up memory (obviously). 2.It slows down write queries.
The write queries will obviously be slowed down because every time you make a write query you need to update the indexed field list in the collection as well and sort it again based on that field.

Creating Indexes in Mongo

Let us see how can you create indexes in Mongo.
First of all there are three different kinds of indexes that you should probably know of :-
1.Single index -> Where you sort a collection based on just a single field value.
2.Compound index -> Where you sort a collection first based on a single value and then for the values that have the same first value, you sort the list according to a second field value that you have provided.
3.Partial index -> When you sort the collection based on a field value but only in a particular range (we’ll see how later).
Single Index
1
2
3
4
db.collection.createIndex({fieldName: 1});
 
#example
db.customers.createIndex({customerName: 1})
The 1 represents ascending order of list sorting.
Compound Index
1
db.customers.createIndex({customerName: 1, age: 1});
What this does is create an index based on the customer name first and then the age (if two or more customer names are the same). Note that for compound indexes, you can use them to index for the leftmost field or all the fields moving left to right. They will speed up your queries for both. But this indexing will not work if you think of querying only over age.
Partial Index
Assume we run a particular query more often than not. For example, we often search for customers with age less than 19 and we only want to create an index on the age field for documents where is less than 19, what this will do is not slow down our insertions for documents where age is greater than 19.
1
db.customers.createIndex({age: 1, {partialExpression: {age: {$lt: 19}}}});

I hope you got the basics of indexing and when to use indexes and which fields to use indexes on -> basically create indexes for the field you run the most queries on. Note that it is a bad idea to create an index for each field as it slows down your insertions by a lot and takes up a lot of memory.

Comments

Popular posts from this blog

4 Ways to Communicate Across Browser Tabs in Realtime

1. Local Storage Events You might have already used LocalStorage, which is accessible across Tabs within the same application origin. But do you know that it also supports events? You can use this feature to communicate across Browser Tabs, where other Tabs will receive the event once the storage is updated. For example, let’s say in one Tab, we execute the following JavaScript code. window.localStorage.setItem("loggedIn", "true"); The other Tabs which listen to the event will receive it, as shown below. window.addEventListener('storage', (event) => { if (event.storageArea != localStorage) return; if (event.key === 'loggedIn') { // Do something with event.newValue } }); 2. Broadcast Channel API The Broadcast Channel API allows communication between Tabs, Windows, Frames, Iframes, and  Web Workers . One Tab can create and post to a channel as follows. const channel = new BroadcastChannel('app-data'); channel.postMessage(data); And oth...

Certbot SSL configuration in ubuntu

  Introduction Let’s Encrypt is a Certificate Authority (CA) that provides an easy way to obtain and install free  TLS/SSL certificates , thereby enabling encrypted HTTPS on web servers. It simplifies the process by providing a software client, Certbot, that attempts to automate most (if not all) of the required steps. Currently, the entire process of obtaining and installing a certificate is fully automated on both Apache and Nginx. In this tutorial, you will use Certbot to obtain a free SSL certificate for Apache on Ubuntu 18.04 and set up your certificate to renew automatically. This tutorial will use a separate Apache virtual host file instead of the default configuration file.  We recommend  creating new Apache virtual host files for each domain because it helps to avoid common mistakes and maintains the default files as a fallback configuration. Prerequisites To follow this tutorial, you will need: One Ubuntu 18.04 server set up by following this  initial ...

Working with Node.js streams

  Introduction Streams are one of the major features that most Node.js applications rely on, especially when handling HTTP requests, reading/writing files, and making socket communications. Streams are very predictable since we can always expect data, error, and end events when using streams. This article will teach Node developers how to use streams to efficiently handle large amounts of data. This is a typical real-world challenge faced by Node developers when they have to deal with a large data source, and it may not be feasible to process this data all at once. This article will cover the following topics: Types of streams When to adopt Node.js streams Batching Composing streams in Node.js Transforming data with transform streams Piping streams Error handling Node.js streams Types of streams The following are four main types of streams in Node.js: Readable streams: The readable stream is responsible for reading data from a source file Writable streams: The writable stream is re...