PyMOL plugin

This plugin provides tools for identification, classification and analysis of conserved water molecules and their networks from molecular dynamics simulation trajectories.

The main interface consists of the following sections:

Input Options

Input trajectories MUST be aligned and centered before analysis. The plugin will not perform any alignment or centering. If the system is centered on the protein, it usually works well. However, in instances where the area of interest is close to the edge of the simulation box, system should be centered on the position of the center of mass of the area of interest.

Selection Input Type: Choose the type of input you want to provide. Options include:

A PyMOL selection is first exported to a PDB file and water molecules are extracted via the water selection panel with MDAnalysis.

PyMOL Selection: Enter the name of the PyMOL object or a selection string which contains waters (can contain other elements as well) in your PyMOL window. It is recommended that the selection includes both the system and the water molecules of interest.

Test Input: Click this button to test the provided input.

Water Selection

This section provides guidance on selecting water molecules for analysis. Proper water selection ensures that the analysis focuses on relevant waters in your structure. Users should take great care when choosing the waters of interest. Making an appropriate choice is crucial, as selecting a large number of water molecules can significantly slow down the analysis process. Typically, the most relevant water molecules are those situated near the binding site, proteins, or other surfaces. While you can analyze bulk waters, they tend to diffuse freely. If results are produced from these waters, especially those classified as WCW type, exercise caution when interpreting the data.

  • Water Selection Center: Choose the method for selecting the center of water molecules selection. Options include:

Provide a MDAnalysis selection string. The geometric mean of the selected atoms will be used as the center. This selection is handled by MDAnalysis whose selection language is similar in most instances but not identical to PyMOL. For more information see MDanalysis.

Users can provide residue name assigned to water molecules, as well as the atom names for oxygen and hydrogen atoms in water molecules. Alternatively, the plugin offers an automatic option, which attempts to identify water residue names and atom names using conventions from widely-used MD engines and tools.

Solvent Residue Name: Specify the name of the solvent residue or opt for automatic detection.

Water Oxygen Atom Name: Specify the name of the water oxygen or opt for automatic detection.

Water Hydrogen Atom Name: Specify the name of the water hydrogen or opt for automatic detection.

Distance: Specify the distance from the center for water selection inside which waters shall be selected for analysis.

Test Selection: Click this button to test the water selection.

Export CWS Input Data: Click this button to export the CWS input data to a file.

Compute results

Compute conserved waters and classify them into Fully, Half or Weakly Conserved water molecules. More information can be found in the ConservedWaterSearch documentation.

  • Clustering Method. Choose the clustering method. Options include:

Quick Multi-Stage Re-Clustering procedure. The best ratio of quality and speed.

  • Clustering Algorithm. Choose the clustering algorithm. Options include:

Faster, but produces slightly worse clusters.

Water Types for Clustering: Select the types of water molecules for clustering. In principle users should choose FCW, HCW and WCW. In some cases it might make sense to leave WCW out. This will also reduce the time for the analysis by about a third. For more information see ConservedWaterSearch documentation. OnlyO option clusters only oxygen atoms without considering hydrogen orientation data.

Clustering Options: Depending on the chosen method, provide the necessary parameters. Best to leave as is. For large number of snapshots (>1000) it is recommended to increase the value of EveryMinsamp to not more than 10% of the number of snapshots (if using QMSRC or MSRC).

Compute Clustering Button: Click this button to start the clustering analysis.

Advanced Settings

Users are discouraged to change the default values for the advanced settings, except for number of threads setting under njobs.

Note

Number of threads: using more thread than 1 will often decrease the performance of the clustering procedure. Only use more than 1 thread if you have a very large system (thousands of frames) or a large selection of water molecules per frame.