Demystifying Molecule Attribute Queries in SAMSON.

Molecular modelers often find it challenging to quickly access and analyze specific molecular properties, especially when dealing with extensive molecular datasets. SAMSON tackles this hurdle using its Node Specification Language (NSL) and an intuitive set of molecule attributes that streamline molecular analysis. In this post, we’ll explore how molecule attributes within SAMSON provide a powerful toolset for querying molecular properties, enabling faster and more accurate modeling workflows.

What Are Molecule Attributes?

The molecule attribute space in SAMSON is a tool designed to match molecule nodes and allow users to query specific properties of molecules. A significant feature of this space is its use of concise short names for attributes, supporting quick, efficient expressions. These attributes include inherent properties like the number of atoms, chains, or segments, along with visibility or selection statuses.

Categories of Molecule Attributes

The molecule attributes are grouped into three broad sets:

Inherited from Node Attributes: These include properties like hidden and hasMaterial, which let you check a node’s metadata-related states.
Inherited from StructuralGroup Attributes: These encompass chemically relevant properties, such as the formalCharge, number of specific elements like carbons or hydrogens, and partial charges.
Specific to the Molecule Attribute Space: These attributes address higher-order structure, such as numberOfChains, numberOfResidues, and numberOfSegments.

Exploring Specific Examples

To help you grasp the power of these attributes, let’s delve into a few practical examples:

1. Filtering Molecules by Number of Chains

The numberOfChains attribute (short name: nc) enables you to filter molecules based on their chain count:

molecule.numberOfChains < 3 (shortened as mol.nc < 3): Matches molecules with fewer than 3 chains.
molecule.numberOfChains 2:4 (shortened as mol.nc 2:4): Matches molecules with 2 to 4 chains.

This is especially helpful when working with biomolecular systems like proteins, where chain count correlates with functional properties.

2. Investigating Residue Information

Residues are essential to molecular insight, and the numberOfResidues attribute (short name: nr) offers a way to evaluate their distribution:

molecule.numberOfResidues > 100 (shortened as mol.nr > 100): Matches molecules with more than 100 residues.
molecule.numberOfResidues 100:120 (shortened as mol.nr 100:120): Matches molecules with residue counts in the range of 100–120.

By assembling such queries, you can analyze specific families of biomolecules or identify residues contributing to structural varieties.

3. Evaluating Structural Segments

When skimming through datasets for hierarchical organization, the numberOfSegments attribute (short name: ns) becomes useful. For example:

molecule.numberOfSegments > 2 (or mol.ns > 2): Matches molecules containing more than 2 segments.
molecule.numberOfSegments 2:4 (or mol.ns 2:4): Targets molecular structures with 2 to 4 distinct segments.

Segment-level evaluations are impactful when performing docking studies or protein analyses.

Simplifying Powerful Queries

Another benefit of SAMSON’s NSL is its user-friendly syntax. With shortened forms like mol.nc or mol.nr, users can quickly build complex queries while keeping their workflows efficient. By mastering these attributes, researchers can gain new insights, extract relevant molecules, and automate repetitive tasks in molecular modeling.

Learn More

The molecule attribute space in SAMSON is robust and designed according to the needs of molecular modelers. To discover even more about how to use these attributes effectively, visit the original documentation at https://documentation.samson-connect.net/users/latest/nsl/molecule/.

SAMSON and all SAMSON Extensions are free for non-commercial use. Download SAMSON today at https://www.samson-connect.net.