Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mid-chain neutral Arginine is missing from the substructure library #2011

Open
Yoshanuikabundi opened this issue Feb 6, 2025 · 0 comments
Open

Comments

@Yoshanuikabundi
Copy link
Collaborator

Neutral arginine (missing HH12 or HH22 to make the guanidino group neutral) is present in the substructure dictionary, but only as a terminal residue. It should be present mid-chain as well.

>>> import json
>>> from openff.toolkit.utils.utils import get_data_file_path
>>> substructure_file_path = get_data_file_path(
...     "proteins/aa_residues_substructures_explicit_bond_orders_with_caps_explicit_connectivity.json"
... )
>>> with open(substructure_file_path) as subfile:
...     substructure_dictionary = json.load(
...         subfile
...     )
>>> # The only mid-chain Arginine has a charged sidechain
>>> [
...     names
...     for names in substructure_dictionary["ARG"].values()
...     if (
...         "OXT" not in names and "H2" not in names
...     )
... ]
[['N',
  'CA',
  'C',
  'O',
  'CB',
  'CG',
  'CD',
  'NE',
  'CZ',
  'NH1',
  'NH2',
  'H',
  'HA',
  'HB2',
  'HB3',
  'HG2',
  'HG3',
  'HD2',
  'HD3',
  'HE',
  'HH11',
  'HH12',
  'HH21',
  'HH22',
  '*',
  '*']]
>>> # Neutral arginine sidechains exist as termini
>>> [
...     names
...     for names in substructure_dictionary["ARG"].values()
...     if (
...         "HH12" not in names or "HH22" not in names
...     )
... ]
[['N',
  'CA',
  'C',
  'O',
  'CB',
  'CG',
  'CD',
  'NE',
  'CZ',
  'NH1',
  'NH2',
  'OXT',
  'HA',
  'HB2',
  'HB3',
  'HG2',
  'HG3',
  'HD2',
  'HD3',
  'HE',
  'HH11',
  'HH21',
  'HH22',
  'HXT',
  'H1',
  'H2'],
 ['N',
  'CA',
  'C',
  'O',
  'CB',
  'CG',
  'CD',
  'NE',
  'CZ',
  'NH1',
  'NH2',
  'OXT',
  'HA',
  'HB2',
  'HB3',
  'HG2',
  'HG3',
  'HD2',
  'HD3',
  'HE',
  'HH11',
  'HH21',
  'HH22',
  'H1',
  'H2'],
 ['N',
  'CA',
  'C',
  'O',
  'CB',
  'CG',
  'CD',
  'NE',
  'CZ',
  'NH1',
  'NH2',
  'HA',
  'HB2',
  'HB3',
  'HG2',
  'HG3',
  'HD2',
  'HD3',
  'HE',
  'HH11',
  'HH21',
  'HH22',
  'H1',
  'H2',
  '*'],
 ['N',
  'CA',
  'C',
  'O',
  'CB',
  'CG',
  'CD',
  'NE',
  'CZ',
  'NH1',
  'NH2',
  'OXT',
  'HA',
  'HB2',
  'HB3',
  'HG2',
  'HG3',
  'HD2',
  'HD3',
  'HE',
  'HH11',
  'HH12',
  'HH21',
  'HXT',
  'H1',
  'H2'],
 ['N',
  'CA',
  'C',
  'O',
  'CB',
  'CG',
  'CD',
  'NE',
  'CZ',
  'NH1',
  'NH2',
  'OXT',
  'HA',
  'HB2',
  'HB3',
  'HG2',
  'HG3',
  'HD2',
  'HD3',
  'HE',
  'HH11',
  'HH12',
  'HH21',
  'H1',
  'H2'],
 ['N',
  'CA',
  'C',
  'O',
  'CB',
  'CG',
  'CD',
  'NE',
  'CZ',
  'NH1',
  'NH2',
  'HA',
  'HB2',
  'HB3',
  'HG2',
  'HG3',
  'HD2',
  'HD3',
  'HE',
  'HH11',
  'HH12',
  'HH21',
  'H1',
  'H2',
  '*'],
 ['N',
  'CA',
  'C',
  'O',
  'CB',
  'CG',
  'CD',
  'NE',
  'CZ',
  'NH1',
  'NH2',
  'OXT',
  'HA',
  'HB2',
  'HB3',
  'HG2',
  'HG3',
  'HD2',
  'HD3',
  'HE',
  'HH11',
  'HH21',
  'HH22',
  'H1',
  'H2',
  'H3'],
 ['N',
  'CA',
  'C',
  'O',
  'CB',
  'CG',
  'CD',
  'NE',
  'CZ',
  'NH1',
  'NH2',
  'HA',
  'HB2',
  'HB3',
  'HG2',
  'HG3',
  'HD2',
  'HD3',
  'HE',
  'HH11',
  'HH21',
  'HH22',
  'H1',
  'H2',
  'H3',
  '*'],
 ['N',
  'CA',
  'C',
  'O',
  'CB',
  'CG',
  'CD',
  'NE',
  'CZ',
  'NH1',
  'NH2',
  'OXT',
  'HA',
  'HB2',
  'HB3',
  'HG2',
  'HG3',
  'HD2',
  'HD3',
  'HE',
  'HH11',
  'HH12',
  'HH21',
  'H1',
  'H2',
  'H3'],
 ['N',
  'CA',
  'C',
  'O',
  'CB',
  'CG',
  'CD',
  'NE',
  'CZ',
  'NH1',
  'NH2',
  'HA',
  'HB2',
  'HB3',
  'HG2',
  'HG3',
  'HD2',
  'HD3',
  'HE',
  'HH11',
  'HH12',
  'HH21',
  'H1',
  'H2',
  'H3',
  '*']]
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant