Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1269 |
Symbol | |
ID | 9245119 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1574288 |
End bp | 1575433 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | protein-L-isoaspartate(D-aspartate) O-methyl transferase |
Protein accession | YP_003679214 |
Protein GI | 297560240 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCCCC CGCACGAGAG GATGGTCGAA CGCCTCGTCG AGGAGGGCGC TCTGGCCGCG GAGTGGCGCG GGGCCTTCGC GGCGGTGCCG CGGCACCTGT TCCTGCCCGA CGAGATCGCC GACCCCGACA CGGGCGTGCC CGTGGACCGC TACCGCGGCG AACTGCGGTG GCTGGAGGCG GCCTACGCCG ACGTTCCCGT CGTCACCCAG GTCGACGACG GCGCCCGCTA CGGCCCCGGC GAGCCCACGT CCTCGTCCCC GGCGCCGTCC TCGGTGGCCG AGACGGTGCA GCGGCTGCGG CTGGCGCCGG GCATGCGGGT GCTGGAGGTG GGCACCGGCA GCGGGTACAC CGCCGCCCTG CTCGCGCGGT TCCTGGGCGA CGACGCCGTC ACCTCGGTGG AGATCGACTT CGAGCTGGCC GAACAGGCGC GGGTGCGGCT GATGAGCGCG GGGCTCACAC CTCTGGTGGT CAGCGGGGAC GGCATGCGCG GCTGGGCCGG GCGCGCGCCC TACGACCGGG TGGTCAGCGC GGTGACCGTG CAGCGTGTCC CCTACGCCTG GGTGGCGCAG TGCCAGCCCG GCGGACGCGT CCTGACGCCG TGGGGCACCG CCTTCCACCA CGGGGTGACG GCCGACCTCG TGGTCGGCCC GCACGGGACC GCGCACGGGA GGTTCACCGG GCACACCGAC CGCATGTGGG CGCGCGCGGA CCGGACCCCG CGCCACCTGC TGGAGACCTA CGTCCGTCCG GACGACGACG AGCACACCAC GACGTGCTCC CGGCTGCACC CGGGAGAGGT GGTCGGCGAC TCCGACGGGG CCTTCGCGGT GGGGTTGCAG ATGCCCGACG TGCACCGGAT CGTCCGGCTC GGGGGCGGAC CCGAGGACCC GCGGTTCACG GTGTACCTGC TGGACTCCTC GACCTGGTCG TGGGCCTCCT GGCACATCGA CCCCGGGCAC CGGGACCGGG GCTACGAGGT GCGCCAGCAC GGCCCCCGGA GGCTGTTCAA CGAATTGGAG GCGGCCCACC TGCTGTGGGT GGAGGCGGGG CGTCCCGCGC ACACCCGGTT CGGACTGACG GTGTCGGCCG AACACCAGCT GGTGTGGCTG GACGACGAGG CCACGATCTT CGCCGCCACG AGGTGA
|
Protein sequence | MIPPHERMVE RLVEEGALAA EWRGAFAAVP RHLFLPDEIA DPDTGVPVDR YRGELRWLEA AYADVPVVTQ VDDGARYGPG EPTSSSPAPS SVAETVQRLR LAPGMRVLEV GTGSGYTAAL LARFLGDDAV TSVEIDFELA EQARVRLMSA GLTPLVVSGD GMRGWAGRAP YDRVVSAVTV QRVPYAWVAQ CQPGGRVLTP WGTAFHHGVT ADLVVGPHGT AHGRFTGHTD RMWARADRTP RHLLETYVRP DDDEHTTTCS RLHPGEVVGD SDGAFAVGLQ MPDVHRIVRL GGGPEDPRFT VYLLDSSTWS WASWHIDPGH RDRGYEVRQH GPRRLFNELE AAHLLWVEAG RPAHTRFGLT VSAEHQLVWL DDEATIFAAT R
|
| |