Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1290 |
Symbol | |
ID | 9245140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1597141 |
End bp | 1598289 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | Protein-L-isoaspartate(D-aspartate) O-methyltransferase |
Protein accession | YP_003679234 |
Protein GI | 297560260 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTCCCCC AGCACGAGAA GCTGGTCTCC CGCCTCACCC AGGCGGGCAA CCTGGACCAG CATTGGCACG GTGCCTTCGC GGCGGTGGAG CGCCACCGCT TCCTGCCCGG CCGCATCACC GCTCCCGACG GCACCACCGT GGACCGGGAC GGTGACCACG ACGGCGATCG GGAACGGTGG CTCGAACTGG CCTATGACGA CATCCCGGTC ATCACGCAGG TGGACGACGG TACCGGGGAG GGATCCGGGT ACCCGGCCAG CTCCGCCTCG CAACCCTCGA TCGTCGCCGA CATGCTTCAC CGGCTGGACG TGTTCCCGGG GATGCGCGTG TTGGAGGTCG GAACCGGCAC GGGGTACAAC GCGGGGCTGC TCTCCCACCG GTTGGGCGGT GAGAACGTCA CCACCGTCGA GATCGACGCC GACCTCGCCG AACAAGCCCG TGTACGACTG CTCGACGCGG GTTTCGCGGC ACACGTCGTC ACCGGCGACG GCACACGGGG ATGGCCGAAG CGAGCACCCT ATGACCGGGT GCTCAGCACC GCGGCGGTCC AGCGGGTGCC CTATGCCTGG GTCGCCCAGA GCAGGCCCGG CGGGCGGATC CTCACCCCCT GGGGAACCGC CTTCCACAAC GGGGCCCTGG CTGAGCTGCG GGTGGGGCCG GACGGCTCCG CGCGGGGTCA CTTCGCAGGG GACGTGGCGT TCATGTGGGT GCGCGACCAG CGAATACCCA GGCGTGTCGT CGAGACCCAC GTCCGCCCCG AGGAGCAGGA GTTCACGCGC AGCCGCACCG GACTACACCC CTACGAGCCG ATCAGCGACT TCAGCGCGAG CTTCGCCATC GGGTTGCACA TGCCCACCGT TCTGAACCGG GTCGAGTACA CCGACGAGGA ACAGCGTTTC ACGGTTCACC TGGTGGATCC GGGCACCGGC TCCTGGGCGT CCTGGCACGT CGACCCCGAC CGCGGGGAGA CCGGTTACGA AGTCCACCAG CACGGGCCTC GGCGCCTGTT CTCCGAACTG GAGGCCGCCT ACACGTGGTG GCAGGAGGAG GGACCGCCCG AGCACACGCG GTTCGGACTC ACCGTTTCAG CAGAACGGCA GAACGCATGG CTGGACCACG AGGGGCGTCC CGTTCTCACC GCACCCTGA
|
Protein sequence | MLPQHEKLVS RLTQAGNLDQ HWHGAFAAVE RHRFLPGRIT APDGTTVDRD GDHDGDRERW LELAYDDIPV ITQVDDGTGE GSGYPASSAS QPSIVADMLH RLDVFPGMRV LEVGTGTGYN AGLLSHRLGG ENVTTVEIDA DLAEQARVRL LDAGFAAHVV TGDGTRGWPK RAPYDRVLST AAVQRVPYAW VAQSRPGGRI LTPWGTAFHN GALAELRVGP DGSARGHFAG DVAFMWVRDQ RIPRRVVETH VRPEEQEFTR SRTGLHPYEP ISDFSASFAI GLHMPTVLNR VEYTDEEQRF TVHLVDPGTG SWASWHVDPD RGETGYEVHQ HGPRRLFSEL EAAYTWWQEE GPPEHTRFGL TVSAERQNAW LDHEGRPVLT AP
|
| |