Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5453 |
Symbol | |
ID | 9249356 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 638218 |
End bp | 639387 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | Protein-L-isoaspartate(D-aspartate) O-methyltransferase |
Protein accession | YP_003683338 |
Protein GI | 297564365 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.291198 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAACC CCGAAGACCT CCGGTCCCGC CTCGTCGAGG AGATCGCCCT CTCTCCGGCC TGGCGGGACA CCTTCGAGCG GGTTCCCCGC CACCGCTTCA TCCCCGACCG GATCTGGATC GAGGACGGGG ATGACCTGAC CGTCCTCGAT AGGGCCGATG ACTCGGACGC GTGGTTGCGT GCCTGCTACG CCGACCGGCC CGTCATCACC CAGATCGACG ATGGAGACCC GACCGGGCGC GGGCAGCGGT CCTCGTCGGC GTCCATGCCG AGCATCGTGG CCCTGATGCT GGAGGCCACC GACCTCGCCG CCGGTCAGCG GGTGTTGGAG ATCGGCACCG GAACCGGGTG GAACGCCGCG CTGCTCGCTG ACAGGGCCGG CGCGGGGAAC GTGACGTCGG TGGAGATCGA CCCGGCCGTG GCCGCGCGGG CTGAGGAGAA CCTGGAGGGC CACGGTGTCC ATGTGGTCCT CGGGGACGGT GAGAAGGGTT GTCCGCCCGA CGCCCCCTAC GACCGGGTAC TGGCGACCGC CGCCGTCCAG AGGGTTCCTT ACCCGTGGGT GGAGCAGACG GTGCCCGGCG GGCGGATCGT GACCCCGTGG GGGACGAGCT TCCACAACGG CACGCTGCTC CGCCTCCAGG TCGGCGCGGA CGGAACGGCG TCCGGGAGGT TCGGCGGGAA CGCAGGCTTC ATGTGGGTGC GTGGCCAGCG CACACCGCAC GGCACGCTCG ATGAGCGCGT CCGTCCCGAC CACGAGTACA CCGAGACCAC CACGGACCTG CACCCGTACG AGCCGGTCGG CGACTTCGAC GCGAGCTTCG CGATCGGTCT GCGGGTTCCC GGCATGAAGG ACCTGCTGGT CTTCGACGAC GACGTGCCGG GCAACCCGGA CTACACGGTG TACCTGATGG ACCCGGGTTC GGGTTCGTGG GCCTCGTGGC GGGTCCGGTC CGGCACTCGT GAGTTCGGGG TCCGCCAGCA CGGCCCGCGC TGCCTCTTCG ACGAGCTGGC CGGGGCCTAC GCCTGGTGGC GGGAGGCGGG CCGCCCGGAG CACTCCCGGT TCGGCGTGAC CGTGACCTGC GAGGGCCAAC GCGTGTGGTT GGACGACCCC GGCAACGTCC TCCCCGTGGG GCAGATCGCC GCCGGACAAG CGGAGAACGG TCAGACATGA
|
Protein sequence | MTNPEDLRSR LVEEIALSPA WRDTFERVPR HRFIPDRIWI EDGDDLTVLD RADDSDAWLR ACYADRPVIT QIDDGDPTGR GQRSSSASMP SIVALMLEAT DLAAGQRVLE IGTGTGWNAA LLADRAGAGN VTSVEIDPAV AARAEENLEG HGVHVVLGDG EKGCPPDAPY DRVLATAAVQ RVPYPWVEQT VPGGRIVTPW GTSFHNGTLL RLQVGADGTA SGRFGGNAGF MWVRGQRTPH GTLDERVRPD HEYTETTTDL HPYEPVGDFD ASFAIGLRVP GMKDLLVFDD DVPGNPDYTV YLMDPGSGSW ASWRVRSGTR EFGVRQHGPR CLFDELAGAY AWWREAGRPE HSRFGVTVTC EGQRVWLDDP GNVLPVGQIA AGQAENGQT
|
| |