Gene Ndas_5453 details

Gene Information

Locus tag: Ndas_5453
Symbol:
ID: 9249356
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 638218
End bp: 639387
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: Protein-L-isoaspartate(D-aspartate) O-methyltransferase
Protein accession: YP_003683338
Protein GI: 297564365
COG category:
COG ID:
TIGRFAM ID:
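
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 638218
end_bp = 639387

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1170, matching "Gene Length"
print(protein_length)  # 389, matching "Protein Length"
```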


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.291198
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAACC CCGAAGACCT CCGGTCCCGC CTCGTCGAGG AGATCGCCCT CTCTCCGGCC 
TGGCGGGACA CCTTCGAGCG GGTTCCCCGC CACCGCTTCA TCCCCGACCG GATCTGGATC
GAGGACGGGG ATGACCTGAC CGTCCTCGAT AGGGCCGATG ACTCGGACGC GTGGTTGCGT
GCCTGCTACG CCGACCGGCC CGTCATCACC CAGATCGACG ATGGAGACCC GACCGGGCGC
GGGCAGCGGT CCTCGTCGGC GTCCATGCCG AGCATCGTGG CCCTGATGCT GGAGGCCACC
GACCTCGCCG CCGGTCAGCG GGTGTTGGAG ATCGGCACCG GAACCGGGTG GAACGCCGCG
CTGCTCGCTG ACAGGGCCGG CGCGGGGAAC GTGACGTCGG TGGAGATCGA CCCGGCCGTG
GCCGCGCGGG CTGAGGAGAA CCTGGAGGGC CACGGTGTCC ATGTGGTCCT CGGGGACGGT
GAGAAGGGTT GTCCGCCCGA CGCCCCCTAC GACCGGGTAC TGGCGACCGC CGCCGTCCAG
AGGGTTCCTT ACCCGTGGGT GGAGCAGACG GTGCCCGGCG GGCGGATCGT GACCCCGTGG
GGGACGAGCT TCCACAACGG CACGCTGCTC CGCCTCCAGG TCGGCGCGGA CGGAACGGCG
TCCGGGAGGT TCGGCGGGAA CGCAGGCTTC ATGTGGGTGC GTGGCCAGCG CACACCGCAC
GGCACGCTCG ATGAGCGCGT CCGTCCCGAC CACGAGTACA CCGAGACCAC CACGGACCTG
CACCCGTACG AGCCGGTCGG CGACTTCGAC GCGAGCTTCG CGATCGGTCT GCGGGTTCCC
GGCATGAAGG ACCTGCTGGT CTTCGACGAC GACGTGCCGG GCAACCCGGA CTACACGGTG
TACCTGATGG ACCCGGGTTC GGGTTCGTGG GCCTCGTGGC GGGTCCGGTC CGGCACTCGT
GAGTTCGGGG TCCGCCAGCA CGGCCCGCGC TGCCTCTTCG ACGAGCTGGC CGGGGCCTAC
GCCTGGTGGC GGGAGGCGGG CCGCCCGGAG CACTCCCGGT TCGGCGTGAC CGTGACCTGC
GAGGGCCAAC GCGTGTGGTT GGACGACCCC GGCAACGTCC TCCCCGTGGG GCAGATCGCC
GCCGGACAAG CGGAGAACGG TCAGACATGA
 
Protein sequence
MTNPEDLRSR LVEEIALSPA WRDTFERVPR HRFIPDRIWI EDGDDLTVLD RADDSDAWLR 
ACYADRPVIT QIDDGDPTGR GQRSSSASMP SIVALMLEAT DLAAGQRVLE IGTGTGWNAA
LLADRAGAGN VTSVEIDPAV AARAEENLEG HGVHVVLGDG EKGCPPDAPY DRVLATAAVQ
RVPYPWVEQT VPGGRIVTPW GTSFHNGTLL RLQVGADGTA SGRFGGNAGF MWVRGQRTPH
GTLDERVRPD HEYTETTTDL HPYEPVGDFD ASFAIGLRVP GMKDLLVFDD DVPGNPDYTV
YLMDPGSGSW ASWRVRSGTR EFGVRQHGPR CLFDELAGAY AWWREAGRPE HSRFGVTVTC
EGQRVWLDDP GNVLPVGQIA AGQAENGQT
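
The relationship between the two sequences above can be checked by translating the gene sequence and by computing its GC fraction. The following is a minimal sketch in plain Python, not the tool used to produce this record. It assumes the 10-base blocks are separated only by whitespace, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the tables differ in their permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence, and its last line.
first_blocks = "ATGACCAACC CCGAAGACCT"
last_line = "GCCGGACAAG CGGAGAACGG TCAGACATGA"

print(translate(first_blocks))  # MTNPED, the start of the protein sequence
print(translate(last_line))     # AGQAENGQT, the end of the protein sequence
```

Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field in the Gene Information table.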