Gene Ndas_1310 details

Gene Information

Locus tag: Ndas_1310
Symbol:
ID: 9245160
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1615902
End bp: 1617161
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: Protein-L-isoaspartate(D-aspartate) O-methyltransferase
Protein accession: YP_003679252
Protein GI: 297560278
COG category:
COG ID:
TIGRFAM ID:
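
The coordinates and lengths in the Gene Information fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start/End positions are 1-based and inclusive, and that the protein length excludes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1615902
END_BP = 1617161
GENE_LENGTH_BP = 1260
PROTEIN_LENGTH_AA = 419


def span_length(start: int, end: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1617161 - 1615902 + 1 gives 1260 bp, and 1260 / 3 - 1 gives 419 aa, matching the listed lengths.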


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGACCT CCGACACCTC CACCTCTACT CCGGGTACCG ACCGTATTGA TGAGCTGCGC 
GAACGCATGA TCATTCGCCT GCGAGCCGTC AACGCCCTGC ACTCCGAACC AGTCACCAGA
GCGGTGCGCA CCGTGCCCCG CCACGCCTTC GCTCCGGAGT TCCCTCTGGA GGAGGTCTAC
GACCCGGAGA CGGTGCTGCG CACTCGCTTC GCCGACGACG GCACCGCCAC CAGCTCGGTC
TCAGCGACCT GGGTGCAGAC CCTGATGCTG GAACGCGCCC GACTCCGCCC CGGCATGCGG
GTGCTGGAGG TCGGGTCGGG CGGCTACAAC GCCGCACTGG CCGCTGAGGT CGTCGGACCC
GCCGGATCGG TGACCTCCCT CGACATCGAC CCCGCGGTCA TCGACCGCGC CCGCACCCAC
CTGACCCAGG CCGGATACGC CGACCGGGTG GAGCTGGTCG TCGGCGACGC CGAACACGGC
CACCCCAGCA GCGCACCCTA CGACCGGATC ATCGTCACGG CCGCCGCCAC CGACCTGCCT
CCGACCTGGA CTCATCAGCT CGTCGACGGT GGACGCCTCG TGCTCCCGTT GGGGGTGCGG
GGTTTCCAGC GCGTCTACGC CTTCACTCGC CACGGCGCAC TGCTGCTGGG CGAGACCGGC
AACCACGCGG GGTTCGTCCC CATGCAGGGC CTGGGCGCTT CCGCCTCCAC CACCGTGCCG
GTCACCGACA CGGTCGAGCT GACCACCGAC GCCGACCCTA CGCCCGCCCC CGAAGCCCTG
TCCGCAGTGT TCGCTCTCCC CGCTCAGGAG CAGTTCACCG GGGTCCGTAT CGCTGGCATG
GAACCCTTCA GCGCCCTGCA GATGTGGATG GCCACCGGGC TACCCCACTT CGCCACCCTG
ACCGGCACCG CCGACGATTT CCGCTACCGA CCTATGACCC ATGCCCTGAT CCCGGCGATG
TGGGACGGGA CCTCACTGGC CTACCTGCTG CTCCCCGCGG TACCCGATGC CTCCGCCACA
TACGAGTTCG TGGCCTGCGG TCACGGGCCC GACGCCGCCG ACGTAGTCGC CACCATGGCC
GACCACGTCC GTCGCTGGGA CCGCGACCAC CGCGGCGGGC CCGGTCCCCG GTTCCTGGCC
CATCCTGACT TCCCACCTGG GGACGGGATC ATTCGCACCC GCCACACCCG TCTGCTGGCC
CTGTGGGACC AGGCCGAGGA GGACCACGGT TTGGACTGGC CCGCCGACCC CGGCAGGTGA
 
Protein sequence
MTTSDTSTST PGTDRIDELR ERMIIRLRAV NALHSEPVTR AVRTVPRHAF APEFPLEEVY 
DPETVLRTRF ADDGTATSSV SATWVQTLML ERARLRPGMR VLEVGSGGYN AALAAEVVGP
AGSVTSLDID PAVIDRARTH LTQAGYADRV ELVVGDAEHG HPSSAPYDRI IVTAAATDLP
PTWTHQLVDG GRLVLPLGVR GFQRVYAFTR HGALLLGETG NHAGFVPMQG LGASASTTVP
VTDTVELTTD ADPTPAPEAL SAVFALPAQE QFTGVRIAGM EPFSALQMWM ATGLPHFATL
TGTADDFRYR PMTHALIPAM WDGTSLAYLL LPAVPDASAT YEFVACGHGP DAADVVATMA
DHVRRWDRDH RGGPGPRFLA HPDFPPGDGI IRTRHTRLLA LWDQAEEDHG LDWPADPGR
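
The sequences above are printed in space-separated blocks of ten characters, so the blocks need to be joined before any analysis. The following is a minimal sketch of that cleanup, with a GC-fraction helper and a check of the first and last codons. The function names are hypothetical, and only the first and last lines of the gene sequence are copied in. In translation table 11, GTG is an accepted alternative start codon (read as methionine at initiation) and TGA is a stop codon.

```python
# First and last lines of the gene sequence, copied from the record above.
FIRST_LINE = "GTGACGACCT CCGACACCTC CACCTCTACT CCGGGTACCG ACCGTATTGA TGAGCTGCGC"
LAST_LINE = "CTGTGGGACC AGGCCGAGGA GGACCACGGT TTGGACTGGC CCGCCGACCC CGGCAGGTGA"


def join_blocks(text: str) -> str:
    """Remove all whitespace between blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already-joined sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


first = join_blocks(FIRST_LINE)
last = join_blocks(LAST_LINE)

start_codon = first[:3]   # first codon of the CDS
stop_codon = last[-3:]    # last codon of the CDS

if __name__ == "__main__":
    print(start_codon, stop_codon)
    print(gc_fraction(first[:10]))
```

Passing the full joined gene sequence to `gc_fraction` is one way to compare against the GC content listed in the record.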