Gene Ndas_1290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1290 
Symbol 
ID9245140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1597141 
End bp1598289 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content69% 
IMG OID 
ProductProtein-L-isoaspartate(D-aspartate) O-methyltransferase 
Protein accessionYP_003679234 
Protein GI297560260 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCCCCC AGCACGAGAA GCTGGTCTCC CGCCTCACCC AGGCGGGCAA CCTGGACCAG 
CATTGGCACG GTGCCTTCGC GGCGGTGGAG CGCCACCGCT TCCTGCCCGG CCGCATCACC
GCTCCCGACG GCACCACCGT GGACCGGGAC GGTGACCACG ACGGCGATCG GGAACGGTGG
CTCGAACTGG CCTATGACGA CATCCCGGTC ATCACGCAGG TGGACGACGG TACCGGGGAG
GGATCCGGGT ACCCGGCCAG CTCCGCCTCG CAACCCTCGA TCGTCGCCGA CATGCTTCAC
CGGCTGGACG TGTTCCCGGG GATGCGCGTG TTGGAGGTCG GAACCGGCAC GGGGTACAAC
GCGGGGCTGC TCTCCCACCG GTTGGGCGGT GAGAACGTCA CCACCGTCGA GATCGACGCC
GACCTCGCCG AACAAGCCCG TGTACGACTG CTCGACGCGG GTTTCGCGGC ACACGTCGTC
ACCGGCGACG GCACACGGGG ATGGCCGAAG CGAGCACCCT ATGACCGGGT GCTCAGCACC
GCGGCGGTCC AGCGGGTGCC CTATGCCTGG GTCGCCCAGA GCAGGCCCGG CGGGCGGATC
CTCACCCCCT GGGGAACCGC CTTCCACAAC GGGGCCCTGG CTGAGCTGCG GGTGGGGCCG
GACGGCTCCG CGCGGGGTCA CTTCGCAGGG GACGTGGCGT TCATGTGGGT GCGCGACCAG
CGAATACCCA GGCGTGTCGT CGAGACCCAC GTCCGCCCCG AGGAGCAGGA GTTCACGCGC
AGCCGCACCG GACTACACCC CTACGAGCCG ATCAGCGACT TCAGCGCGAG CTTCGCCATC
GGGTTGCACA TGCCCACCGT TCTGAACCGG GTCGAGTACA CCGACGAGGA ACAGCGTTTC
ACGGTTCACC TGGTGGATCC GGGCACCGGC TCCTGGGCGT CCTGGCACGT CGACCCCGAC
CGCGGGGAGA CCGGTTACGA AGTCCACCAG CACGGGCCTC GGCGCCTGTT CTCCGAACTG
GAGGCCGCCT ACACGTGGTG GCAGGAGGAG GGACCGCCCG AGCACACGCG GTTCGGACTC
ACCGTTTCAG CAGAACGGCA GAACGCATGG CTGGACCACG AGGGGCGTCC CGTTCTCACC
GCACCCTGA
 
Protein sequence
MLPQHEKLVS RLTQAGNLDQ HWHGAFAAVE RHRFLPGRIT APDGTTVDRD GDHDGDRERW 
LELAYDDIPV ITQVDDGTGE GSGYPASSAS QPSIVADMLH RLDVFPGMRV LEVGTGTGYN
AGLLSHRLGG ENVTTVEIDA DLAEQARVRL LDAGFAAHVV TGDGTRGWPK RAPYDRVLST
AAVQRVPYAW VAQSRPGGRI LTPWGTAFHN GALAELRVGP DGSARGHFAG DVAFMWVRDQ
RIPRRVVETH VRPEEQEFTR SRTGLHPYEP ISDFSASFAI GLHMPTVLNR VEYTDEEQRF
TVHLVDPGTG SWASWHVDPD RGETGYEVHQ HGPRRLFSEL EAAYTWWQEE GPPEHTRFGL
TVSAERQNAW LDHEGRPVLT AP