Gene Mjls_0298 details


Gene Information

Locus tag: Mjls_0298
Symbol:
ID: 4876044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 330010
End bp: 331428
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 70%
IMG OID: 640137612
Product: betaine-aldehyde dehydrogenase
Protein accession: YP_001068602
Protein GI: 126432911
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
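
The start, end, and length fields above agree with each other, and a short sketch can show the arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(330010, 331428)
print(length_bp)                  # 1419
print(protein_length(length_bp))  # 472
```

Both results match the Gene Length and Protein Length values listed for Mjls_0298.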


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGACT ACACCCGGAA GACCCTGTTC ATCGACGGCC GCTGGGCGAC CCCGGACGGC 
GGCGACGCGA TCGAGGTCAT CGACCCCGCA ACCGAGCAGG TGATCGGATC GGTCCCCGAC
GGCACAACCG CCGACGTCGA CGCGGCGGTC GCCGCGGCGC GCCGAGCCTT CGACCCGCTG
ATCACCGTCG CCGAACGGCG CGAGCGCCTC GATCGGGTGA TCGCCGCCAT GGAGAAGCGG
CTTCCCGACA TCGGCGAGAC GATCACCCGC GAGATGGGCG CCCCCGTGCG GATCGCGCAG
GGGGTGCAGA CCAAGGTGCC GCTGGCGGTG GCCCGTGGCA TCGCCGACGT ACTCGCCACC
TTCGAATTCG AAGAGCGGAT GGGCAATTCG CTGGTCGTCC GCGAACCCTA CGGGGTGGTC
GGCGCCATCA CGCCGTGGAA CTACCCGCTC TACCAGGTGG TCGCCAAGGT GGTGCCGGCG
ATCGCGGCGG GCTGCACCGT CGTGCTGAAG CCGAGCAACG AGGCGCCGCT GTCGGTGTTC
GAGTTCGTCG AGGCCCTCGA CGAGGCCGGC CTGCCGCCCG GGGTGGTCAA CCTCGTCTCC
GGCAGCGGAC GCGTGGTGGG CGAGCGGATC GCCGAGCACC CGGATGTCGA CCTCGTGTCG
TTCACCGGAT CGACGGCGGT CGGCCAACGG GTCGCCGAGC TCGCCGCGAA AACGGTGAAG
AAGGTGGCGC TCGAGCTGGG CGGCAAGTCG GCCAACGTCA TCCTCGAGGG TGGCGACCTG
ACGACGGCGG TGAAGGTCGG TGTCGGGAAC GCATACCTCA ACGGAGGCCA GACCTGTATG
GCCTGGACGC GGATGCTGGT GCCGGAGAGC CGTTACGGCG AGGCGCTCGA GCTGGTCGAG
GCGGCCGCCG CCAGGTACAC CGTCGGTGAC CCGCTCGACC CGGCCACGCG TATCGGACCG
TCGGCGTCGG CATCGCAGTT CGCCACCGTG CGCGGATTCA TCGAGCGCGC CCAGCGCGAC
GGCGCGCGGC TGATCACCGG TGGCGCCGAG AAGATCCGCG ACATCGGCTA CTACGTCGCG
CCGACCGTCT TCGCCGACGT CGACCCCGAT TCCGAACTCG GTCAGGAGGA GGTCTTCGGA
CCCGTCCTCG CGGTGATCCC GTTCCGCGAC GAGGACGACG CCGTGCGGAT CGCCAACGGC
ACCCCGTACG GGCTGGCGGG TGCGGTGTGG GCCGGAGACC TCGATCATGC GATCGCCTTC
GCGCGCCGGG TGCAGACCGG GCAGCTCGAC CTCAACGGCG GCGCGTACAA CCCCGTGGCG
CCGTTCGGCG GGTACAAGAA GTCCGGCGTC GGCCGGGAAC TCGGCCGCGC GGGCTTCGAA
GAGTTCCTGC AGACCAAATC CCTGCAGCTG CCCGCATGA
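
The gene sequence above is printed in blocks of 10 bases, so it needs cleaning before any calculation. The following is a minimal sketch of one way to strip the layout and compute a GC fraction, assuming GC content simply means the share of G and C bases; the helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied with its original spacing.
first_row = "ATGCACGACT ACACCCGGAA GACCCTGTTC ATCGACGGCC GCTGGGCGAC CCCGGACGGC"
seq = clean_sequence(first_row)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC share of this row only, not the whole gene
```

Running the same functions on the full 1419 bp sequence would give a figure to compare against the GC content field in the Gene Information table.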
 
Protein sequence
MHDYTRKTLF IDGRWATPDG GDAIEVIDPA TEQVIGSVPD GTTADVDAAV AAARRAFDPL 
ITVAERRERL DRVIAAMEKR LPDIGETITR EMGAPVRIAQ GVQTKVPLAV ARGIADVLAT
FEFEERMGNS LVVREPYGVV GAITPWNYPL YQVVAKVVPA IAAGCTVVLK PSNEAPLSVF
EFVEALDEAG LPPGVVNLVS GSGRVVGERI AEHPDVDLVS FTGSTAVGQR VAELAAKTVK
KVALELGGKS ANVILEGGDL TTAVKVGVGN AYLNGGQTCM AWTRMLVPES RYGEALELVE
AAAARYTVGD PLDPATRIGP SASASQFATV RGFIERAQRD GARLITGGAE KIRDIGYYVA
PTVFADVDPD SELGQEEVFG PVLAVIPFRD EDDAVRIANG TPYGLAGAVW AGDLDHAIAF
ARRVQTGQLD LNGGAYNPVA PFGGYKKSGV GRELGRAGFE EFLQTKSLQL PA
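
The protein sequence can be related back to the gene sequence by translating codon by codon. The sketch below builds the codon table from the usual TCAG-ordered amino acid string; translation table 11 uses the same amino acid assignments as the standard code for internal codons, and this sketch ignores its alternative start codons. The `translate` helper is hypothetical, and the inputs are the first three blocks and the last row of the gene sequence.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGCACGACT ACACCCGGAA GACCCTGTTC"))            # MHDYTRKTLF
print(translate("GAGTTCCTGC AGACCAAATC CCTGCAGCTG CCCGCATGA"))  # EFLQTKSLQLPA
```

The two outputs match the first block and the final residues of the protein sequence listed above.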