Gene Mkms_3749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3749 
Symbol 
ID4611684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3968385 
End bp3969890 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content68% 
IMG OID639793429 
Productaldehyde dehydrogenase 
Protein accessionYP_939732 
Protein GI119869780 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.651796 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCAGA CGCCCACGGT GAGCGCGGAT CGCCAGAGTG CCGCCGGCTC CCGGGCCGGT 
GACGTGCAGG CGGATCGGCG GCTGTTGATC GACGGTCGGC TGGTCGACAC CGGACGGGTG
TTTCCCTCCC TCAATCCCGC CACCGGCCAG GTGCTCGGGT ACGCGCCGGA TGCGACCGTC
GCCGACGCCG AAGCCGCCGT GGCCGCGGCG CGACGCGCCT TCGACACGAC CGATTGGGCG
ACGAACGTCG AACTGCGGTT GCGCTGCCTC GACCAGTTGC ACACCGCCTT GGTCGAGCAT
CGCGACGAAC TGGCCGCACT GACGATCGCG GAGGTGGGCG CCACCGAGGC GCTGTGCCAG
GGCGCGCAAC TCGACCAGCC GATCGCGATC GTGCGCTACT ACGCCGACCT GCTCGCCGAC
TACCCGATGA CCGAAGACCT CGGCAACATC GAGAGCCGGG GCATGCAGCA CCACCGCTGG
GTCGAGAAGG AAGCCGCGGG CGTCGTGGCG GCGATCATCG CCTACAACTA TCCGAACCAA
CTCGCGCTGG CCAAACTCGC GCCGGCGCTG GCCGCCGGTT GCACCGTCGT CCTCAAGTCC
GCACCCGACA CGCCGTTGAT CACCCTGGCC CTCGGCGAGT TGATCGCCGA GCACACCGAC
ATCCCGGCCG GTGTCGTCAA CGTGCTCTCC GGCGCCGACC CGGAGGTGGG CGCGGTGCTG
ACCACCAGCC CCGACGTCGA CATGGTCACC TTCACCGGTT CGACCCCCAC CGGGCGCCGC
ATCATGGCCG CCGCGAGCGA GACGCTCAAG AAGGTGTTCC TCGAACTCGG TGGCAAGTCC
GCGGCAATCG TGCTCGACGA CGCCGACTTC AACACCGCGG CACTGTTCTC GGCGTTCTCG
ATGGTCACCC ACGCCGGCCA GGGTTGCGCG CTGACGTCCC GGCTGCTGGT GCCGGCCCGG
CACAAGGACG AGATCGTCGA GAAGATCAAG AACAACTTCG GGTTGGTGCG CTTCGGAGAT
CCAGCCGATC CGTCCACCTA CATGGGTCCG CTGATCAGTG AGAAGCAGCG CGACAAGGTC
GACGGCATGG TCCAAAGGGC CGTCGCCGCA GGGGCATCAC TGGTCACCGG CGGCGAGAAG
GTCGACCCCG GCTACTTCTA CTCGCCGACG CTGCTCGCCG ATGTCGACCC CGACAGCGAG
ATCGCACAGG AGGAGGTCTT CGGCCCCGTC CTGGTGGTGA TCGCCTACGA AGACGACGAC
GACGCCGTCC GCATCGCCAA CAACTCCATC TACGGGTTGT CCGGCGCGGT GTTCGGCAGC
GAGGAGCGAG CGCTGGCGGT CGCCCGCCGC ATCCGCACCG GGACGTTCTC GATCAACGGC
GGCAACTACT TCAGCCCCGA CAGCCCGTTC GGCGGCTACA AACAGTCCGG CATCGGCCGC
GAGATGGGCA CCGCAGGCCT CGAGGAGTTC CTCGAATCCA AGACATTCGC GACGGTGGTG
GGCTGA
 
Protein sequence
MAQTPTVSAD RQSAAGSRAG DVQADRRLLI DGRLVDTGRV FPSLNPATGQ VLGYAPDATV 
ADAEAAVAAA RRAFDTTDWA TNVELRLRCL DQLHTALVEH RDELAALTIA EVGATEALCQ
GAQLDQPIAI VRYYADLLAD YPMTEDLGNI ESRGMQHHRW VEKEAAGVVA AIIAYNYPNQ
LALAKLAPAL AAGCTVVLKS APDTPLITLA LGELIAEHTD IPAGVVNVLS GADPEVGAVL
TTSPDVDMVT FTGSTPTGRR IMAAASETLK KVFLELGGKS AAIVLDDADF NTAALFSAFS
MVTHAGQGCA LTSRLLVPAR HKDEIVEKIK NNFGLVRFGD PADPSTYMGP LISEKQRDKV
DGMVQRAVAA GASLVTGGEK VDPGYFYSPT LLADVDPDSE IAQEEVFGPV LVVIAYEDDD
DAVRIANNSI YGLSGAVFGS EERALAVARR IRTGTFSING GNYFSPDSPF GGYKQSGIGR
EMGTAGLEEF LESKTFATVV G