Gene Mkms_4738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4738 
Symbol 
ID4616153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4965407 
End bp4966930 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content70% 
IMG OID639794430 
Productbetaine-aldehyde dehydrogenase 
Protein accessionYP_940719 
Protein GI119870767 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGAC TTCCGCATTA CCGGATGTAC GTCGACGGCG AGTGGCGCGA CGCCGCGGAG 
TCGATCGAGG TGCGCAGCCC GGCAACCGGC GCCCCCGTCG CGACGGTGGC CTACGGTGAC
CTGACCGCCG TCGACGACGC CGTGGCGGCG GCCAGGGCCG CGCACGAGGC TGGGGTGTGG
CGATCGATGC CGCCGCAGCA GCGGGCCGAT CTGCTCGACG CCATCGCCGA CAAGCTCGCC
GCCCGGTCCG ACGAGCTGAC CGCGCTGCAG GTCAGGGAGA ACGGTGCGAC CGTGCGCGGT
GCCGGCGCGT TCCTGATCGG CTACGCCATC GCGAACCTGA GGTACTTCGC CTCGCTTGCG
CGCAGCTACG CGTTCCAGAC CAGCGGACCG CTGATCGAGG CGCCGACGCT GGCCTCCGGC
CTGATCCTGC GGGAGCCGGT CGGGGTGTGC GCGGGCATCA TCCCGTGGAA CTTCCCACTG
CTGCTGGCGG TCTGGAAGCT GGGACCGGCG CTGGCGGCGG GCAACACCGT CGTGCTCAAA
CCCGACGACC AGACCCCGCT GACGCTGCTC GAACTCGCCC GCGCCGCAGA CGAAGTCGGG
CTGCCCGCCG GGGTGCTCAA CGTGGTGACC GGGCCGGGTC CGGTGGCCGG CGCCCGGCTG
GCCGAACACC CCGACGTCCG CAAGATCGCG TTCACCGGGT CCACCGAGGT GGGCAAGGGT
GTCATGCGGG CCGCGGCCGA CAACGTCAAG AAGGTCACCC TCGAACTGGG CGGCAAGGGC
GCCAACATCG TGCTCGAGGA CGCCGATCTC GACCTTGCCG TGGACGGTTC GCTGTTCGCC
TTCCTGATGA TGAGCGGGCA GGCCTGTGAA TCCGGGACGC GACTGCTGGT GCACGAATCC
GTTCACGACG AGTTCGTGCG GCGGTTGGTG GCCCGGGCCG AGACGCTGGT GATGGGCGAT
CCGATGAGCC CGGCGACCGA TCTGGGACCG CTGGTGTCGG CCAAGCAGAA GGCCCGTGTC
GAGAAGTACA TCGCGCTCGG TCAGGAGGAG GGCTGCCGGA TGGCCTTCCA GGGCACCGTC
CCGTCGGATC CCGCGCTGGC CGAGGGGCAT TGGGTGCCGC CGGTCATCCT GACCGGGGCC
ACCAACCAGA TGCGGATCGC CCGCGAGGAG ATCTTCGGCC CGGTGCTGGT GGTCATCCCG
TTCCGCGACG ACGACGATGC GGTCGCGATC GCCAACGACA GCGAGTACGG GCTGTCGGCG
GGGGTGTGGA GCGCCGACAA CGGCCGCGCC CTGGGGATCG CCCGCCGGCT GGAGTCGGGA
ACGGTGTGGG TCAACGACTG GCACATGGTC AACGCGATGT ACCCGTTCGG CGGGGTCAAA
CAGAGCGGAC TGGGTCGTGA ACTCGGCCCG GACGCGCTCG ACGAGTACAC CGAGCCCAAG
TTCGTCCACA TCGACCTGAC CAACGACCGT CGCAAACGTG CCTTCGCCGT GGTCGTATCC
GCGGCGGCAG CCGAATCCGA CTGA
 
Protein sequence
MTGLPHYRMY VDGEWRDAAE SIEVRSPATG APVATVAYGD LTAVDDAVAA ARAAHEAGVW 
RSMPPQQRAD LLDAIADKLA ARSDELTALQ VRENGATVRG AGAFLIGYAI ANLRYFASLA
RSYAFQTSGP LIEAPTLASG LILREPVGVC AGIIPWNFPL LLAVWKLGPA LAAGNTVVLK
PDDQTPLTLL ELARAADEVG LPAGVLNVVT GPGPVAGARL AEHPDVRKIA FTGSTEVGKG
VMRAAADNVK KVTLELGGKG ANIVLEDADL DLAVDGSLFA FLMMSGQACE SGTRLLVHES
VHDEFVRRLV ARAETLVMGD PMSPATDLGP LVSAKQKARV EKYIALGQEE GCRMAFQGTV
PSDPALAEGH WVPPVILTGA TNQMRIAREE IFGPVLVVIP FRDDDDAVAI ANDSEYGLSA
GVWSADNGRA LGIARRLESG TVWVNDWHMV NAMYPFGGVK QSGLGRELGP DALDEYTEPK
FVHIDLTNDR RKRAFAVVVS AAAAESD