Gene Mkms_1831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_1831 
Symbol 
ID4613758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp1943213 
End bp1944469 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content70% 
IMG OID639791497 
Productaldehyde dehydrogenase 
Protein accessionYP_937822 
Protein GI119867870 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGCGC TCCCGGTGAT CGAGCACACC AGGTCCCATG TGGCGAAATG GATGCGGCGC 
ACCACGCTGT TGCGTCCGGC GCGGCTGGCA GGGCTGCGGG CCGAGGTCGA ACCCGTCCCG
GTCGGTGTGG TCGGGATCGT CGGACCGTGG AACTTCCCCG TCAACCTCGT CGTCCTGCCC
GCGGCCGCCG CCTTCGCGGC GGGCAACCGG GTGATGATCA AGATGTCGGA GATCACCGCG
CACACCGCCG AACTGCTCGA GGCCCGCGCC CCCGAGTACT TCGACGCGGC CGAACTCACC
GTGGTCACTG GTGGGCCGGA CACCGCCGCG GCGTTCACCG CGCTGCCCTT CGACCACCTG
TTCTTCACCG GTTCACCGGC CGTCGGCGTG CACGTGCAGC GCGCCGCCGC GGCGAACCTC
GTCCCGGTCA CGCTCGAGCT CGGAGGTAAG AATCCGGCCG TGGTCGGGCC GCGTGCCGAC
CTCCGGCGCG CAGCCGTGCG GATCGCGCAG GGCCGCATGG TCAATGGTGG GCAGGTCTGC
GTCTGCCCCG ATTACGTCCT GGTGCCCGAG TACCTCGTCG ACGAGTTCAG CGCGACCGTG
CTCGCCACGT GGCGGCGGAT GTTCCCGTCG ATCACCGGTA GCGAGGACTA CTGCTCGTCG
GTCAACGACG CCAACTTCGA CCGGGTCGTC GGCCTGATCG ACGACGCCCG TGCCGGCGGC
GCCCGCGTGA ACAGCGTTGT CCCACCGGGT GAAACGCTTC CGGACCGGAG ATCGCGCAAG
ATCGCGCCCA CGCTGATCCG CGACGTGACA CCGACGATGC GCATCGCCTC CGAGGAGGTG
TTCGGGCCGG TGTTGTCCGT GCTCGGGTAT TCCACGACCG ACGAGGTGAT CGACCACATC
AACAGCCGTC CCGCTCCGCT GGTGGCCTAT TGGTTCGGCC CCGACGACCA GGATTTCCGC
ACCTTTGTGC GCCGGACACG CAGCGGCGGG GTGGCCCGCA ACGACTTTGC CGCACAGATG
ATCCCGTCCG ACGCGCCGTT CGGTGGGGTG GGGCGCAGTG GGATGGGCGC CTACCATGGC
AAGGCCGGGT TCGACACCTT CAGCCACCAC CGATCCGTGG TGGGCAGCGA TCTGCCGTTC
TCGATCACCG GCAGCGCCGC ACCTCCGTTC GGGGCCGCCA TGCGGCGCAG CACCGAGTTC
CGGCTGCGGA TGGCCCGCAG GCGCAACCAT CGCCGGCTCC GGCGCAGCCA CGGTTGA
 
Protein sequence
MGALPVIEHT RSHVAKWMRR TTLLRPARLA GLRAEVEPVP VGVVGIVGPW NFPVNLVVLP 
AAAAFAAGNR VMIKMSEITA HTAELLEARA PEYFDAAELT VVTGGPDTAA AFTALPFDHL
FFTGSPAVGV HVQRAAAANL VPVTLELGGK NPAVVGPRAD LRRAAVRIAQ GRMVNGGQVC
VCPDYVLVPE YLVDEFSATV LATWRRMFPS ITGSEDYCSS VNDANFDRVV GLIDDARAGG
ARVNSVVPPG ETLPDRRSRK IAPTLIRDVT PTMRIASEEV FGPVLSVLGY STTDEVIDHI
NSRPAPLVAY WFGPDDQDFR TFVRRTRSGG VARNDFAAQM IPSDAPFGGV GRSGMGAYHG
KAGFDTFSHH RSVVGSDLPF SITGSAAPPF GAAMRRSTEF RLRMARRRNH RRLRRSHG