Gene Mkms_4051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4051 
Symbol 
ID4611991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4277720 
End bp4278895 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content70% 
IMG OID639793735 
Productmalate dehydrogenase 
Protein accessionYP_940033 
Protein GI119870081 
COG category[C] Energy production and conversion 
COG ID[COG0281] Malic enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.812595 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.971942 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTGAAA CTGTTGTTGG TTCCTCACAG GTCGTCATCA GTGACGACGA GATCTTCGCC 
GCTCACGAGG GCGGCAAGCT CGCGGTGCAA CTGAACGAAC CGCTGGACAC GCAGCGCGCC
CTGTCGATCG CCTACACCCC GGGTGTCGCC CAGGTCAGCC GCGCCATCGC CAAAGACCCG
GCCCTGGCCG CCAGGTACAC GTGGGCCAAC CGGCTCGTCG CGGTGGTCAG CGACGGCAGT
GCGGTGCTCG GGCTCGGCGA TATCGGCGCG GCCGCGTCGC TGCCGGTGAT GGAGGGTAAG
AGCGCGCTGT TCAAGACATT CGGCGGCCTC GACTCGATCC CGATCGTGCT CGACACCAAG
GACCCCGACG AGATCGTCGA GACGCTGGTG CGGCTGCGCC CGACGTTCGG GGCGGTCAAC
CTCGAGGACA TCTCGGCGCC GCGCTGCTTC GAGATCGAAC GCCGCGTCAT CGAGGCCCTC
GACTGCCCGG TCATGCACGA CGATCAGCAC GGCACCGCGA TCGTCGTACT CGCCGCGCTG
ATGGGTGCGG CCCGGGTCCT CGGCCGGGAC GCGGCCACGC TGCGGGTCGT GATCTCCGGT
GCGGGCGCGG CGGGCGTCGC GTGTGCCAAC ATCCTGCTGG CCGCCGGCAT CAGCGACGTC
ACCGTGCTCG ACAGCAAGGG CATCGTCCAC AGCGGCCGTG ACGACCTCAA CTCCTTCAAG
GCCGAACTCG CCGAACGCAC CAACCCGGCC GGCCGCACCG GCGGTGTGGC CGAGGCCCTC
GACGGCGCCG ACATGTTCCT CGGACTCTCG GCCGGTGTCG TCGCGGAGGA GTTGATCGCG
ACGATGGCGC CGGGCGGCAT CGTGTTCGCG CTGTCCAACC CCGATCCGGA GATCCACCCG
GACCTCGCGC GTAAGTACGC CGCCGTCGTG GCGACCGGGC GCAGCGACTT CCCGAACCAG
ATCAACAACG TGCTGGCGTT CCCCGGCGTG TTCCGCGGTG CGCTCGACGC CGGGGCGCGC
CGGATCACCG AGCGGATGAA GGTGGCCGCG GCCGAGGCGA TCTTCTCGGT GGTCGGCGAC
GACCTGGCCG TCGACCACAT CGTGCCGAGC GCGCTCGATC CGCGGGTGGC CCCTGCGGTC
GCCGCCGCGG TCGGTGCGGC CTCGCAGGTC GGCTGA
 
Protein sequence
MAETVVGSSQ VVISDDEIFA AHEGGKLAVQ LNEPLDTQRA LSIAYTPGVA QVSRAIAKDP 
ALAARYTWAN RLVAVVSDGS AVLGLGDIGA AASLPVMEGK SALFKTFGGL DSIPIVLDTK
DPDEIVETLV RLRPTFGAVN LEDISAPRCF EIERRVIEAL DCPVMHDDQH GTAIVVLAAL
MGAARVLGRD AATLRVVISG AGAAGVACAN ILLAAGISDV TVLDSKGIVH SGRDDLNSFK
AELAERTNPA GRTGGVAEAL DGADMFLGLS AGVVAEELIA TMAPGGIVFA LSNPDPEIHP
DLARKYAAVV ATGRSDFPNQ INNVLAFPGV FRGALDAGAR RITERMKVAA AEAIFSVVGD
DLAVDHIVPS ALDPRVAPAV AAAVGAASQV G