Gene Mkms_1110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_1110 
Symbol 
ID4614488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp1196615 
End bp1197814 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content70% 
IMG OID639790786 
Productbranched-chain alpha-keto acid dehydrogenase subunit E2 
Protein accessionYP_937113 
Protein GI119867161 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGT TCCGGATGCC CGCGCTCGGC TCGGACATGG ACGAGGGGAC CCTCGACCAA 
TGGCTGGTCA AACCGGGCGA CACCGTCACC AGGGGCCAGG TCGTGGCGGT GGTCGAGACC
ACCAAGGCCG CCGTCGAGGT CGAATGCTGG CAGGAGGGGA CCGTCGACCG CCTCCTGGTG
CCCGAAGGCC AGACCGTCCG GGTCGGAACG CCGCTGGCCA CGCTGCTGGC TCCGGGCGAA
ACACCCGCAC CGACTGCACC GGCGGTGCCT CGCACGATGC GGGAATCGCC GGTGGCCGTC
GAGAGACCAG AAGGCGCAGG CAGGCCTGCT CCGGCCGCGG GGCCAGCCAT CGCGACCCGG
CCGCATCGCC GGTGGGTCTC CCCGGCGGCC CGCCGCGTGG CAGCGACGCT GGACATCGAC
GCCGATACCC TCACCGGCAC CGGTCCGCAG GGCGCGGTCA CCATTCGCGA CGTGGAACAG
GCGGCAGCGT CGAGGAAGCA GCCGGCCGAC GGGCGAACCG TACGGGATCG GTCCGTGGCG
ATGCGCGCGT CGATCGCCGC GGCGATGAGC CGGTCGAAGC GCGAGATTCC GCACTACTAC
CTGGCCGACG AAGTCCTCAT GGACCCGGCG CTGGCATGGC TGGCTGAGCG CAACGCCGCG
CGATCCATCA CCGAACGGGT GTTGCCGGCG GTGCTGCAGA TCAAGGCCGT CGCGGCGGCA
GCGGACCGCT TTCCCGAGTT CAACGGCTTC TGGCGCGACG ACGCGTTCGT CGGTGCCGAC
GGCGTCCACG TCGGTGTCGC CATCTCACTT CGCGGTGGGG GCCTGGTCGC ACCCGCGATC
CACGACGTCC CCGACAGGAG CCTCGACGAC CTCATGGGGG CCCTGACCGA CCTGGTGGCG
CGCGCCCGGG CCGGCTCGCT GCGCAGTTCG GAGATGTCTG ATCCCTCCAT CACGATCACC
AACCTGGGCG ACCAGGGGGT GGACACGGTG TTCGGCGTCA TCTATCCGCC ACAGGTCGCC
CTGGTGGGCT TCGGCAAGCC GGTGCAACGG GTATGTGCCG TCGACGGTGG TATTCGTATC
GCGACCGCGC TGACCGCCAC TCTGGCAGCG GATCACCGGG CCAGCGATGG ACACCGCGGT
GCGCTCTTCC TCGCCGCGAT CAACGAGATC CTGCAGCAGC CGCAGAAGTT GGAGAAGTGA
 
Protein sequence
MTEFRMPALG SDMDEGTLDQ WLVKPGDTVT RGQVVAVVET TKAAVEVECW QEGTVDRLLV 
PEGQTVRVGT PLATLLAPGE TPAPTAPAVP RTMRESPVAV ERPEGAGRPA PAAGPAIATR
PHRRWVSPAA RRVAATLDID ADTLTGTGPQ GAVTIRDVEQ AAASRKQPAD GRTVRDRSVA
MRASIAAAMS RSKREIPHYY LADEVLMDPA LAWLAERNAA RSITERVLPA VLQIKAVAAA
ADRFPEFNGF WRDDAFVGAD GVHVGVAISL RGGGLVAPAI HDVPDRSLDD LMGALTDLVA
RARAGSLRSS EMSDPSITIT NLGDQGVDTV FGVIYPPQVA LVGFGKPVQR VCAVDGGIRI
ATALTATLAA DHRASDGHRG ALFLAAINEI LQQPQKLEK