Gene Mkms_5835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_5835 
Symbol 
ID4610543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008704 
Strand
Start bp43827 
End bp44876 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content64% 
IMG OID639789489 
Product4-hydroxy-2-ketovalerate aldolase 
Protein accessionYP_935824 
Protein GI119855221 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR03217] 4-hydroxy-2-oxovalerate aldolase 


Plasmid Coverage information

Num covering plasmid clones71 
Plasmid unclonability p-value0.633164 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGAGA CGGTTAGCCG GCCCCATGCG ACTGAGGGTG CGGCGCTCTA CATTCAGGAC 
GTCACACTGC GCGATGGTAT GCATGCCATG CGCCACCGGA TCAGTCCGGA GAAGGTCGCG
GCGATCGCAG GCGCACTCGA CACTGCCGGA GTCGACGCCA TCGAAGTCAC CCACGGTGAC
GGCCTGGCCG GGCACAGCCT GACCTACGGT CCCGGGAGCA ACACCGACTG GGAATGGATC
GAAGCGGCCG CAGACGTCGT ACACCGCGCC AAACTCACGA CTCTGCTGTT GCCTGGGGTC
GGGACGGTCC GCGAACTCGA GCACGCCTAC AAACTGGGGG TGACCTCGGT CCGGGTCGCA
ACGCACTGCA CCGAGGCCGA TGTCTCGGCA CAGCACATCG GAACGGCCCG CGAACTGGGC
ATGGATGTTT CCGGGTTTCT GATGATGTCG CACCTCGCCG AACCCTCACA TCTGGCTGCC
CAGGCCAAGC TGATGGAATC CTATGGCGCG CATTGCGTTT ATGTCACCGA TTCCGGTGGG
CGGTTGACGA TGGGCAGTGT CCGGGACCGG GTGCGTGCGT ATCGCGACGT GCTCGATGCC
GGTACGCAGA TCGGCATTCA CGCGCACCAA AATCTGTCGT TGTCGGTGGC CAATACCGTG
GTGGCCGTTG AGGAAGGTGT CACCCGGGTT GACGCCTCGC TGGCCGGTCA CGGCGCCGGG
GCGGGCAATT GCCCGATCGA GCCGTTCATC GCCGTGGCCG ATCTCCATGG CTGGAAGCAC
AACTGTGATC TCTTCGGGCT GCAGGACGCC GCCGACGACA TCGTCCGACC GCTGCAGGAT
CGGCCGGTCC AAGTCGACCG GGAGACCCTC ACCCTGGGAT ACGCAGGCGT GTACTCGAGC
TTCCTGCGTC ATGCCGAAGC CGCCGCGAAA CAGTACGGCC TCGACACTCG TGCGATCCTG
CTCGCGGTCG GCGAACGCGG ACTAGTCGGA GGACAGGAAG ACCTCATCCC CGACATCGCG
CTCGATCTAC AACAGAACTT ACGCCGATAG
 
Protein sequence
MTETVSRPHA TEGAALYIQD VTLRDGMHAM RHRISPEKVA AIAGALDTAG VDAIEVTHGD 
GLAGHSLTYG PGSNTDWEWI EAAADVVHRA KLTTLLLPGV GTVRELEHAY KLGVTSVRVA
THCTEADVSA QHIGTARELG MDVSGFLMMS HLAEPSHLAA QAKLMESYGA HCVYVTDSGG
RLTMGSVRDR VRAYRDVLDA GTQIGIHAHQ NLSLSVANTV VAVEEGVTRV DASLAGHGAG
AGNCPIEPFI AVADLHGWKH NCDLFGLQDA ADDIVRPLQD RPVQVDRETL TLGYAGVYSS
FLRHAEAAAK QYGLDTRAIL LAVGERGLVG GQEDLIPDIA LDLQQNLRR