Gene Mmcs_5438 details

Gene Information

Locus tag: Mmcs_5438
Symbol:
ID: 4114523
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008147
Strand:
Start bp: 19193
End bp: 20242
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 64%
IMG OID: 638034593
Product: 4-hydroxy-2-ketovalerate aldolase
Protein accession: YP_642594
Protein GI: 108802398
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR03217] 4-hydroxy-2-oxovalerate aldolase


Plasmid Coverage information

Num covering plasmid clones: 50
Plasmid unclonability p-value: 0.113513
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.601738
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGAGA CGGTTAGCCG GCCCCATGCG ACTGAGGGTG CGGCGCTCTA CATTCAGGAC 
GTCACACTGC GCGATGGTAT GCATGCCATG CGCCACCGGA TCAGTCCGGA GAAGGTCGCG
GCGATCGCAG GCGCACTCGA CACTGCCGGA GTCGACGCCA TCGAAGTCAC CCACGGTGAC
GGCCTGGCCG GGCACAGCCT GACCTACGGT CCCGGGAGCA ACACCGACTG GGAATGGATC
GAAGCGGCCG CAGACGTCGT ACACCGCGCC AAACTCACGA CTCTGCTGTT GCCTGGGGTC
GGGACGGTCC GCGAACTCGA GCACGCCTAC AAACTGGGGG TGACCTCGGT CCGGGTCGCA
ACGCACTGCA CCGAGGCCGA TGTCTCGGCA CAGCACATCG GAACGGCCCG CGAACTGGGC
ATGGATGTTT CCGGGTTTCT GATGATGTCG CACCTCGCCG AACCCTCACA TCTGGCTGCC
CAGGCCAAGC TGATGGAATC CTATGGCGCG CATTGCGTTT ATGTCACCGA TTCCGGTGGG
CGGTTGACGA TGGGCAGTGT CCGGGACCGG GTGCGTGCGT ATCGCGACGT GCTCGATGCC
GGTACGCAGA TCGGCATTCA CGCGCACCAA AATCTGTCGT TGTCGGTGGC CAATACCGTG
GTGGCCGTTG AGGAAGGTGT CACCCGGGTT GACGCCTCGC TGGCCGGTCA CGGCGCCGGG
GCGGGCAATT GCCCGATCGA GCCGTTCATC GCCGTGGCCG ATCTCCATGG CTGGAAGCAC
AACTGTGATC TCTTCGGGCT GCAGGACGCC GCCGACGACA TCGTCCGACC GCTGCAGGAT
CGGCCGGTCC AAGTCGACCG GGAGACCCTC ACCCTGGGAT ACGCAGGCGT GTACTCGAGC
TTCCTGCGTC ATGCCGAAGC CGCCGCGAAA CAGTACGGCC TCGACACTCG TGCGATCCTG
CTCGCGGTCG GCGAACGCGG ACTAGTCGGA GGACAGGAAG ACCTCATCCC CGACATCGCG
CTCGATCTAC AACAGAACTT ACGCCGATAG
 
Protein sequence
MTETVSRPHA TEGAALYIQD VTLRDGMHAM RHRISPEKVA AIAGALDTAG VDAIEVTHGD 
GLAGHSLTYG PGSNTDWEWI EAAADVVHRA KLTTLLLPGV GTVRELEHAY KLGVTSVRVA
THCTEADVSA QHIGTARELG MDVSGFLMMS HLAEPSHLAA QAKLMESYGA HCVYVTDSGG
RLTMGSVRDR VRAYRDVLDA GTQIGIHAHQ NLSLSVANTV VAVEEGVTRV DASLAGHGAG
AGNCPIEPFI AVADLHGWKH NCDLFGLQDA ADDIVRPLQD RPVQVDRETL TLGYAGVYSS
FLRHAEAAAK QYGLDTRAIL LAVGERGLVG GQEDLIPDIA LDLQQNLRR
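
Consistency check (illustrative)

The figures in this record can be cross-checked against each other: End bp - Start bp + 1 equals the gene length (20242 - 19193 + 1 = 1050), and 1050 bp corresponds to 349 codons plus one stop codon. The following is a minimal Python sketch of such a check, not part of the original record. The helper names (clean, gc_fraction, translate) are hypothetical. It assumes the usual NCBI layout of the genetic code, in which translation tables 1 and 11 assign the same amino acids to all 64 codons and differ only in the permitted start codons. Only the first 30 and last 30 bases of the gene sequence are translated here; the full sequence can be passed to the same functions.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation tables 1 and 11
# share these amino acid assignments; they differ only in start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to display blocks of 10 letters."""
    return "".join(seq.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA string."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate complete codons in frame 1; a stop codon is written as '*'."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Values copied from the record above.
start_bp, end_bp = 19193, 20242
gene_length, protein_length = 1050, 349

# Coordinates are inclusive, so the span is end - start + 1.
assert end_bp - start_bp + 1 == gene_length
# 349 amino acids plus one stop codon, three bases each.
assert gene_length == 3 * (protein_length + 1)

# First and last 30 bases of the gene sequence as displayed in the record.
first_30 = "ATGACTGAGA CGGTTAGCCG GCCCCATGCG"
last_30 = "CTCGATCTAC AACAGAACTT ACGCCGATAG"

print(translate(first_30))  # MTETVSRPHA
print(translate(last_30))   # LDLQQNLRR*
```

Calling gc_fraction on the full gene sequence gives a value that can be compared with the listed GC content of 64%, and translate on the full sequence (minus the trailing '*') can be compared with the listed protein sequence.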