Gene Mlg_0544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0544 
Symbol 
ID4270299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp592257 
End bp593273 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content66% 
IMG OID638125285 
Productketol-acid reductoisomerase 
Protein accessionYP_741388 
Protein GI114319705 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0059] Ketol-acid reductoisomerase 
TIGRFAM ID[TIGR00465] ketol-acid reductoisomerase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.105491 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.00154571 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAAGTCT ATTACGATAA GGACGCCGAT CTTTCCATCA TTCAGGGCAA GAAGGTCGCC 
GTCATCGGCT ACGGCTCCCA GGGCCATGCC CATGCCAACA ACCTGAAAGA GTCGGGTGTG
GACGTGGTGG TGGGGCTGCG CGAGGGCTCC AGCTCCGCGG CCAAGGCGCA AAAGGCCGGC
CTGGCCGTGG CCAGCATCGA GGACGCCGCC GCCCAGGCGG ACGTGGTCAT GATCCTGGCC
CCAGACGAGC ACCAGGCGGT GATCTACCAC AACCAGATCG CCCCCAACGT GAAGCCCGGT
GCGGCCATCG CCTTTGCCCA CGGCTTCAAC ATCCATTTCG GCCAGATCCA GCCCGCCGCC
GACCTGGACG TGATCATGGT CGCGCCCAAG GGCCCGGGCC ACCTGGTGCG CTCCACCTAT
GTGGAGGGCG GCGGCGTGCC CAGCCTGATC GCCATCCACC AGGACGCCAC CGGCAAGGCC
AAGGACATCG CCCTGTCCTA TGCCTCCGCC AACGGCGGTG GCCGTGCCGG TGTCATCGAG
ACCAGCTTCC GCGAGGAGAC CGAGACCGAC CTGTTCGGCG AGCAGGCGGT GCTCTGCGGC
GGTATCACCT CGCTGATCCA GGCCGGGTTT GAGACCCTGG TCGAGGCGGG CTACGCCCCC
GAGATGGCCT ACTTCGAGTG CCTGCACGAG ACCAAGCTGA TCGTCGATCT GCTCTACCAG
GGCGGCATCG CCAACATGCG CTACTCCATC TCCAACACTG CCGAGTACGG TGACTTCACT
CGCGGCCCGC GGGTGATCAA CGAGGAGAGC CGCGAGGCCA TGCGCGAGAT CCTGGCCGAG
ATCCAGGAGG GCGAGTTCGC CCGCGAGTTC GTGCTGGAGA ACCAGGCCGG CTGCCCGACC
CTCACCGCCC GCCGCCGGCT CGCCGCCGAG CACGAGATCG AGGTGGTGGG CGAGCGCCTG
CGCGGCATGA TGCCCTGGAT CAACGCCAAC AAGCTGGTGG ACAAGGACAA GAACTGA
 
Protein sequence
MQVYYDKDAD LSIIQGKKVA VIGYGSQGHA HANNLKESGV DVVVGLREGS SSAAKAQKAG 
LAVASIEDAA AQADVVMILA PDEHQAVIYH NQIAPNVKPG AAIAFAHGFN IHFGQIQPAA
DLDVIMVAPK GPGHLVRSTY VEGGGVPSLI AIHQDATGKA KDIALSYASA NGGGRAGVIE
TSFREETETD LFGEQAVLCG GITSLIQAGF ETLVEAGYAP EMAYFECLHE TKLIVDLLYQ
GGIANMRYSI SNTAEYGDFT RGPRVINEES REAMREILAE IQEGEFAREF VLENQAGCPT
LTARRRLAAE HEIEVVGERL RGMMPWINAN KLVDKDKN