Gene Mlg_1553 details


Gene Information

Locus tag: Mlg_1553
Symbol:
ID: 4269927
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1775217
End bp: 1776908
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 71%
IMG OID: 638126310
Product: dihydroxy-acid dehydratase
Protein accession: YP_742390
Protein GI: 114320707
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
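
The coordinate and length fields in this table can be cross-checked against each other. The sketch below does so under two assumptions that the record itself does not state: that Start bp and End bp are 1-based, inclusive coordinates, and that Protein Length excludes the stop codon. The function names are illustrative only and do not belong to any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1775217
END_BP = 1776908
GENE_LENGTH_BP = 1692
PROTEIN_LENGTH_AA = 563


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both reported lengths agree with the reported coordinates.
assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

With the values listed here, 1776908 - 1775217 + 1 gives 1692 bp, and 1692 / 3 - 1 gives 563 aa, matching the table.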


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.143997
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.944928
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGAGA ACCGCCGCAG TCGGGTCGTG ACCGCAGGCC TGCGCCGCAC CCCCAACCGG 
GCCATGCTGC GCGCCACCGG CTTCCGGGAC GAGGACTTCG ACAAGCCCAT CATCGGCGTC
GCCAACGCCC ACAGCACCAT CACCCCCTGC AACATGCATA TCGCCGCGCT CACCCGGCGG
GCGGTGGAGG CGTTGCAGGC GGCCGGTGCC ATGGCACCGG AATTTGGCGT GCCCACCGTC
TCGGACGGCA TCGCCATGGG CACGCCGGGG ATGCGCTACT CGCTGGTCTC GCGCGAGGTG
ATCGCCGACG CCATCGAGAC GGTGTGCGAG GGCCAGAGCC TGGACGGCGT GCTGGCCACC
GGCGGCTGCG ACAAGAACAT GCCCGGGGCT ATGATCGCCA TCGCGCGCAT GAACATACCG
GCCCTGTTTG TCTACGGCGG CACCATCCGC CCCGGCTACT ACAAGGGCGA GCGGCTGGAC
ATCGTCTCGG CGTTCGAGGC GGTGGGCCAG GCCTCCGCGG GCAGGATGAG TGACGAGGAC
CTGCTTGGGG TGGAGCGCAA CGCCTGCCCC GGCGCCGGCT CCTGTGGCGG GATGTACACC
GCCAACACCA TGTCCAGCGC CTTCGAGGCC ATGGGCATGA GCCTGCCCGC CAGCTCCACC
GTGGCCGCCG AGGCGGAGGA GAAGGGCGAC GACGTGGCGG CCGCCGCGCG GGTGCTGATG
GAGGCGGTGC GGGCGGATCG CAAGCCCCGC GACATCCTCA CCCGCGAGGC CTTCGAGAAC
GGCTTCGCCC TGGTGATGGC CGTGGGCGGG TCCACCAACG CGGTGCTGCA CTTGCTGGCC
ATCGCCGATG CGGCCGATGT GCCGTTGTCG CTGGACGACG TGGAGCGTAT CCGGCAGCAG
GTGCCCGTGC TCTGCGATCT GCGGCCCTCG GGCCGCTACG TGACCACTGA ATTTCACGAG
GTGGGCGGCA CGCCGCAAGT GCTGCGCCTG CTGCTGGACG CGGGCCTGCT CCACGGCGAC
TGCCTCACTA TTACCGGGCA GACGCTTGCC GAGACCCTGG CCGACGTGCC CCCGACCCCC
GCGAAGGATC AGGACATCAT CCGGACCCTG GACAACCCCC TCTACCCGGT GGGCCATCTG
GCCATCCTGC GTGGAAACCT GGCAGAGGAG GGGGCCGTGG CCAAGGTGTC CGGCCTTAAG
CAGCGCCGCA TCGTCGGACC GGCGCGGGTA TTCGAGGGTG AGGAGGACTG CCTGGAGGCG
ATCCTGGCCG GCCAGATTAA GCCCGGCGAC GTGGTGGTCA TCCGCCACGA GGGGCCGAAG
GGCGGGCCCG GCATGCGCGA GATGCTCTCG CCCACCGCGG CGTTGATGGG GGCCGGGCTG
GGGGAGAGCG TCGGCCTGAT CACCGATGGG CGCTTCTCCG GTGGCACCCG GGGGCTGGTG
GTGGGCCACG TGGCGCCGGA GGCCGCAGCG GGTGGCACCA TCGCCCTGGT CCGGGAAGGC
GACACCGTCA CCATCGATGC GGACGCCAAC CGGCTCACCC TGGAGGTGGA CGAGGCGGAG
CTCGCCCGCC GGCGTGCGGC CTGGCAGCCG CCGGAACCGA GAGCGACCCG GGGCGTATTG
GGCAAGTACG CGCAGATGGT CGGCTCGGCG GCGCAGGGTG CGGTCACCGA TCGCTGGAGC
GGCGGCGCGT GA
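
The GC content reported in the gene information can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as text in the space-separated 10-base blocks used on this page. `clean_sequence` and `gc_content` are hypothetical helper names, not functions of any existing library.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a block-formatted DNA sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Small illustration on a made-up four-base string, not on the gene itself.
print(gc_content("AT GC"))  # 0.5
```

Passing the full gene sequence as one multi-line string and multiplying the result by 100 yields a percentage that can be compared with the GC content field.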
 
Protein sequence
MSENRRSRVV TAGLRRTPNR AMLRATGFRD EDFDKPIIGV ANAHSTITPC NMHIAALTRR 
AVEALQAAGA MAPEFGVPTV SDGIAMGTPG MRYSLVSREV IADAIETVCE GQSLDGVLAT
GGCDKNMPGA MIAIARMNIP ALFVYGGTIR PGYYKGERLD IVSAFEAVGQ ASAGRMSDED
LLGVERNACP GAGSCGGMYT ANTMSSAFEA MGMSLPASST VAAEAEEKGD DVAAAARVLM
EAVRADRKPR DILTREAFEN GFALVMAVGG STNAVLHLLA IADAADVPLS LDDVERIRQQ
VPVLCDLRPS GRYVTTEFHE VGGTPQVLRL LLDAGLLHGD CLTITGQTLA ETLADVPPTP
AKDQDIIRTL DNPLYPVGHL AILRGNLAEE GAVAKVSGLK QRRIVGPARV FEGEEDCLEA
ILAGQIKPGD VVVIRHEGPK GGPGMREMLS PTAALMGAGL GESVGLITDG RFSGGTRGLV
VGHVAPEAAA GGTIALVREG DTVTIDADAN RLTLEVDEAE LARRRAAWQP PEPRATRGVL
GKYAQMVGSA AQGAVTDRWS GGA
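
The protein sequence corresponds to the gene sequence translated with translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ in which codons may act as start codons). The sketch below builds that codon table and translates the first three 10-base blocks of the gene sequence. `translate` is an illustrative helper written for this example, not a library function.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed with each codon position cycling through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate block-formatted DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed on this page.
first_blocks = "ATGAGTGAGA ACCGCCGCAG TCGGGTCGTG"
print(translate(first_blocks))  # MSENRRSRVV
```

The output matches the first ten residues of the protein sequence. The final bases of the gene, GGCGGCGCGT GA, translate to GGA followed by a TGA stop codon, which agrees with the end of the protein.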