Gene Mlg_0418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0418 
Symbol 
ID4269457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp468155 
End bp470008 
Gene Length1854 bp 
Protein Length617 aa 
Translation table11 
GC content69% 
IMG OID638125148 
Productdihydroxyacid dehydratase 
Protein accessionYP_741262 
Protein GI114319579 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0213625 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.00250401 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCCGCAGT ATCGCTCCAA GACGTCCACC GCCGGTCGCA ACATGGCGGG CGCCCGCGCC 
CTCTGGCGGG CCACCGGCAT GAAGGATGGC GACTTCGACA AGCCCATCAT CGCCGTCGCC
AACTCCTTCA CCCAGTTCGT CCCCGGCCAC GTCCACCTGA AGGACCTGGG GCAGCTGGTG
ATCGAAGAGA TCGAAAAGGC CGGCGGCGTG GGCAAGGAGT TCGACACCAT CGCCGTGGAC
GACGGCATCG CCATGGGCCA CGACGGCATG CTCTACAGCC TGCCCAGCCG GGACATCATT
GCCGACTCGG TGGAGTACAT GGTCAATGCC CACTGCGCCG ACGCGCTGGT GTGCATCTCC
AACTGCGACA AGATCACCCC CGGCATGCTG ATGGCCGCCA TGCGGCTGAA CATCCCCGTG
GTGTTCGTCT CCGGCGGGCC CATGGAGGCG GGCAAGACGA AGCTGGCCAG CGGCGAGGAG
ATCGCCACCG ACCTGGTGGA TGCCATGGTG GCCGCGGCCA ACCCGGAGGT CTCCGACGAG
GACGTGGCCA TCTACGAGCG CTCCGCCTGC CCCACCTGCG GCTCCTGCTC CGGCATGTTC
ACCGCCAACT CCATGAACTG CCTCACCGAG GCCCTGGGCC TGAGCCTGCC GGGCAACGGC
TCGCTGCTGG CCACCCACAC CGACCGCGAG CGCCTGTTCC GCGATGCCGG GCGCACGGTG
GTGGAACTGG CCCGGCGCTA CTACGAGCAG GACGACGAGC GCTGCCTGCC GCGCAACATC
GCCAGCCGGT CCGCCTTCCG CAACGCCATG AGCCTGGACA TCGCCATGGG CGGGTCCACC
AACACGGTGC TGCACCTGCT GGCCGCCGCC CAGGAGGGCG AGGTCAAGTT CGACATGGTC
GACATCGACA AGCTCTCCCG GAAGGTGCCC AACCTCTGCA AGGTCGCCCC CGCCACCCCG
CTCTTTCACA TGGAGGACGT GCACCGGGCC GGCGGCATCA TGGGCATCCT CGGCGAGCTG
GACCGGGCCA ACCTGCTGGA CACCACCGTG CCCACGGTCC ACAGCGACAC CCTGGCCGAG
GCCCTGGAAC GCTGGGACGT CAAACGCACC GACGACCCGG CGGTGCACGA CTTCTTCAAG
GCCGGTCCGG CCGGCGTCCC CAGCCAGACC GCCTTCAGCC AGTCCACCCG CTTCGAAGAA
CTGGACCTGG ACCGGGAGTG CGGCTGCATC CGTACCCTGG AGCACGCCTA CAGCAAGGAC
GGCGGGCTGG CGGTGCTCTA CGGCAACCTG GCCGAGCGGG GCTGCATCGT GAAGACCGCC
GGGGTGGATG AGTCCATCCT CACCTTCGAA GGGCCGGCGG TGATCTTCGA GAGCCAGGAC
GCCGCCGTGG AGGGGATTCT CGGCGGCCAG GTGAAGAAGG GCAATGTGGT CATCATCCGC
TACGAGGGGC CGCGGGGCGG GCCGGGCATG CAGGAGATGC TCTACCCCAC CAGCTACCTC
AAGTCCCGCG GCCTGGGCAA GGACTGCGCG CTGATCACCG ACGGCCGCTT CTCCGGCGGC
ACCTCGGGGC TGTCCATCGG CCACGTCTCC CCGGAGGCGG CCGAGGGCGG CAACATCGCC
CTGATCGAGC CGGGCGACCG GATCTGCATC GACATCCCCA AGCGCAGCAT CCGCATCGAT
ATCAGCGACG AGGAATTGGC CCGCCGCCGC GAGGCCATGG CGGCCAAGGG CCGCGATGCC
TGGAAGCCCG CCGCCCCCCG CCAGCGCAAG GTCAGCACGG CCCTGAAAGC CTACGCCAAG
CTGACCACCA GCGCGGACAA GGGCGCGGTG CGAAACCTGG ACCTGCTGGA CTGA
 
Protein sequence
MPQYRSKTST AGRNMAGARA LWRATGMKDG DFDKPIIAVA NSFTQFVPGH VHLKDLGQLV 
IEEIEKAGGV GKEFDTIAVD DGIAMGHDGM LYSLPSRDII ADSVEYMVNA HCADALVCIS
NCDKITPGML MAAMRLNIPV VFVSGGPMEA GKTKLASGEE IATDLVDAMV AAANPEVSDE
DVAIYERSAC PTCGSCSGMF TANSMNCLTE ALGLSLPGNG SLLATHTDRE RLFRDAGRTV
VELARRYYEQ DDERCLPRNI ASRSAFRNAM SLDIAMGGST NTVLHLLAAA QEGEVKFDMV
DIDKLSRKVP NLCKVAPATP LFHMEDVHRA GGIMGILGEL DRANLLDTTV PTVHSDTLAE
ALERWDVKRT DDPAVHDFFK AGPAGVPSQT AFSQSTRFEE LDLDRECGCI RTLEHAYSKD
GGLAVLYGNL AERGCIVKTA GVDESILTFE GPAVIFESQD AAVEGILGGQ VKKGNVVIIR
YEGPRGGPGM QEMLYPTSYL KSRGLGKDCA LITDGRFSGG TSGLSIGHVS PEAAEGGNIA
LIEPGDRICI DIPKRSIRID ISDEELARRR EAMAAKGRDA WKPAAPRQRK VSTALKAYAK
LTTSADKGAV RNLDLLD