Gene Information |
Locus tag | Mlg_0418 |
Symbol | |
ID | 4269457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 468155 |
End bp | 470008 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638125148 |
Product | dihydroxyacid dehydratase |
Protein accession | YP_741262 |
Protein GI | 114319579 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
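The length fields above follow from the coordinates by simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which matches the reported 1854 bp) and that the CDS is unspliced and ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return abs(end_bp - start_bp) + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length(468155, 470008)
print(length)                  # 1854
print(protein_length(length))  # 617
```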
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0213625 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.00250401 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCCGCAGT ATCGCTCCAA GACGTCCACC GCCGGTCGCA ACATGGCGGG CGCCCGCGCC CTCTGGCGGG CCACCGGCAT GAAGGATGGC GACTTCGACA AGCCCATCAT CGCCGTCGCC AACTCCTTCA CCCAGTTCGT CCCCGGCCAC GTCCACCTGA AGGACCTGGG GCAGCTGGTG ATCGAAGAGA TCGAAAAGGC CGGCGGCGTG GGCAAGGAGT TCGACACCAT CGCCGTGGAC GACGGCATCG CCATGGGCCA CGACGGCATG CTCTACAGCC TGCCCAGCCG GGACATCATT GCCGACTCGG TGGAGTACAT GGTCAATGCC CACTGCGCCG ACGCGCTGGT GTGCATCTCC AACTGCGACA AGATCACCCC CGGCATGCTG ATGGCCGCCA TGCGGCTGAA CATCCCCGTG GTGTTCGTCT CCGGCGGGCC CATGGAGGCG GGCAAGACGA AGCTGGCCAG CGGCGAGGAG ATCGCCACCG ACCTGGTGGA TGCCATGGTG GCCGCGGCCA ACCCGGAGGT CTCCGACGAG GACGTGGCCA TCTACGAGCG CTCCGCCTGC CCCACCTGCG GCTCCTGCTC CGGCATGTTC ACCGCCAACT CCATGAACTG CCTCACCGAG GCCCTGGGCC TGAGCCTGCC GGGCAACGGC TCGCTGCTGG CCACCCACAC CGACCGCGAG CGCCTGTTCC GCGATGCCGG GCGCACGGTG GTGGAACTGG CCCGGCGCTA CTACGAGCAG GACGACGAGC GCTGCCTGCC GCGCAACATC GCCAGCCGGT CCGCCTTCCG CAACGCCATG AGCCTGGACA TCGCCATGGG CGGGTCCACC AACACGGTGC TGCACCTGCT GGCCGCCGCC CAGGAGGGCG AGGTCAAGTT CGACATGGTC GACATCGACA AGCTCTCCCG GAAGGTGCCC AACCTCTGCA AGGTCGCCCC CGCCACCCCG CTCTTTCACA TGGAGGACGT GCACCGGGCC GGCGGCATCA TGGGCATCCT CGGCGAGCTG GACCGGGCCA ACCTGCTGGA CACCACCGTG CCCACGGTCC ACAGCGACAC CCTGGCCGAG GCCCTGGAAC GCTGGGACGT CAAACGCACC GACGACCCGG CGGTGCACGA CTTCTTCAAG GCCGGTCCGG CCGGCGTCCC CAGCCAGACC GCCTTCAGCC AGTCCACCCG CTTCGAAGAA CTGGACCTGG ACCGGGAGTG CGGCTGCATC CGTACCCTGG AGCACGCCTA CAGCAAGGAC GGCGGGCTGG CGGTGCTCTA CGGCAACCTG GCCGAGCGGG GCTGCATCGT GAAGACCGCC GGGGTGGATG AGTCCATCCT CACCTTCGAA GGGCCGGCGG TGATCTTCGA GAGCCAGGAC GCCGCCGTGG AGGGGATTCT CGGCGGCCAG GTGAAGAAGG GCAATGTGGT CATCATCCGC TACGAGGGGC CGCGGGGCGG GCCGGGCATG CAGGAGATGC TCTACCCCAC CAGCTACCTC AAGTCCCGCG GCCTGGGCAA GGACTGCGCG CTGATCACCG ACGGCCGCTT CTCCGGCGGC ACCTCGGGGC TGTCCATCGG CCACGTCTCC CCGGAGGCGG CCGAGGGCGG CAACATCGCC CTGATCGAGC CGGGCGACCG GATCTGCATC GACATCCCCA AGCGCAGCAT CCGCATCGAT ATCAGCGACG AGGAATTGGC CCGCCGCCGC GAGGCCATGG CGGCCAAGGG CCGCGATGCC TGGAAGCCCG CCGCCCCCCG CCAGCGCAAG GTCAGCACGG CCCTGAAAGC CTACGCCAAG 
CTGACCACCA GCGCGGACAA GGGCGCGGTG CGAAACCTGG ACCTGCTGGA CTGA
|
Protein sequence | MPQYRSKTST AGRNMAGARA LWRATGMKDG DFDKPIIAVA NSFTQFVPGH VHLKDLGQLV IEEIEKAGGV GKEFDTIAVD DGIAMGHDGM LYSLPSRDII ADSVEYMVNA HCADALVCIS NCDKITPGML MAAMRLNIPV VFVSGGPMEA GKTKLASGEE IATDLVDAMV AAANPEVSDE DVAIYERSAC PTCGSCSGMF TANSMNCLTE ALGLSLPGNG SLLATHTDRE RLFRDAGRTV VELARRYYEQ DDERCLPRNI ASRSAFRNAM SLDIAMGGST NTVLHLLAAA QEGEVKFDMV DIDKLSRKVP NLCKVAPATP LFHMEDVHRA GGIMGILGEL DRANLLDTTV PTVHSDTLAE ALERWDVKRT DDPAVHDFFK AGPAGVPSQT AFSQSTRFEE LDLDRECGCI RTLEHAYSKD GGLAVLYGNL AERGCIVKTA GVDESILTFE GPAVIFESQD AAVEGILGGQ VKKGNVVIIR YEGPRGGPGM QEMLYPTSYL KSRGLGKDCA LITDGRFSGG TSGLSIGHVS PEAAEGGNIA LIEPGDRICI DIPKRSIRID ISDEELARRR EAMAAKGRDA WKPAAPRQRK VSTALKAYAK LTTSADKGAV RNLDLLD
|
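The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch of how a value such as the GC content field could be recomputed from the gene sequence. It assumes GC content is the fraction of G and C bases over the full gene length, and the helper names `clean_sequence` and `gc_percent` are hypothetical. Only the first two blocks of the gene sequence are used here as a short demonstration.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence in this record.
example = clean_sequence("ATGCCGCAGT ATCGCTCCAA")
print(len(example))         # 20
print(gc_percent(example))  # 55.0
```

Passing the full gene sequence through the same two functions would give a figure to compare against the reported GC content.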