Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2014 |
Symbol | |
ID | 7978968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2074199 |
End bp | 2075875 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644798837 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_002950007 |
Protein GI | 239827383 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0410747 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGAAAGC TTCGCAGCGA TATGATCAAA AAAGGATTTG ACCGGGCGCC GCACCGCAGC CTGTTGCGCG CGGCAGGCGT AAAGGAAGAA GATTTTGACA AGCCGTTTAT CGCCGTGGTG AACTCGTATA TTGACATTAT TCCGGGACAC GTGCATTTGC AGGAGTTCGG GAAAATTGTC AAAGAAGCGA TTCGCGAAGC GGGCGGAGTC CCGTTTGAAA TGAATACAAT CGGCGTCGAC GATGGGATTG CGATGGGGCA TATCGGCATG CGCTATTCGC TTCCAAGCCG CGAAATTATC GCGGATTCGA TCGAAACGGT GATTTCCGCG CATTGGTTTG ATGGAATGGT ATGCATTCCA AACTGTGATA AAATTACGCC GGGCATGATG ATGGCGGCGA TGCGGTTAAA CATTCCGACG ATTTTTGTCA GCGGCGGGCC GATGAAGGCT GGCGTGACGA GCGACGGGCG AAAAATTTCG CTTTCTTCCG TATTTGAAGG GGTCGGCGCT TATCAGGCGG GAAAAATTGA TGAAAAAGGA TTGCAGGAAT TAGAAAAATA CGGCTGTCCG ACGTGCGGCT CCTGTTCGGG CATGTTTACA GCAAACTCGA TGAACTGTTT GGCGGAAGCG CTAGGTCTCG CACTGCCAGG AAATGGAACC ATTTTAGCGG TCGATCCGGC ACGGAAAGAA TTGGTTCGCC AATCGGCGAA ACAATTAATG TATTTGATCG AGCATGACAT TAAACCGAGC GATATTGTCA CAGAAAAAGC GATTGATAAC GCGTTTGCGC TTGATATGGC GCTTGGCGGC TCGACAAATA CCGTATTACA TACGCTTGCA ATTGCAAACG AGGCAGGGAT CGATTATTCG CTCGAACGAA TTAACGAAAT CGCTGCAAGG GTGCCGCATC TTGCGAAACT AGCGCCAGCT TCCGATGTGC ATATTGAAGA CTTGCACGAA GCGGGCGGCG TATCAGCCGT ATTAAATGAA TTAGCGAAAA AAGAAGGAAC GCTGCATCTC GATACGTTGA CCGTAACCGG AAAAACGCTT GGGGAAAACA TCGCCGGCTG TGAAGTCAAA GACTATAATG TCATCCGCCC AATTGATAAC CCGTATTCCG AAACAGGCGG ACTTGCCGTA TTGTTTGGAA ATCTCGCTCC AGATGGCGCG ATCATTAAAA CAGGAGGCGT GCAAGCCGGC ATTACGCGCC ATGAAGGGCC AGCGATCGTA TTTGATTCGC AAGAAGAGGC GCTCGAAGGC ATTGCGAGCG GGAAAGTAAA ACCAGGCCAT GTTGTCGTCA TCCGCTATGA AGGGCCAAAA GGAGGACCGG GAATGCCGGA AATGCTCGCG CCAACTTCGC AAATCGTCGG CATGGGGCTT GGAACAAAAG TCGCGCTTAT CACCGACGGA CGTTTCTCAG GGGCATCCCG CGGTTTATCG GTTGGACACG TTTCCCCAGA AGCGGCGGAA GGCGGTCCGA TTGCCTTTAT TGAAGACGGA GACATTATTG AAATCGATAT TACGAACAGA ACGATTAACG CAAAGCTTTC TGACGAAGAA TGGGAAAAAC GGAAAGCGAA CTGGAAAGGG TTTGAACCGA AAGTAAAAAC CGGCTACCTC GCACGCTACT CGAAGCTCGT TACTTCCGCG AGCACGGGCG GGATTATGAA AATTTAA
|
Protein sequence | MRKLRSDMIK KGFDRAPHRS LLRAAGVKEE DFDKPFIAVV NSYIDIIPGH VHLQEFGKIV KEAIREAGGV PFEMNTIGVD DGIAMGHIGM RYSLPSREII ADSIETVISA HWFDGMVCIP NCDKITPGMM MAAMRLNIPT IFVSGGPMKA GVTSDGRKIS LSSVFEGVGA YQAGKIDEKG LQELEKYGCP TCGSCSGMFT ANSMNCLAEA LGLALPGNGT ILAVDPARKE LVRQSAKQLM YLIEHDIKPS DIVTEKAIDN AFALDMALGG STNTVLHTLA IANEAGIDYS LERINEIAAR VPHLAKLAPA SDVHIEDLHE AGGVSAVLNE LAKKEGTLHL DTLTVTGKTL GENIAGCEVK DYNVIRPIDN PYSETGGLAV LFGNLAPDGA IIKTGGVQAG ITRHEGPAIV FDSQEEALEG IASGKVKPGH VVVIRYEGPK GGPGMPEMLA PTSQIVGMGL GTKVALITDG RFSGASRGLS VGHVSPEAAE GGPIAFIEDG DIIEIDITNR TINAKLSDEE WEKRKANWKG FEPKVKTGYL ARYSKLVTSA STGGIMKI
|
| |