Gene GWCH70_2014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2014 
Symbol 
ID7978968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2074199 
End bp2075875 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content50% 
IMG OID644798837 
Productdihydroxy-acid dehydratase 
Protein accessionYP_002950007 
Protein GI239827383 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0410747 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGAAAGC TTCGCAGCGA TATGATCAAA AAAGGATTTG ACCGGGCGCC GCACCGCAGC 
CTGTTGCGCG CGGCAGGCGT AAAGGAAGAA GATTTTGACA AGCCGTTTAT CGCCGTGGTG
AACTCGTATA TTGACATTAT TCCGGGACAC GTGCATTTGC AGGAGTTCGG GAAAATTGTC
AAAGAAGCGA TTCGCGAAGC GGGCGGAGTC CCGTTTGAAA TGAATACAAT CGGCGTCGAC
GATGGGATTG CGATGGGGCA TATCGGCATG CGCTATTCGC TTCCAAGCCG CGAAATTATC
GCGGATTCGA TCGAAACGGT GATTTCCGCG CATTGGTTTG ATGGAATGGT ATGCATTCCA
AACTGTGATA AAATTACGCC GGGCATGATG ATGGCGGCGA TGCGGTTAAA CATTCCGACG
ATTTTTGTCA GCGGCGGGCC GATGAAGGCT GGCGTGACGA GCGACGGGCG AAAAATTTCG
CTTTCTTCCG TATTTGAAGG GGTCGGCGCT TATCAGGCGG GAAAAATTGA TGAAAAAGGA
TTGCAGGAAT TAGAAAAATA CGGCTGTCCG ACGTGCGGCT CCTGTTCGGG CATGTTTACA
GCAAACTCGA TGAACTGTTT GGCGGAAGCG CTAGGTCTCG CACTGCCAGG AAATGGAACC
ATTTTAGCGG TCGATCCGGC ACGGAAAGAA TTGGTTCGCC AATCGGCGAA ACAATTAATG
TATTTGATCG AGCATGACAT TAAACCGAGC GATATTGTCA CAGAAAAAGC GATTGATAAC
GCGTTTGCGC TTGATATGGC GCTTGGCGGC TCGACAAATA CCGTATTACA TACGCTTGCA
ATTGCAAACG AGGCAGGGAT CGATTATTCG CTCGAACGAA TTAACGAAAT CGCTGCAAGG
GTGCCGCATC TTGCGAAACT AGCGCCAGCT TCCGATGTGC ATATTGAAGA CTTGCACGAA
GCGGGCGGCG TATCAGCCGT ATTAAATGAA TTAGCGAAAA AAGAAGGAAC GCTGCATCTC
GATACGTTGA CCGTAACCGG AAAAACGCTT GGGGAAAACA TCGCCGGCTG TGAAGTCAAA
GACTATAATG TCATCCGCCC AATTGATAAC CCGTATTCCG AAACAGGCGG ACTTGCCGTA
TTGTTTGGAA ATCTCGCTCC AGATGGCGCG ATCATTAAAA CAGGAGGCGT GCAAGCCGGC
ATTACGCGCC ATGAAGGGCC AGCGATCGTA TTTGATTCGC AAGAAGAGGC GCTCGAAGGC
ATTGCGAGCG GGAAAGTAAA ACCAGGCCAT GTTGTCGTCA TCCGCTATGA AGGGCCAAAA
GGAGGACCGG GAATGCCGGA AATGCTCGCG CCAACTTCGC AAATCGTCGG CATGGGGCTT
GGAACAAAAG TCGCGCTTAT CACCGACGGA CGTTTCTCAG GGGCATCCCG CGGTTTATCG
GTTGGACACG TTTCCCCAGA AGCGGCGGAA GGCGGTCCGA TTGCCTTTAT TGAAGACGGA
GACATTATTG AAATCGATAT TACGAACAGA ACGATTAACG CAAAGCTTTC TGACGAAGAA
TGGGAAAAAC GGAAAGCGAA CTGGAAAGGG TTTGAACCGA AAGTAAAAAC CGGCTACCTC
GCACGCTACT CGAAGCTCGT TACTTCCGCG AGCACGGGCG GGATTATGAA AATTTAA
 
Protein sequence
MRKLRSDMIK KGFDRAPHRS LLRAAGVKEE DFDKPFIAVV NSYIDIIPGH VHLQEFGKIV 
KEAIREAGGV PFEMNTIGVD DGIAMGHIGM RYSLPSREII ADSIETVISA HWFDGMVCIP
NCDKITPGMM MAAMRLNIPT IFVSGGPMKA GVTSDGRKIS LSSVFEGVGA YQAGKIDEKG
LQELEKYGCP TCGSCSGMFT ANSMNCLAEA LGLALPGNGT ILAVDPARKE LVRQSAKQLM
YLIEHDIKPS DIVTEKAIDN AFALDMALGG STNTVLHTLA IANEAGIDYS LERINEIAAR
VPHLAKLAPA SDVHIEDLHE AGGVSAVLNE LAKKEGTLHL DTLTVTGKTL GENIAGCEVK
DYNVIRPIDN PYSETGGLAV LFGNLAPDGA IIKTGGVQAG ITRHEGPAIV FDSQEEALEG
IASGKVKPGH VVVIRYEGPK GGPGMPEMLA PTSQIVGMGL GTKVALITDG RFSGASRGLS
VGHVSPEAAE GGPIAFIEDG DIIEIDITNR TINAKLSDEE WEKRKANWKG FEPKVKTGYL
ARYSKLVTSA STGGIMKI