Gene GWCH70_1191 details

Gene Information

Locus tag: GWCH70_1191
Symbol:
ID: 7979302
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 1243076
End bp: 1244632
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 46%
IMG OID: 644798144
Product: phosphodiesterase
Protein accession: YP_002949317
Protein GI: 239826693
COG category: [R] General function prediction only
COG ID: [COG1418] Predicted HD superfamily hydrolase
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
            [TIGR03319] conserved hypothetical protein YmdA/YtgF
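
The coordinate and length fields in this record are linked by simple arithmetic. The sketch below shows one way to check them, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1243076, 1244632)
print(length_bp, protein_length(length_bp))  # 1557 518
```

Under these assumptions the result agrees with the Gene Length (1557 bp) and Protein Length (518 aa) fields above.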


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000197151
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGTCAA TCATCATCTC CGCTTTGCTT GCCTTAGTTG TCGGTGCCGT TGTCGGCTTT 
TTTATTCGAA AATCCATTGC AGAAGCGAAA ATTGGCGGTG CACAAGCAGC TGCAAACCAA
ATCATTGAAG ATGCGAAACG AGAAGCTGAT GCGCTGAAAA AGGAAGCGCT TCTTGAAGCA
AAGGATGAAA TTCATAAACT TCGTACAGAG GTTGAACGTG AAATTCGCGA TCGAAGAAGC
GAGTTGCAAA AACAAGAAAA CCGATTGCTG CAAAAAGAAG AAAATCTTGA CCGAAAAGAT
GAGGCGCTGA ATAAACGAGA AGCGCTCTTA GAATCGAAAG AGGAAGCACT GAATCAAAGA
CAACAACATA TTGAACAGAT GGAAAGCAAA GTGGAAGAGC TCGTTCAAAA GGAACAAATG
GAATTGGAAC GAATTTCTGG TCTAACACGC GAAGAAGCAC GCCAAGTTAT TTTGGAGCGT
GTCGAAAAAG AGCTATCTCA TGAAATTGCA ATGATGGTGA AAGAAGCCGA GACCCGCGCG
AAAGAAGAGG CGGATAAAAG AGCAAAAGCA ATTTTATCGC TGGCGATTCA GCGCTGTGCG
GCTGACCATG TCGCCGAAAC GACCGTATCT GTCGTTAATT TGCCAAACGA TGAAATGAAA
GGCCGGATCA TTGGTCGTGA AGGACGGAAT ATTCGTACGC TTGAAACGCT CACCGGTATT
GATTTAATTA TCGATGATAC GCCGGAGGCA GTTATTTTAT CGGGATTTGA TCCAATCCGC
CGTGAAACGG CTAGAATTGC TTTAGACAAA CTTGTTCAAG ATGGACGCAT TCACCCGGCA
AGAATTGAGG AAATGGTCGA AAAAGCAAGA CGTGAAGTGG ATGAGCATAT TCGTGAAGTC
GGCGAACAAA CCACCTTTGA AGTTGGCGTT CACGGGTTAC ATCCGGATTT AATAAAAATT
TTAGGACGCC TCAAATTCCG GACAAGCTAC GGGCAAAACG TCTTGAAGCA TTCAATTGAA
GTGGCGTTTT TAGCCGGGTT GATGGCGGCG GAACTTGGCG AAGATGAAAT GTTAGCAAGA
CGTGCTGGCC TCCTGCACGA TATTGGCAAG GCGATTGACC ATGAAGTGGA AGGAAGCCAT
GTTGAAATCG GTGTAGAATT GGCGACAAAA TATAAAGAAC ACCCGGTTGT CATTAACAGC
ATCGCTTCCC ATCATGGTGA TACGGAGCCA ACTTCCGTCA TTGCCGTGCT CGTTGCAGCG
GCTGATGCAC TTTCTGCGGC AAGACCGGGA GCGCGCAGTG AAACATTGGA AAACTATATT
CGCCGCCTCG AAAAATTGGA GGAAATCGCT GAATCGTACG AAGGTGTGGA GAAATCATAT
GCGATTCAAG CAGGTCGAGA AGTGCGTATT ATGGTGAAGC CGGATATGAT TGATGATTTA
GAAGCGCATC GATTGGCGCG GGAAATTCGT AAACGGATCG AGGAGGAACT CGATTATCCG
GGACACATTA AGGTTACCGT TATTCGTGAA ACAAGAGCGG TAGAATATGC AAAATAA
 
Protein sequence
MGSIIISALL ALVVGAVVGF FIRKSIAEAK IGGAQAAANQ IIEDAKREAD ALKKEALLEA 
KDEIHKLRTE VEREIRDRRS ELQKQENRLL QKEENLDRKD EALNKREALL ESKEEALNQR
QQHIEQMESK VEELVQKEQM ELERISGLTR EEARQVILER VEKELSHEIA MMVKEAETRA
KEEADKRAKA ILSLAIQRCA ADHVAETTVS VVNLPNDEMK GRIIGREGRN IRTLETLTGI
DLIIDDTPEA VILSGFDPIR RETARIALDK LVQDGRIHPA RIEEMVEKAR REVDEHIREV
GEQTTFEVGV HGLHPDLIKI LGRLKFRTSY GQNVLKHSIE VAFLAGLMAA ELGEDEMLAR
RAGLLHDIGK AIDHEVEGSH VEIGVELATK YKEHPVVINS IASHHGDTEP TSVIAVLVAA
ADALSAARPG ARSETLENYI RRLEKLEEIA ESYEGVEKSY AIQAGREVRI MVKPDMIDDL
EAHRLAREIR KRIEEELDYP GHIKVTVIRE TRAVEYAK
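
The gene and protein sequences above are printed in space-separated blocks of ten, and the protein follows from the gene through translation table 11. The sketch below shows one way to strip the block formatting, compute a GC fraction, and translate a fragment. It assumes the codon assignments of table 11 match the standard code and that the reading frame starts at the first base. It does not handle alternative start codons such as GTG or TTG, which this gene does not need because it begins with ATG. All function and variable names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence listed above.
print(translate("ATGGGGTCAA TCATCATCTC CGCTTTGCTT"))  # MGSIIISALL
print(translate("GAATATGC AAAATAA"))                   # EYAK
```

The two fragments reproduce the first ten and last four residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives the value to compare with the GC content field. A short fragment is not representative of the whole gene.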