Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1191 |
Symbol | |
ID | 7979302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1243076 |
End bp | 1244632 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644798144 |
Product | phosphodiesterase |
Protein accession | YP_002949317 |
Protein GI | 239826693 |
COG category | [R] General function prediction only |
COG ID | [COG1418] Predicted HD superfamily hydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG [TIGR03319] conserved hypothetical protein YmdA/YtgF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000197151 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGTCAA TCATCATCTC CGCTTTGCTT GCCTTAGTTG TCGGTGCCGT TGTCGGCTTT TTTATTCGAA AATCCATTGC AGAAGCGAAA ATTGGCGGTG CACAAGCAGC TGCAAACCAA ATCATTGAAG ATGCGAAACG AGAAGCTGAT GCGCTGAAAA AGGAAGCGCT TCTTGAAGCA AAGGATGAAA TTCATAAACT TCGTACAGAG GTTGAACGTG AAATTCGCGA TCGAAGAAGC GAGTTGCAAA AACAAGAAAA CCGATTGCTG CAAAAAGAAG AAAATCTTGA CCGAAAAGAT GAGGCGCTGA ATAAACGAGA AGCGCTCTTA GAATCGAAAG AGGAAGCACT GAATCAAAGA CAACAACATA TTGAACAGAT GGAAAGCAAA GTGGAAGAGC TCGTTCAAAA GGAACAAATG GAATTGGAAC GAATTTCTGG TCTAACACGC GAAGAAGCAC GCCAAGTTAT TTTGGAGCGT GTCGAAAAAG AGCTATCTCA TGAAATTGCA ATGATGGTGA AAGAAGCCGA GACCCGCGCG AAAGAAGAGG CGGATAAAAG AGCAAAAGCA ATTTTATCGC TGGCGATTCA GCGCTGTGCG GCTGACCATG TCGCCGAAAC GACCGTATCT GTCGTTAATT TGCCAAACGA TGAAATGAAA GGCCGGATCA TTGGTCGTGA AGGACGGAAT ATTCGTACGC TTGAAACGCT CACCGGTATT GATTTAATTA TCGATGATAC GCCGGAGGCA GTTATTTTAT CGGGATTTGA TCCAATCCGC CGTGAAACGG CTAGAATTGC TTTAGACAAA CTTGTTCAAG ATGGACGCAT TCACCCGGCA AGAATTGAGG AAATGGTCGA AAAAGCAAGA CGTGAAGTGG ATGAGCATAT TCGTGAAGTC GGCGAACAAA CCACCTTTGA AGTTGGCGTT CACGGGTTAC ATCCGGATTT AATAAAAATT TTAGGACGCC TCAAATTCCG GACAAGCTAC GGGCAAAACG TCTTGAAGCA TTCAATTGAA GTGGCGTTTT TAGCCGGGTT GATGGCGGCG GAACTTGGCG AAGATGAAAT GTTAGCAAGA CGTGCTGGCC TCCTGCACGA TATTGGCAAG GCGATTGACC ATGAAGTGGA AGGAAGCCAT GTTGAAATCG GTGTAGAATT GGCGACAAAA TATAAAGAAC ACCCGGTTGT CATTAACAGC ATCGCTTCCC ATCATGGTGA TACGGAGCCA ACTTCCGTCA TTGCCGTGCT CGTTGCAGCG GCTGATGCAC TTTCTGCGGC AAGACCGGGA GCGCGCAGTG AAACATTGGA AAACTATATT CGCCGCCTCG AAAAATTGGA GGAAATCGCT GAATCGTACG AAGGTGTGGA GAAATCATAT GCGATTCAAG CAGGTCGAGA AGTGCGTATT ATGGTGAAGC CGGATATGAT TGATGATTTA GAAGCGCATC GATTGGCGCG GGAAATTCGT AAACGGATCG AGGAGGAACT CGATTATCCG GGACACATTA AGGTTACCGT TATTCGTGAA ACAAGAGCGG TAGAATATGC AAAATAA
|
Protein sequence | MGSIIISALL ALVVGAVVGF FIRKSIAEAK IGGAQAAANQ IIEDAKREAD ALKKEALLEA KDEIHKLRTE VEREIRDRRS ELQKQENRLL QKEENLDRKD EALNKREALL ESKEEALNQR QQHIEQMESK VEELVQKEQM ELERISGLTR EEARQVILER VEKELSHEIA MMVKEAETRA KEEADKRAKA ILSLAIQRCA ADHVAETTVS VVNLPNDEMK GRIIGREGRN IRTLETLTGI DLIIDDTPEA VILSGFDPIR RETARIALDK LVQDGRIHPA RIEEMVEKAR REVDEHIREV GEQTTFEVGV HGLHPDLIKI LGRLKFRTSY GQNVLKHSIE VAFLAGLMAA ELGEDEMLAR RAGLLHDIGK AIDHEVEGSH VEIGVELATK YKEHPVVINS IASHHGDTEP TSVIAVLVAA ADALSAARPG ARSETLENYI RRLEKLEEIA ESYEGVEKSY AIQAGREVRI MVKPDMIDDL EAHRLAREIR KRIEEELDYP GHIKVTVIRE TRAVEYAK
|
| |