Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0688 |
Symbol | |
ID | 7978868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 752403 |
End bp | 755258 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 644797673 |
Product | diguanylate cyclase/phosphodiesterase with PAS/PAC sensor(s) |
Protein accession | YP_002948847 |
Protein GI | 239826223 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00399648 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATGT ACCGTCTACA TTGTCATAAT ACAGAAGAAT TGTATGACTT TATCGCACAG CACCACCTTG CCCAATATAA ACATATTTTT GTACAAGTTG CAGCCAATAA AGTCGATCAG CTTGAACTTC GGAAAATGAT TGGGTTGCTT CAGCGTTATC TTCCACAAGC GCAGCTATTT GGCGTCACAT ACGGCGAACA TTTTGGTTTC GATGACAAAT TTTTTATATG TTTTACTGTG TTCGAAAAAG TTAGCGTTCA CTCTGTTTTA TTATCATACG AAGAATTTGC AAATGAACTT GAGCTTGTAA CATATATTTC GGATACGCTT ATAACGGAAG AAACAAATCT TCTTCTTTTA TTTACTGACC AGAATAGCAA TCTTCATTCC CTTATTCGCC ATATACCACT AGCAAATGAA AAAACGGTTG TGATTGCCGG GCGAATTAAT GAAGGAGAGC GTTTGTTTTC GCATGAAGGA TTTGTCACTG GCGGTATGCT GGCAATTAGT TTTAACGGTT CTGCTTTTCG TGTTCAATCA TCACATCCGT TTTTATGGGA ACCAGTTGGC ACTGCTTTTC GAATAACGAA ATGCAGCGGA AATAAGTTAT ACGAGCTTGA TGGAAAAAAA GCAATAAAAT TGCTGCAACG TTATTTAGGA AAGGAATTTA TCGATCGATT GCCTTTTTCA GGAGCGGAAT TTCCGTTCAT TATTGAGAAA AATGGCCATA AAGAATGTTT ATCCATCGTT AAAGTAAATG AGGATGGTTC TATTGAACTA AACGGTTACG TCCATCAAGG GGAGAAGGTT AAGTTTAGCT ACGTTCATTT AACGTCATTG GTTTGGACTA TATCCGATGA ATTAAACAAA TTGACGAAAA AATACACGGA AGCGATTTTT TTCTATCGTA GCATCGCCGT ACAAGGCTAT GCATATCCAG TGCTAGAGCA AATAACGACA ACGCTGGAGC AAGTTGCTCC TACTTTTACG CCATTTATAT TTGTAGAACT GGTGATCAAG GATAGAGATA TACGTTCAGC CACTTTTAGC ATGCTTGCGT TATCGGAGGG AAATCAGAAC GGAGAGAATA GCAGTGCTTC CGTTTCTTTA TCTATTCCTG AGATGTTTCA AGGGGTGATG ACGCTAGCGA ATTTAATGTC CACCTCCTCT CGAGAAATGG AGAGGCTTCG CGTTCATGTA CAAATATCAC AATCACTTTT TGAACATAAT ACAGATATTG TATATTCTAC TGATTTACAT GGGAATTTTA CGAACGTGAA TCCCGCTTTC GAGAAAATTC TAGGTTACAC AAAAGAGGAA ATTTTACATA CGAACGCTTT AAAATACATG CATCCGAATG ATGTACCTCG TGTTAGCAGA CATTTTTATC GAGCTTTGCG TGGAAAAATT CAATATTATA ATTTGGAGAT TCCTACAAAA TCAGGGGAAA CATTATTATT CCAAATTAAA AACGTTCCAA TTATCGTTGA TGGGAAAAAA GTAGGCATTT ACGGGATTGG AAGAGATATT ACCGAACAAA AAAAAGCGGA AGAAAAAATT TCATATTTAG CGTATTATGA CCCGGATACC CATCTTCCAA ACCGGACAAA GTTTATGGAA ATCATCGATG AACAGTTAGA AAAGGCGAAG CAAAAAAATA GGAAGCTAGC AGTCGTATTC ATCGATTTAG ACCGTTTTAA GCTAGTTAAC GATAGTATAG GGCATTATGC GGGAGATGAA ATTTTAAAAC AAGTCGTTCA GCGCATTCAG CATGTTTTGC CAGCTGGAGC GTATTTAGGA AGGTTTCATG GGGATAAGTT TTGTTTACTT TTAACAGAGC GCACCGATTC AGAAGGAGTG TTTAAAACAG CAGCACACAT TTCAAAAGAA GTGATGAAGC CGATTGTATA TGAAAACAAA GAATTTTTTA TCACTATAAG TATTGGAATC AGCTTCTATC CAAACGATGG TGTGGATAAA CATTCATTGC TCAAAAATGC TGATATCGCC CTAAATAAAG CAAAACAGAG TGGGGGAAAT CGAATACAAT TTTATTCTGC AAAAATGAAT GAAGAAACGT TATACCGTTT AGAGATGGAG AGATATTTGC GAAAAGCACT GGAAAATCAA GAATTTTTTC TATGCTATCA GCCGATTATT GATATACATA AAGGTGTTAT TGTCGGAAAC GAAGCATTAA TTCGTTGGCG CCATCCAAAG CTAGGACTTG TTAGACCGAA TGAATTTATT TCACTAGCAG AAGAGACGGG GCTCATTCAT GAAATTGGAA GATGGGTGTT AGTAACGGCT TGCAAACAAA CAAAAAAATG GCAGCAATTA TGGAATCAAC AACTCTTCGT TTCTGTGAAT GTGTCGGCGA GACAATTTCA GCATGAAGGC TTTATTGATG ATGTAAAACA GGCGCTAGAG CAATCGCAGC TCTCCCCAAA TTGCTTACAT TTAGAATTAA CGGAGAACTC TATGCTTCGT AATCTTCATT ACAGTATTCA AGTAATGAAA GAATTACAGC AGCTCGGTGT CGGTATTGCT ATTGATGATT TTGGAAGTGG ATACGCTTCT TTTAGTTATT TAAAAAATTT ACCAGTAAAC ATATTAAAAA TTGATCGTTC ATTTGTTGAG CAAATGCATA CAAATTCTTC GGATATCGCG ATCGTAAAGG CAATTATTAC GATGGGGCAC GGATTAGGGC TAAAAACAGT AGCCGAAGGG GTGGAAACGG TCGAGCAGCT GGAATTGTTA AAAATGCTGC ATTGCCATCA CGCACAAGGA TATGCATTGT ATCGTCCGGT AACCGCAGAA GAATTGTCAA CATATATGAC GGTAAGCCAC AAATAG
|
Protein sequence | MNMYRLHCHN TEELYDFIAQ HHLAQYKHIF VQVAANKVDQ LELRKMIGLL QRYLPQAQLF GVTYGEHFGF DDKFFICFTV FEKVSVHSVL LSYEEFANEL ELVTYISDTL ITEETNLLLL FTDQNSNLHS LIRHIPLANE KTVVIAGRIN EGERLFSHEG FVTGGMLAIS FNGSAFRVQS SHPFLWEPVG TAFRITKCSG NKLYELDGKK AIKLLQRYLG KEFIDRLPFS GAEFPFIIEK NGHKECLSIV KVNEDGSIEL NGYVHQGEKV KFSYVHLTSL VWTISDELNK LTKKYTEAIF FYRSIAVQGY AYPVLEQITT TLEQVAPTFT PFIFVELVIK DRDIRSATFS MLALSEGNQN GENSSASVSL SIPEMFQGVM TLANLMSTSS REMERLRVHV QISQSLFEHN TDIVYSTDLH GNFTNVNPAF EKILGYTKEE ILHTNALKYM HPNDVPRVSR HFYRALRGKI QYYNLEIPTK SGETLLFQIK NVPIIVDGKK VGIYGIGRDI TEQKKAEEKI SYLAYYDPDT HLPNRTKFME IIDEQLEKAK QKNRKLAVVF IDLDRFKLVN DSIGHYAGDE ILKQVVQRIQ HVLPAGAYLG RFHGDKFCLL LTERTDSEGV FKTAAHISKE VMKPIVYENK EFFITISIGI SFYPNDGVDK HSLLKNADIA LNKAKQSGGN RIQFYSAKMN EETLYRLEME RYLRKALENQ EFFLCYQPII DIHKGVIVGN EALIRWRHPK LGLVRPNEFI SLAEETGLIH EIGRWVLVTA CKQTKKWQQL WNQQLFVSVN VSARQFQHEG FIDDVKQALE QSQLSPNCLH LELTENSMLR NLHYSIQVMK ELQQLGVGIA IDDFGSGYAS FSYLKNLPVN ILKIDRSFVE QMHTNSSDIA IVKAIITMGH GLGLKTVAEG VETVEQLELL KMLHCHHAQG YALYRPVTAE ELSTYMTVSH K
|
| |