Gene Information |
Locus tag | GWCH70_0213 |
Symbol | |
ID | 7977956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 230837 |
End bp | 233023 |
Gene Length | 2187 bp |
Protein Length | 728 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644797191 |
Product | RNA binding S1 domain protein |
Protein accession | YP_002948410 |
Protein GI | 239825786 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
|
|
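The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the helper name `cds_lengths` is hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumptions (not stated in the record itself):
    - coordinates are 1-based and inclusive on both ends
    - the last codon is a stop codon and is excluded from the protein
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(230837, 233023)
print(gene_len, protein_len)  # 2187 728
```

Under these assumptions the result agrees with the Gene Length (2187 bp) and Protein Length (728 aa) fields in the record.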
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.231146 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTAAATA GAGAAGCGTT GATGGACCTG ATAGCGAATG AATTACATTT GTCCACAAAG CAAGTTAGCA ATGTGATTTC TCTTTCTGAG GAAGGAAATA CGGTGCCGTT TATTGCCCGC TATCGGAAAG AGATGACAGG CGCCTTAGAT GAAGTGCAAA TTCGTGATAT TTTGGAGAAA TGGAATTACC TACAAAACTT AGAACAGCGG AAAGAAGAAG TCCTTCGCCT TATTGATGAA CAAGGAAAAC TAACGGATGA TCTAAAAAAC GCGATCATCA ATGCTACGAA ACTGCAGCAA GTGGAAGATT TGTATCGTCC TTATAGACAA AAACGACGTA CAAAAGCCAC CATTGCTAAG GAAAAAGGGT TAGAGCCTCT TGCGGAATGG TTATGGACGT GTCCAATGAG GCCGCGGCCG GAAGAAAAAG CGCAAGAGTT TATTCAACCA GAAAAAGAAG TACGTACCGT TGAAGAGGCG CTTCAAGGTG CGAAAGACAT CATCGCTGAA AAGGTATCGG ATGATGCACA ATTTCGCCAA TGGATTCGCC AGCACACATG GAAAAAAGGC GTGATCATAT CGACTGTCAA AGAGTCGGAA AATGATGAGA AAAAAGTATA TGAAATGTAT TACGAATATG AAGAGCCAGT ACATAGGATT GTTCCGCATC GTGTATTAGC GCTCAATCGC GGCGAAAAAG AAGGAGTGTT GCGTGTTTCC ATTCAGGCGC CGGTAGAAGA TATTATGACA TACTTACAAA AGCACATTAT TACAAATCCG CAATCTCCCG CCGCTTCCCT CCTTTCTGAA GCGATTGAGG ACGGCTACAA AAGGCTTATT GAACCGTCAA TCGAGCGGGA TATTCGCAAT GAATTAACCG AAAAAGCGGA AGAGCGGGCG ATTCATATTT TTGCGGAGAA CTTACGCAAG CTATTGCTTC AGCCGCCGTT AAAAGGGAAG ATTGTTCTTG GTATAGATCC TGCCTATCGA ACGGGATGCA AGCTGGCGGT GGTCGACGAA ACAGGCAAAT TGCTGAAAAT CGATGTCATT TACCCTCATC CCCCGCAGCA ACAGATAGAA GAAGCGAGAG AAAAGTTAAT CCGCATTATC GAAGAATATC ATGTCGAAAT GATTGCCATT GGGAACGGAA CTGCATCAAG GGAAACGGAG CAATTTGTGG CGGACACCTT AAAACAAGTA GATAAAGAAA TTTTTTACCT TATTGTCAAT GAAGCGGGAG CGAGCGTCTA TTCCGCCTCT GACCTTGCCC GTCAAGAGTT TCCAGATTTA CAGGTAGAAG AACGGAGCGC AGTTTCGATC GCAAGGCGTG TACAAGACCC GCTTGCCGAA CTGGTGAAAA TTGATCCAAA ATCAGTAGGA GTTGGCCAAT ATCAGCACGA CGTTTCACAA AAAAAATTAG CGCAATCATT GCAGTTTGTC GTGGAAACCG TTGTTAACCA AGTTGGCGTC AACGTCAACA CTGCCTCCGT CTCTTTATTG CAATACGTAT CAGGGCTTAC GAAAACAGTA TCGGAAAATA TCGTAAAACG CCGTGAGGAA CAAGGAAAAT TTAAAAACCG CGAGGAATTA AAATCGATAC CGCGGCTTGG TGCTAAAACA TATGAACAGT GTATCGGATT TTTACGCATT ATTGATGGAG ACGAACCGCT CGACCGTACG CCGATTCATC CTGAGCGGTA TGAAGAAGTG AAAAGGCTGT TGCACCAAAT TGGTTTTACA ACTGAACATA TCGGAAGTGA AGAGCTTCGT CAGGCATTGC AATCTCTTTC CATTCCTGAC 
ACGGCTGCTG AACTTGGCAT CGGAGAATTG ACATTACAAG ACATTATCGA CGCTTTAATT CGTCCAGAAC GTGATCCTCG CGATGAGCTG CCAAAGCCGT TATTACGAAA AGACATTTTA AAAATGGAAG ATTTAAAAAG GGGAATGGAG TTAGAAGGAA CGGTGCGGAA CGTCGTCGAT TTCGGAGCGT TTGTGGATAT TGGGGTTAAG CAGGATGGGC TTGTTCACAT TTCAAAATTA AGCAAGCAAT ATGTACGTCA TCCGCTTGAC GTTGTATCAG TAGGCGATGT GGTAAAAGTT TGGGTTGACA ATGTAGATCT CGATAAAGGA AGAATTTCTT TATCTATGAT TCCACCGGAA GAATCAGAAA AAACACTGCT TTCATGA
|
Protein sequence | MLNREALMDL IANELHLSTK QVSNVISLSE EGNTVPFIAR YRKEMTGALD EVQIRDILEK WNYLQNLEQR KEEVLRLIDE QGKLTDDLKN AIINATKLQQ VEDLYRPYRQ KRRTKATIAK EKGLEPLAEW LWTCPMRPRP EEKAQEFIQP EKEVRTVEEA LQGAKDIIAE KVSDDAQFRQ WIRQHTWKKG VIISTVKESE NDEKKVYEMY YEYEEPVHRI VPHRVLALNR GEKEGVLRVS IQAPVEDIMT YLQKHIITNP QSPAASLLSE AIEDGYKRLI EPSIERDIRN ELTEKAEERA IHIFAENLRK LLLQPPLKGK IVLGIDPAYR TGCKLAVVDE TGKLLKIDVI YPHPPQQQIE EAREKLIRII EEYHVEMIAI GNGTASRETE QFVADTLKQV DKEIFYLIVN EAGASVYSAS DLARQEFPDL QVEERSAVSI ARRVQDPLAE LVKIDPKSVG VGQYQHDVSQ KKLAQSLQFV VETVVNQVGV NVNTASVSLL QYVSGLTKTV SENIVKRREE QGKFKNREEL KSIPRLGAKT YEQCIGFLRI IDGDEPLDRT PIHPERYEEV KRLLHQIGFT TEHIGSEELR QALQSLSIPD TAAELGIGEL TLQDIIDALI RPERDPRDEL PKPLLRKDIL KMEDLKRGME LEGTVRNVVD FGAFVDIGVK QDGLVHISKL SKQYVRHPLD VVSVGDVVKV WVDNVDLDKG RISLSMIPPE ESEKTLLS
|
| |
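The relationship between the gene sequence and the protein sequence under translation table 11 can be sketched as follows. This is a minimal standard-library illustration, not the tool used to produce the record: the names `CODON_TABLE`, `START_CODONS` and `translate_cds` are hypothetical. It relies on table 11 sharing its amino acid assignments with the standard code, and on an alternative start codon such as TTG being read as methionine at the first position.

```python
from itertools import product

# Codon table in TCAG order; table 11 uses the same amino acid assignments
# as the standard code and differs in which codons may act as start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a CDS, reading a recognized start codon as M and
    stopping at the first stop codon. Whitespace in the input is ignored."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE.get(codon, "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence given in the record.
prefix = "TTGTTAAATA GAGAAGCGTT GATGGACCTG"
print(translate_cds(prefix))  # MLNREALMDL
```

The output matches the first ten residues of the protein sequence in the record, including the leading M encoded by the TTG start codon.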