Gene Information |
Locus tag | GWCH70_0653 |
Symbol | |
ID | 7978837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 714926 |
End bp | 717319 |
Gene Length | 2394 bp |
Protein Length | 797 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644797638 |
Product | YhgE/Pip N-terminal domain protein |
Protein accession | YP_002948812 |
Protein GI | 239826188 |
COG category | [S] Function unknown |
COG ID | [COG1511] Predicted membrane protein |
TIGRFAM ID | [TIGR03057] X-X-X-Leu-X-X-Gly heptad repeats; [TIGR03061] YhgE/Pip N-terminal domain; [TIGR03062] YhgE/Pip C-terminal domain |
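The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 714926
end_bp = 717319
gene_length_bp = 2394
protein_length_aa = 797

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span_bp // 3
coding_codons = codons - 1

print("span (bp):", span_bp, "matches Gene Length:", span_bp == gene_length_bp)
print("coding codons:", coding_codons, "matches Protein Length:", coding_codons == protein_length_aa)
```

Under these assumptions the reported Gene Length and Protein Length agree with the coordinates.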
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000732474 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAGGAC TCTCATTATT ATGGAAAGAG GCGCATGCGA TTTTTCGTAA TCGAAAGACA TTGATATCGA TTATTGCTGT TATATGCATT CCGATATTAT ATAGTGGAAT GTTTTTATGG GCGTTTTGGG ATCCATACGC TCATCTGGAT AAATTGCCTG TTGCCGTGGT GAACAACGAC AAAGGAGCAG CGATGAACGG TGAAAAATTG GAAATTGGCG ATAAGTTAGT AGAAAAGCTG AAAGAAAACA AGAAATTTGA TTGGCATTTT GTTTCTGAAA AAGAAGCGGA AAAAGGATTG CAACATCAAA AGTATTATAT GGCCATTGAA ATTCCGGAAG ATTTCTCGGA AAATGCTACG ACTTTACAGG ATGAACATCC AAAACCGATG AAACTCATCT ATAAGCCAAA CGAAGGATTT AACTTCCTAT CGGCGCAAAT TGGCGATAGT GCCGTAGAGA AGATTAAAGA AGAAGTTTCC AATACGGTGA CAGAAACGTA TGCGGAAGCG ATGTTTGAAA ATATCCGCGA GATGGCGAAA GGACTTGATC GGGCTAGCGA AGGTGCAAAA CAGCTGCATG ACGGCATTCA AAAAGCAAAT GATGGCGGTG TCTCGCTGCA AAGAGGTCTG CATTCTGCGA AAGAAGGAAG CGGAAAATTA GCGAAAGGAG CGCATGCGGC AAAAAATGGT GCTAATGAGC TTTATAAAAA CTTAAAACTG CTCGCTGAAA AATCGCTTAC GTTTGAAAAT GGATTGCAAT CGGCTAGCGA AGGCGCTAAT CAGCTAAACG CCGGGCTCTA TCAGCTGCAA AATGGTTTTA CAAAAATGCA GGACGGCCAT TCACGACTAT TAGCTGGCGC GAAACAAGTG GAGAATGGTG CGAAACAACT GTCCGGCGGT CTTCATCAAT CGCTTGATGG AATGCAACAA ATGAAAGGAA ACATTCCGTC ATTGACACAA GGATCGCAAA ATCTACAAAA TGGAGCACAA AAGCTTTCCG CATCGATGGA ACAATGGAAG CAAGGGGCTG AGAAGACAAA TAAAGGAGCT GTCCAAGTAA GCCAAGGATT AGAAAAAGCA GTAGCCCAAT TGGACGCGTT AGCAGCACAA GCAACAGATC CAAAGGAAAA AGCGCTATTA CAAACTTTGA AAGAACAATT GCTTCCTCTA TCCGAAGGAA GCAAACAAGT CGCTCAAGGA ATGGAACAGT TATCGAATAG TGCCAGCCAA TTAAAAACAG GTGCGGATCA GCTTGCTGCG GGAGCTTCTA GGCTCCATAA CGGCCAGCTT GCGTTAAGCG AAGGAGTCGA AAAGCTTTTA GCAGGCCAGC AACGATTAGC AAGCGGTGCT GATGCGCTTG TCGCCGGTCA ATCAAAAGTG GTGCAAGGAT TGACAGTGTT TGGGAAAAAA TTACAAGAAG GAAAAAATGG TGTTGATCAG TTAGTCGCGG GAAGTGATCG ATTGTCTTCT GGCTTGCATC AGCTAGCCCA AGGATCAAAC AAATTAAAAG ATGGCGCCCA TCAGCTTGCG GATGGATCCG GAAAGCTTGC CGATGGCATG AATGACCTTG AAAATGGCAC TGTGTCGCTT TCCAACGGCA TGAATCAACT TGCTAACGGT TTTGATCAGC TTGTAAGTGG AATGAAAAAA TTAGAGGATG GATCGAATGA GTTGGCAGAT AAACTAGCTG ATGGCGCGAA AAAAGCGAGT GATGTCAAAG CAAATGATGA TACTTATAAA ATGTTTGCCG ATCCAGTGAA AAAGAAAAAT GAAAAAATGA ATCATGTGCC AAACTATGGT 
ACCGGATTTA CACCATACTT CTTATCGCTA GGATTGTTTG TCGGCGCCCT TGTTTTGTCC ATTGTGTTTC CATTGCGTGA ACCAGCGGAC GTGCCAAAAT CAGGATTTAG CTGGTTTATT GGGAAATTTG GTATTTTAGT AATTGTTGGG ATTATCCAAT CATTGCTGGC AGATGCCTTA TTACTTGGAG CATTAGACAT TCATGTCCAA AGTGTTCCTA GATTTATACT ATTTACGATG ATCACAAGCA TTACGTTTAT TGCGTTAATT CAATTCCTAG TGACGACGTT GGAAGATCCG GGGCGTTTTA TTGCGATTGT TGTATTAATT TTGCAACTTA CAACAAGTGC TGGAACGTTC CCGCTTGAAT TGATTCCAGA CGTATTACAA TACTTCAATG CTTGGTTGCC GATGACATAT TCGGTGCTTG GATTTAAAGC GGTTATTTCT AGTGGAGATT TTGCGTTTAT GTGGCATAAC GCAGCTGTTT TAGCCTTGTT TATTGCCATT TTCATGGTAG GAACAATTTT ATACTTTATT GCACAGCATA AACGTCAGTT TGATAAACAG ATGAACGAAG CTTCCGAAGC GTAA
|
Protein sequence | MRGLSLLWKE AHAIFRNRKT LISIIAVICI PILYSGMFLW AFWDPYAHLD KLPVAVVNND KGAAMNGEKL EIGDKLVEKL KENKKFDWHF VSEKEAEKGL QHQKYYMAIE IPEDFSENAT TLQDEHPKPM KLIYKPNEGF NFLSAQIGDS AVEKIKEEVS NTVTETYAEA MFENIREMAK GLDRASEGAK QLHDGIQKAN DGGVSLQRGL HSAKEGSGKL AKGAHAAKNG ANELYKNLKL LAEKSLTFEN GLQSASEGAN QLNAGLYQLQ NGFTKMQDGH SRLLAGAKQV ENGAKQLSGG LHQSLDGMQQ MKGNIPSLTQ GSQNLQNGAQ KLSASMEQWK QGAEKTNKGA VQVSQGLEKA VAQLDALAAQ ATDPKEKALL QTLKEQLLPL SEGSKQVAQG MEQLSNSASQ LKTGADQLAA GASRLHNGQL ALSEGVEKLL AGQQRLASGA DALVAGQSKV VQGLTVFGKK LQEGKNGVDQ LVAGSDRLSS GLHQLAQGSN KLKDGAHQLA DGSGKLADGM NDLENGTVSL SNGMNQLANG FDQLVSGMKK LEDGSNELAD KLADGAKKAS DVKANDDTYK MFADPVKKKN EKMNHVPNYG TGFTPYFLSL GLFVGALVLS IVFPLREPAD VPKSGFSWFI GKFGILVIVG IIQSLLADAL LLGALDIHVQ SVPRFILFTM ITSITFIALI QFLVTTLEDP GRFIAIVVLI LQLTTSAGTF PLELIPDVLQ YFNAWLPMTY SVLGFKAVIS SGDFAFMWHN AAVLALFIAI FMVGTILYFI AQHKRQFDKQ MNEASEA
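The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. This is a minimal sketch, not the database's own method: `translate` and `CODON_TABLE` are hypothetical names, the excerpts are the first 30 and last 24 nucleotides of the gene sequence in this record, and the sketch relies on translation table 11 assigning the same amino acids to codons as the standard code (it does not handle alternative start codons, which are not needed here because the gene starts with ATG).

```python
# Codon table built in the conventional T, C, A, G order.
# Assumption: amino acid assignments for translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# Excerpts copied from the gene sequence in this record.
first_30 = "ATGCGAGGAC TCTCATTATT ATGGAAAGAG"
last_24 = "ATGAACGAAG CTTCCGAAGC GTAA"

print(translate(first_30))  # compare with the start of the protein sequence
print(translate(last_24))   # compare with the end of the protein sequence
```

The two translated excerpts can be compared against the first ten and last seven residues of the protein sequence above.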