Gene GWCH70_0653 details


Gene Information

Locus tag: GWCH70_0653
Symbol:
ID: 7978837
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 714926
End bp: 717319
Gene Length: 2394 bp
Protein Length: 797 aa
Translation table: 11
GC content: 42%
IMG OID: 644797638
Product: YhgE/Pip N-terminal domain protein
Protein accession: YP_002948812
Protein GI: 239826188
COG category: [S] Function unknown
COG ID: [COG1511] Predicted membrane protein
TIGRFAM ID: [TIGR03057] X-X-X-Leu-X-X-Gly heptad repeats
    [TIGR03061] YhgE/Pip N-terminal domain
    [TIGR03062] YhgE/Pip C-terminal domain
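
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Start bp and End bp fields of the record.
start_bp = 714926
end_bp = 717319

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its last codon is a stop codon,
# which does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints "2394 797"
```

Under these assumptions the computed values agree with the Gene Length (2394 bp) and Protein Length (797 aa) fields.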


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000732474
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAGGAC TCTCATTATT ATGGAAAGAG GCGCATGCGA TTTTTCGTAA TCGAAAGACA 
TTGATATCGA TTATTGCTGT TATATGCATT CCGATATTAT ATAGTGGAAT GTTTTTATGG
GCGTTTTGGG ATCCATACGC TCATCTGGAT AAATTGCCTG TTGCCGTGGT GAACAACGAC
AAAGGAGCAG CGATGAACGG TGAAAAATTG GAAATTGGCG ATAAGTTAGT AGAAAAGCTG
AAAGAAAACA AGAAATTTGA TTGGCATTTT GTTTCTGAAA AAGAAGCGGA AAAAGGATTG
CAACATCAAA AGTATTATAT GGCCATTGAA ATTCCGGAAG ATTTCTCGGA AAATGCTACG
ACTTTACAGG ATGAACATCC AAAACCGATG AAACTCATCT ATAAGCCAAA CGAAGGATTT
AACTTCCTAT CGGCGCAAAT TGGCGATAGT GCCGTAGAGA AGATTAAAGA AGAAGTTTCC
AATACGGTGA CAGAAACGTA TGCGGAAGCG ATGTTTGAAA ATATCCGCGA GATGGCGAAA
GGACTTGATC GGGCTAGCGA AGGTGCAAAA CAGCTGCATG ACGGCATTCA AAAAGCAAAT
GATGGCGGTG TCTCGCTGCA AAGAGGTCTG CATTCTGCGA AAGAAGGAAG CGGAAAATTA
GCGAAAGGAG CGCATGCGGC AAAAAATGGT GCTAATGAGC TTTATAAAAA CTTAAAACTG
CTCGCTGAAA AATCGCTTAC GTTTGAAAAT GGATTGCAAT CGGCTAGCGA AGGCGCTAAT
CAGCTAAACG CCGGGCTCTA TCAGCTGCAA AATGGTTTTA CAAAAATGCA GGACGGCCAT
TCACGACTAT TAGCTGGCGC GAAACAAGTG GAGAATGGTG CGAAACAACT GTCCGGCGGT
CTTCATCAAT CGCTTGATGG AATGCAACAA ATGAAAGGAA ACATTCCGTC ATTGACACAA
GGATCGCAAA ATCTACAAAA TGGAGCACAA AAGCTTTCCG CATCGATGGA ACAATGGAAG
CAAGGGGCTG AGAAGACAAA TAAAGGAGCT GTCCAAGTAA GCCAAGGATT AGAAAAAGCA
GTAGCCCAAT TGGACGCGTT AGCAGCACAA GCAACAGATC CAAAGGAAAA AGCGCTATTA
CAAACTTTGA AAGAACAATT GCTTCCTCTA TCCGAAGGAA GCAAACAAGT CGCTCAAGGA
ATGGAACAGT TATCGAATAG TGCCAGCCAA TTAAAAACAG GTGCGGATCA GCTTGCTGCG
GGAGCTTCTA GGCTCCATAA CGGCCAGCTT GCGTTAAGCG AAGGAGTCGA AAAGCTTTTA
GCAGGCCAGC AACGATTAGC AAGCGGTGCT GATGCGCTTG TCGCCGGTCA ATCAAAAGTG
GTGCAAGGAT TGACAGTGTT TGGGAAAAAA TTACAAGAAG GAAAAAATGG TGTTGATCAG
TTAGTCGCGG GAAGTGATCG ATTGTCTTCT GGCTTGCATC AGCTAGCCCA AGGATCAAAC
AAATTAAAAG ATGGCGCCCA TCAGCTTGCG GATGGATCCG GAAAGCTTGC CGATGGCATG
AATGACCTTG AAAATGGCAC TGTGTCGCTT TCCAACGGCA TGAATCAACT TGCTAACGGT
TTTGATCAGC TTGTAAGTGG AATGAAAAAA TTAGAGGATG GATCGAATGA GTTGGCAGAT
AAACTAGCTG ATGGCGCGAA AAAAGCGAGT GATGTCAAAG CAAATGATGA TACTTATAAA
ATGTTTGCCG ATCCAGTGAA AAAGAAAAAT GAAAAAATGA ATCATGTGCC AAACTATGGT
ACCGGATTTA CACCATACTT CTTATCGCTA GGATTGTTTG TCGGCGCCCT TGTTTTGTCC
ATTGTGTTTC CATTGCGTGA ACCAGCGGAC GTGCCAAAAT CAGGATTTAG CTGGTTTATT
GGGAAATTTG GTATTTTAGT AATTGTTGGG ATTATCCAAT CATTGCTGGC AGATGCCTTA
TTACTTGGAG CATTAGACAT TCATGTCCAA AGTGTTCCTA GATTTATACT ATTTACGATG
ATCACAAGCA TTACGTTTAT TGCGTTAATT CAATTCCTAG TGACGACGTT GGAAGATCCG
GGGCGTTTTA TTGCGATTGT TGTATTAATT TTGCAACTTA CAACAAGTGC TGGAACGTTC
CCGCTTGAAT TGATTCCAGA CGTATTACAA TACTTCAATG CTTGGTTGCC GATGACATAT
TCGGTGCTTG GATTTAAAGC GGTTATTTCT AGTGGAGATT TTGCGTTTAT GTGGCATAAC
GCAGCTGTTT TAGCCTTGTT TATTGCCATT TTCATGGTAG GAACAATTTT ATACTTTATT
GCACAGCATA AACGTCAGTT TGATAAACAG ATGAACGAAG CTTCCGAAGC GTAA
 
Protein sequence
MRGLSLLWKE AHAIFRNRKT LISIIAVICI PILYSGMFLW AFWDPYAHLD KLPVAVVNND 
KGAAMNGEKL EIGDKLVEKL KENKKFDWHF VSEKEAEKGL QHQKYYMAIE IPEDFSENAT
TLQDEHPKPM KLIYKPNEGF NFLSAQIGDS AVEKIKEEVS NTVTETYAEA MFENIREMAK
GLDRASEGAK QLHDGIQKAN DGGVSLQRGL HSAKEGSGKL AKGAHAAKNG ANELYKNLKL
LAEKSLTFEN GLQSASEGAN QLNAGLYQLQ NGFTKMQDGH SRLLAGAKQV ENGAKQLSGG
LHQSLDGMQQ MKGNIPSLTQ GSQNLQNGAQ KLSASMEQWK QGAEKTNKGA VQVSQGLEKA
VAQLDALAAQ ATDPKEKALL QTLKEQLLPL SEGSKQVAQG MEQLSNSASQ LKTGADQLAA
GASRLHNGQL ALSEGVEKLL AGQQRLASGA DALVAGQSKV VQGLTVFGKK LQEGKNGVDQ
LVAGSDRLSS GLHQLAQGSN KLKDGAHQLA DGSGKLADGM NDLENGTVSL SNGMNQLANG
FDQLVSGMKK LEDGSNELAD KLADGAKKAS DVKANDDTYK MFADPVKKKN EKMNHVPNYG
TGFTPYFLSL GLFVGALVLS IVFPLREPAD VPKSGFSWFI GKFGILVIVG IIQSLLADAL
LLGALDIHVQ SVPRFILFTM ITSITFIALI QFLVTTLEDP GRFIAIVVLI LQLTTSAGTF
PLELIPDVLQ YFNAWLPMTY SVLGFKAVIS SGDFAFMWHN AAVLALFIAI FMVGTILYFI
AQHKRQFDKQ MNEASEA
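
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout and translate the nucleotide sequence so the two listings can be compared. It is an illustration, not the database's own method: the helper names `clean_sequence` and `translate` are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for non-initiator codons. Only the first row of the gene sequence is used here to keep the example short.

```python
# Codon table in the conventional T, C, A, G ordering.
# Assumption: translation table 11 uses the same amino-acid assignments
# as the standard code; it differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence as displayed in the record.
first_row = "ATGCGAGGAC TCTCATTATT ATGGAAAGAG GCGCATGCGA TTTTTCGTAA TCGAAAGACA"
peptide = translate(clean_sequence(first_row))
print(peptide)  # prints "MRGLSLLWKEAHAIFRNRKT"
```

The 20 residues produced from the first 60 nucleotides match the first two blocks of the protein sequence (MRGLSLLWKE AHAIFRNRKT). Passing the full gene sequence through the same two functions would allow the whole listing to be checked.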