Gene GWCH70_2562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2562 
Symbol 
ID7976326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2587185 
End bp2588852 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content45% 
IMG OID644799363 
Producttype II secretion system protein E 
Protein accessionYP_002950523 
Protein GI239827899 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02533] general secretory pathway protein E
[TIGR02538] type IV-A pilus assembly ATPase PilB 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAA AACAAGAGCG GAAACGTTTA GGAGATTTAT TAGTGGAAGC GGGGCTCATT 
ACGGAGGAAC AGCTGGAGGA AGCGTTAAAA GAAAAAGCTC CCGGCCAAAA GCTGGGCGAT
GCGCTCTTGC AGCGTGGGTA TATTACGGAA CAGCAATTAA TTGAAGTGCT TGAATTTCAG
TTAGGCATCC CGCATGTCAG TTTATATCGC TATCCGATCG ATCCAAAGGT GACAAATCTC
ATTTCAAAAG AATTTGCCAA GCGGCATATG GTGATGCCTT TAAAAATTGA GGGAGAACGC
TTGCTTGTGG CAATGGCTGA TCCGATGGAC TTTTTTGTCA TTGATGATTT GCGCCTTTCG
ACAGGGTTCC ATATTGAAAC GGCGATTGCC TCGAAAGATG ATATTTTGCG CGCCATTAAT
AAGTATTACG ACATTGATGA ATCGGTAGAA GATTTTTTGC AAATGGCTCC CGCAACGGAA
ACGGTCGAAG AGGAACGAAT AACCGAAGAG GATTCTCCGA TTGTCTGGCT TGTGAACCAA
ATTTTGCAGC TCGCCGTTGA ACAGCGGGCA AGCGATGTCC ATATTGATCC ACAGGAAACG
AAAGTGCTTG TCCGTTATCG CATCGACGGT ATATTGCGGA CAGACCGCGC GCTTCCAAAA
CATATGCAAA GCATGCTGAC AGCTAGAATT AAAATTTTAG CGAATATGGA TATTACCGAA
CATCGCATTC CGCAAGATGG GCGGATTAAA ATGAACATTG ACTTCCATCC GGTCGATTTG
CGCGTTTCCA CATTGCCGAC GGTTTACGGT GAAAAAATTG TGATGCGCAT CCTTGATTTA
GGGGCGGCAT TAAATGATAT TCATAAGCTC GGATTTAATC AATTGAATTT ACAGCGGTTT
ATTGAACTGA TTGAGAGACC AACCGGAATT GTGCTGATCA CTGGCCCGAC TGGGGCGGGG
AAATCATCAA CGCTATATGC GGCGCTAAAC CATTTAAACA GCGAAGAAGT AAATATTATT
ACGATCGAAG ACCCGGTCGA ATATGAAATT GAAGGCGTCA ATCAAATTCA AGTCAATCCA
AATGTCGGAT TGACGTTTGC GCAAGGATTG CGCTCCATTT TGCGGCAAGA TCCAAACATT
ATTATGGTCG GAGAAATCCG CGACCGTGAA ACGGCAGAAG TTGCGATTCG CGCGTCATTA
ACTGGTCATT TGGTGTTAAG TACTCTTCAT ACAAACGACG CGCTAAGCAC GATCACGCGC
TTGATTGATA TGGGAATTGA GCCGTTTCTT GTGGCCGCCT CTTTAGCCGG CGTTGTTTCC
CAGCGGCTCG TCCGCCGCGT CTGCCGCGAT TGTCAAGAAG AGCAGGAGCC GACAAAGAGG
GAAATCGAAA TTTTTGCCAG CCGCGGCATG AAAATCGATA AGCTCGTTCG CGGCCGCGGC
TGCCCAACAT GCAATATGAC AGGTTATAAA GGACGAATCG CCATTCATGA ACTGCTTGTG
ATGACCGATG AGATGCGCCG CGTGATTTTA AATAAAGAGC CATTTTCGAA ATTGCGCGAG
CTTGCCATTA AAAACCGAAT GATTTTTTTG ATTGATGACG GATTATTAAA AGTAAAACAA
GGGCTAACGA CGCTAGAAGA GGTATTGAAA GTGGCGATTT TAAGTTAA
 
Protein sequence
MKKKQERKRL GDLLVEAGLI TEEQLEEALK EKAPGQKLGD ALLQRGYITE QQLIEVLEFQ 
LGIPHVSLYR YPIDPKVTNL ISKEFAKRHM VMPLKIEGER LLVAMADPMD FFVIDDLRLS
TGFHIETAIA SKDDILRAIN KYYDIDESVE DFLQMAPATE TVEEERITEE DSPIVWLVNQ
ILQLAVEQRA SDVHIDPQET KVLVRYRIDG ILRTDRALPK HMQSMLTARI KILANMDITE
HRIPQDGRIK MNIDFHPVDL RVSTLPTVYG EKIVMRILDL GAALNDIHKL GFNQLNLQRF
IELIERPTGI VLITGPTGAG KSSTLYAALN HLNSEEVNII TIEDPVEYEI EGVNQIQVNP
NVGLTFAQGL RSILRQDPNI IMVGEIRDRE TAEVAIRASL TGHLVLSTLH TNDALSTITR
LIDMGIEPFL VAASLAGVVS QRLVRRVCRD CQEEQEPTKR EIEIFASRGM KIDKLVRGRG
CPTCNMTGYK GRIAIHELLV MTDEMRRVIL NKEPFSKLRE LAIKNRMIFL IDDGLLKVKQ
GLTTLEEVLK VAILS