Gene Information |
Locus tag | GWCH70_2562 |
Symbol | |
ID | 7976326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2587185 |
End bp | 2588852 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644799363 |
Product | type II secretion system protein E |
Protein accession | YP_002950523 |
Protein GI | 239827899 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | [TIGR02533] general secretory pathway protein E [TIGR02538] type IV-A pilus assembly ATPase PilB |
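
The coordinate and length fields above are mutually consistent if the start and end positions are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the arithmetic, using hypothetical helper names that are not part of any database API:

```python
# Values copied from the record above.
START_BP = 2587185
END_BP = 2588852


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS; the stop codon encodes no residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under that assumption the sketch reproduces the listed 1668 bp gene length and, after dropping the stop codon, the listed 555 aa protein length.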
| |
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAA AACAAGAGCG GAAACGTTTA GGAGATTTAT TAGTGGAAGC GGGGCTCATT ACGGAGGAAC AGCTGGAGGA AGCGTTAAAA GAAAAAGCTC CCGGCCAAAA GCTGGGCGAT GCGCTCTTGC AGCGTGGGTA TATTACGGAA CAGCAATTAA TTGAAGTGCT TGAATTTCAG TTAGGCATCC CGCATGTCAG TTTATATCGC TATCCGATCG ATCCAAAGGT GACAAATCTC ATTTCAAAAG AATTTGCCAA GCGGCATATG GTGATGCCTT TAAAAATTGA GGGAGAACGC TTGCTTGTGG CAATGGCTGA TCCGATGGAC TTTTTTGTCA TTGATGATTT GCGCCTTTCG ACAGGGTTCC ATATTGAAAC GGCGATTGCC TCGAAAGATG ATATTTTGCG CGCCATTAAT AAGTATTACG ACATTGATGA ATCGGTAGAA GATTTTTTGC AAATGGCTCC CGCAACGGAA ACGGTCGAAG AGGAACGAAT AACCGAAGAG GATTCTCCGA TTGTCTGGCT TGTGAACCAA ATTTTGCAGC TCGCCGTTGA ACAGCGGGCA AGCGATGTCC ATATTGATCC ACAGGAAACG AAAGTGCTTG TCCGTTATCG CATCGACGGT ATATTGCGGA CAGACCGCGC GCTTCCAAAA CATATGCAAA GCATGCTGAC AGCTAGAATT AAAATTTTAG CGAATATGGA TATTACCGAA CATCGCATTC CGCAAGATGG GCGGATTAAA ATGAACATTG ACTTCCATCC GGTCGATTTG CGCGTTTCCA CATTGCCGAC GGTTTACGGT GAAAAAATTG TGATGCGCAT CCTTGATTTA GGGGCGGCAT TAAATGATAT TCATAAGCTC GGATTTAATC AATTGAATTT ACAGCGGTTT ATTGAACTGA TTGAGAGACC AACCGGAATT GTGCTGATCA CTGGCCCGAC TGGGGCGGGG AAATCATCAA CGCTATATGC GGCGCTAAAC CATTTAAACA GCGAAGAAGT AAATATTATT ACGATCGAAG ACCCGGTCGA ATATGAAATT GAAGGCGTCA ATCAAATTCA AGTCAATCCA AATGTCGGAT TGACGTTTGC GCAAGGATTG CGCTCCATTT TGCGGCAAGA TCCAAACATT ATTATGGTCG GAGAAATCCG CGACCGTGAA ACGGCAGAAG TTGCGATTCG CGCGTCATTA ACTGGTCATT TGGTGTTAAG TACTCTTCAT ACAAACGACG CGCTAAGCAC GATCACGCGC TTGATTGATA TGGGAATTGA GCCGTTTCTT GTGGCCGCCT CTTTAGCCGG CGTTGTTTCC CAGCGGCTCG TCCGCCGCGT CTGCCGCGAT TGTCAAGAAG AGCAGGAGCC GACAAAGAGG GAAATCGAAA TTTTTGCCAG CCGCGGCATG AAAATCGATA AGCTCGTTCG CGGCCGCGGC TGCCCAACAT GCAATATGAC AGGTTATAAA GGACGAATCG CCATTCATGA ACTGCTTGTG ATGACCGATG AGATGCGCCG CGTGATTTTA AATAAAGAGC CATTTTCGAA ATTGCGCGAG CTTGCCATTA AAAACCGAAT GATTTTTTTG ATTGATGACG GATTATTAAA AGTAAAACAA GGGCTAACGA CGCTAGAAGA GGTATTGAAA GTGGCGATTT TAAGTTAA
|
Protein sequence | MKKKQERKRL GDLLVEAGLI TEEQLEEALK EKAPGQKLGD ALLQRGYITE QQLIEVLEFQ LGIPHVSLYR YPIDPKVTNL ISKEFAKRHM VMPLKIEGER LLVAMADPMD FFVIDDLRLS TGFHIETAIA SKDDILRAIN KYYDIDESVE DFLQMAPATE TVEEERITEE DSPIVWLVNQ ILQLAVEQRA SDVHIDPQET KVLVRYRIDG ILRTDRALPK HMQSMLTARI KILANMDITE HRIPQDGRIK MNIDFHPVDL RVSTLPTVYG EKIVMRILDL GAALNDIHKL GFNQLNLQRF IELIERPTGI VLITGPTGAG KSSTLYAALN HLNSEEVNII TIEDPVEYEI EGVNQIQVNP NVGLTFAQGL RSILRQDPNI IMVGEIRDRE TAEVAIRASL TGHLVLSTLH TNDALSTITR LIDMGIEPFL VAASLAGVVS QRLVRRVCRD CQEEQEPTKR EIEIFASRGM KIDKLVRGRG CPTCNMTGYK GRIAIHELLV MTDEMRRVIL NKEPFSKLRE LAIKNRMIFL IDDGLLKVKQ GLTTLEEVLK VAILS
|
| |
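
The sequences above are displayed in space-separated blocks of 10 characters. Although the gene lies on the minus strand, the gene sequence is shown in coding orientation: it begins with ATG and ends with the stop codon TAA. The following is a minimal sketch, in plain Python with hypothetical function names, of how one might strip the block spacing, compute GC content, and translate the opening codons. It assumes the sense-codon assignments of the standard genetic code, which translation table 11 shares; alternative start codons permitted by table 11 are not handled.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(display_text: str) -> str:
    """Remove the block spacing and line breaks used for display."""
    return "".join(display_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three display blocks of the gene sequence above.
    first_blocks = "ATGAAGAAAA AACAAGAGCG GAAACGTTTA"
    print(translate(clean_sequence(first_blocks)))  # MKKKQERKRL
```

The ten residues printed match the first block of the protein sequence above. Passing the full gene sequence to `gc_percent` is one way to compare against the listed GC content figure.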