Gene Information |
Locus tag | EcHS_A3137 |
Symbol | gspE2 |
ID | 5593799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3145428 |
End bp | 3146921 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640922256 |
Product | general secretory pathway protein E |
Protein accession | YP_001459755 |
Protein GI | 157162437 |
COG category | [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | [TIGR02533] general secretory pathway protein E |
|
|
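The coordinate and length fields in the table above are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming 1-based inclusive coordinates (the record does not state the convention, but this one reproduces the listed gene length) and a single terminal stop codon on an unspliced CDS. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(3145428, 3146921)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1494 497
```

Both values match the "Gene Length" and "Protein Length" fields of the record.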
Plasmid Coverage information |
Num covering plasmid clones | 74 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGCCTG TAGCACAGGA AACCACCGCC AACACCGTGC GTCTGCCCTA CAGTTTCAGC CGTCGGTTTA GCCTGGTGGC ATGGTGCGAA GCGTCGCTGG AGATCCTCCA CGTTCATCCG CTATCGCTCT CTGTTTTGCA GGAGCTGCAG CGGGGGCTGA ACGCGCCCTT TACGCTGCGG CAAATCGACG AGGCCGAATT TGAACAGCGG CTGAATGCGG TCTGGCAGCG GGACTCTTCC GAGGCTCGCC AGCTGATGGA AGATCTCGGT TCTGCCGAGG ACTTTTTTAC CCTCGCGGAA GAACTGCCGG AAACGGAAGA GCTGCTGGAA AGCGACGACG ATGCGCCGAT CATCAAACTG ATCAACGCCA TGCTGGCAGA GGCAATCAAA GAAGGCGCTT CGGATATCCA CATTGAGACG TTTGAAAAGA GTCTGGTGAT CCGTTTTCGT GTTGACGGCA CATTACATGA AATGCTGCGT CCGGGGCGCA AACTGGCCTC GCTGCTGGTG TCGCGTATCA AGGTGATGGC GCGGCTGGAC ATTGCCGAAA AGCGCGTGCC GCAGGATGGA CGTATTGCGC TGTTGCTGGG CGGCCGGGCG ATTGACGTGC GTGTCTCCAC CATGCCTTCT GCCTGGGGCG AGCGCGTGGT GCTGCGACTG CTGGACAAAA ACCAGGCCCG CCTGACGCTG GAGCGTCTGG GGCTTAGCCA GCAACTGACC GCGCAGTTGC GCCAGCTGTT ACACAAACCG CACGGCATCT TTCTGGTGAC GGGGCCGACG GGTTCCGGCA AAAGCACCAC GCTGTACGCC GGATTGCAGG AGCTGAACAA CCATTCGCGC AACATTCTCA CGGTTGAAGA TCCCATCGAA TACATGATTG AAGGGATCGG TCAGACGCAG GTTAACACCC GCGTCGGCAT GACCTTTGCC CGTGGGCTGC GCGCGATTTT GCGTCAGGAC CCGGATGTGG TGATGGTCGG TGAAATCCGC GATACCGAAA CCGCAGAAAT CGCCGTCCAG GCTTCACTTA CCGGACACCT GGTCCTTTCC ACGCTGCATA CCAACACAGC GGTGGGGGCG ATCACACGTT TGCAGGATAT GGGCGTGGAG CCTTTCCTGC TCTCTTCCAG TCTGACGGGC GTGATGGCGC AGCGACTGGT TCGCACGCTG TGTCCCGACT GCCGCCAGCC CGCACCAGCC ACTGACGAAG AAAAACGCCT GCTGGGAATT ACCGACGCCC GTACCGTCAC TCTGTACCAT CCACAGGGCT GTCCCGCCTG TAATCACAAA GGTTTTCGCG GACGGACTGC CATCCATGAG CTGATCGTGG TGGATGCCAC ATTGCGTGAT TTGATCCACC GTCAGGCCGG GGAGCTGGAG CTGGAACGTT ATGTCCGACA ACACTCTGCG GGTATCCGCA GCAACGGCAT TGAGAAAGTG CTCGCCGGAG AAACCTCTCT CGATGAAGTT CTGCGGGTAA CCATGGAGGC GTAA
|
Protein sequence | MVPVAQETTA NTVRLPYSFS RRFSLVAWCE ASLEILHVHP LSLSVLQELQ RGLNAPFTLR QIDEAEFEQR LNAVWQRDSS EARQLMEDLG SAEDFFTLAE ELPETEELLE SDDDAPIIKL INAMLAEAIK EGASDIHIET FEKSLVIRFR VDGTLHEMLR PGRKLASLLV SRIKVMARLD IAEKRVPQDG RIALLLGGRA IDVRVSTMPS AWGERVVLRL LDKNQARLTL ERLGLSQQLT AQLRQLLHKP HGIFLVTGPT GSGKSTTLYA GLQELNNHSR NILTVEDPIE YMIEGIGQTQ VNTRVGMTFA RGLRAILRQD PDVVMVGEIR DTETAEIAVQ ASLTGHLVLS TLHTNTAVGA ITRLQDMGVE PFLLSSSLTG VMAQRLVRTL CPDCRQPAPA TDEEKRLLGI TDARTVTLYH PQGCPACNHK GFRGRTAIHE LIVVDATLRD LIHRQAGELE LERYVRQHSA GIRSNGIEKV LAGETSLDEV LRVTMEA
|
| |
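The sequences above are printed in blocks of ten characters, so the spaces must be removed before any analysis. The sketch below, using only the Python standard library, strips the spacing, computes GC content, and translates a nucleotide string. It rests on stated assumptions: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, the gene sequence is taken as already given in coding orientation (it begins with ATG and ends with TAA, so no reverse complement is applied despite the minus strand), and the codon assignments are those of the standard code, which translation table 11 shares (table 11 differs in the start codons it permits, not in amino acid assignments).

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing and newlines; normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three ten-base blocks of the gene sequence in this record.
prefix = "ATGGTGCCTG TAGCACAGGA AACCACCGCC"
print(translate(prefix))   # MVPVAQETTA
print(gc_percent(prefix))  # 60.0
```

The translated prefix matches the first ten residues of the protein sequence listed above. The 60% figure applies only to this 30-base prefix; passing the full gene sequence to `gc_percent` would give the whole-gene value to compare against the "GC content" field.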