Gene EcSMS35_3246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3246 
SymbolgspE 
ID6145654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3316624 
End bp3318117 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content58% 
IMG OID641618076 
Productgeneral secretory pathway protein GspE 
Protein accessionYP_001745226 
Protein GI170683342 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02533] general secretory pathway protein E 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.827332 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCCTG TAGCACAGGA AACCACCGCT AACACCGTGC GTCTGCCCTA CAGTTTCAGC 
CGTCGGTTTA GCCTGGTGGC ATGGTGCGAA GCGTCGCTGG AGATCCTCCA TGTTCATCCG
TTGTCGCTCT CTGTTTTGCA GGAGCTACAA CGGGGGCTGA ACGCGCCCTT TACGCTGCGG
CAAATCGACG AGGCCGAATT TGAACAGCGG CTGAATGCGG TCTGGCAGCG GGACTCTTCC
GAAGCTCGCC AGCTGATGGA AGATCTCGGT TCCGCCGAGG ACTTTTTTAC CCTCGCTGAA
GAACTGCCGG AAACGGAAGA TCTGCTGGAA AGTGACGACG ATGCGCCGAT CATCAAACTG
ATCAACGCCA TGCTGGCAGA GGCAATCAAA GAAGGCGCTT CGGATATCCA CATCGAGACG
TTTGAAAAGA GTCTGGTGAT CCGTTTTCGT GTTGACGGCA CATTACATGA AATGTTGCGC
CCCGGTCGTA AACTGGCCTC GCTGCTGGTC TCGCGTATCA AGGTGATGGC GCGGCTGGAT
ATCGCCGAAA AGCGCGTACC GCAGGATGGC CGTATTGCGC TGCTGCTGGG CGGTCGGGCG
ATTGACGTCC GTGTATCTAC CATGCCTTCC GCCTGGGGGG AACGGGTGGT GCTGCGACTG
CTGGACAAAA ACCAGGCCCG CCTGACGCTG GAGCGTCTGG GGCTTAGCCA GCAACTGACC
GCGCAGTTGC GCCAGCTGTT ACACAAACCG CACGGCATCT TTCTGGTGAC GGGGCCGACG
GGTTCCGGCA AAAGCACCAC GCTGTACGCT GGATTGCAGG AGCTGAACAA CCACTCGCGT
AACATTCTCA CGGTTGAAGA CCCTATCGAA TACATGATTG AAGGGATCGG TCAGACGCAG
GTTAACACCC GCGTCGGCAT GACCTTCGCC CGTGGCCTGC GCGCGATTTT GCGTCAGGAC
CCGGATGTGG TGATGGTCGG TGAAATCCGC GATACCGAAA CCGCAGAAAT CGCTGTTCAG
GCTTCACTGA CCGGACACCT GGTACTTTCC ACCCTGCATA CCAACACAGC GGTGGGGGCG
ATCACGCGTT TGCAGGATAT GGGCGTGGAG CCTTTCCTGC TCTCTTCCAG TTTGACGGGC
GTGATGGCGC AGCGACTGGT TCGCACGCTG TGTCCCGATT GCCGCCAGTC CGCGCCTGCC
ACCAACGAAG AAAAACGCCT GCTGGGGATT ACCGATGCGC ATGCCGTCAC GCTGTACCAT
CCGCAGGGCT GCCCCGCCTG TAATCACAAA GGTTTTCGCG GACGTACTGC CATCCATGAG
CTGATTGTGG TGGACGCCAC ATTGCGTGAT TTGATCCACC GTCAGGCCGG GGAACTGGAG
CTGGAACGTT ATGTCCGGCA ACACTCTGCG GGTATCCGCA GCAACGGCAT TGAGAAAGTG
CTCGCCGGAG AAACCTCTCT CGATGAAGTT CTGCGGGTAA CCATGGAGGC GTAA
 
Protein sequence
MVPVAQETTA NTVRLPYSFS RRFSLVAWCE ASLEILHVHP LSLSVLQELQ RGLNAPFTLR 
QIDEAEFEQR LNAVWQRDSS EARQLMEDLG SAEDFFTLAE ELPETEDLLE SDDDAPIIKL
INAMLAEAIK EGASDIHIET FEKSLVIRFR VDGTLHEMLR PGRKLASLLV SRIKVMARLD
IAEKRVPQDG RIALLLGGRA IDVRVSTMPS AWGERVVLRL LDKNQARLTL ERLGLSQQLT
AQLRQLLHKP HGIFLVTGPT GSGKSTTLYA GLQELNNHSR NILTVEDPIE YMIEGIGQTQ
VNTRVGMTFA RGLRAILRQD PDVVMVGEIR DTETAEIAVQ ASLTGHLVLS TLHTNTAVGA
ITRLQDMGVE PFLLSSSLTG VMAQRLVRTL CPDCRQSAPA TNEEKRLLGI TDAHAVTLYH
PQGCPACNHK GFRGRTAIHE LIVVDATLRD LIHRQAGELE LERYVRQHSA GIRSNGIEKV
LAGETSLDEV LRVTMEA