Gene EcHS_A3137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3137 
SymbolgspE2 
ID5593799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3145428 
End bp3146921 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content59% 
IMG OID640922256 
Productgeneral secretory pathway protein E 
Protein accessionYP_001459755 
Protein GI157162437 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02533] general secretory pathway protein E 


Plasmid Coverage information

Num covering plasmid clones74 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGCCTG TAGCACAGGA AACCACCGCC AACACCGTGC GTCTGCCCTA CAGTTTCAGC 
CGTCGGTTTA GCCTGGTGGC ATGGTGCGAA GCGTCGCTGG AGATCCTCCA CGTTCATCCG
CTATCGCTCT CTGTTTTGCA GGAGCTGCAG CGGGGGCTGA ACGCGCCCTT TACGCTGCGG
CAAATCGACG AGGCCGAATT TGAACAGCGG CTGAATGCGG TCTGGCAGCG GGACTCTTCC
GAGGCTCGCC AGCTGATGGA AGATCTCGGT TCTGCCGAGG ACTTTTTTAC CCTCGCGGAA
GAACTGCCGG AAACGGAAGA GCTGCTGGAA AGCGACGACG ATGCGCCGAT CATCAAACTG
ATCAACGCCA TGCTGGCAGA GGCAATCAAA GAAGGCGCTT CGGATATCCA CATTGAGACG
TTTGAAAAGA GTCTGGTGAT CCGTTTTCGT GTTGACGGCA CATTACATGA AATGCTGCGT
CCGGGGCGCA AACTGGCCTC GCTGCTGGTG TCGCGTATCA AGGTGATGGC GCGGCTGGAC
ATTGCCGAAA AGCGCGTGCC GCAGGATGGA CGTATTGCGC TGTTGCTGGG CGGCCGGGCG
ATTGACGTGC GTGTCTCCAC CATGCCTTCT GCCTGGGGCG AGCGCGTGGT GCTGCGACTG
CTGGACAAAA ACCAGGCCCG CCTGACGCTG GAGCGTCTGG GGCTTAGCCA GCAACTGACC
GCGCAGTTGC GCCAGCTGTT ACACAAACCG CACGGCATCT TTCTGGTGAC GGGGCCGACG
GGTTCCGGCA AAAGCACCAC GCTGTACGCC GGATTGCAGG AGCTGAACAA CCATTCGCGC
AACATTCTCA CGGTTGAAGA TCCCATCGAA TACATGATTG AAGGGATCGG TCAGACGCAG
GTTAACACCC GCGTCGGCAT GACCTTTGCC CGTGGGCTGC GCGCGATTTT GCGTCAGGAC
CCGGATGTGG TGATGGTCGG TGAAATCCGC GATACCGAAA CCGCAGAAAT CGCCGTCCAG
GCTTCACTTA CCGGACACCT GGTCCTTTCC ACGCTGCATA CCAACACAGC GGTGGGGGCG
ATCACACGTT TGCAGGATAT GGGCGTGGAG CCTTTCCTGC TCTCTTCCAG TCTGACGGGC
GTGATGGCGC AGCGACTGGT TCGCACGCTG TGTCCCGACT GCCGCCAGCC CGCACCAGCC
ACTGACGAAG AAAAACGCCT GCTGGGAATT ACCGACGCCC GTACCGTCAC TCTGTACCAT
CCACAGGGCT GTCCCGCCTG TAATCACAAA GGTTTTCGCG GACGGACTGC CATCCATGAG
CTGATCGTGG TGGATGCCAC ATTGCGTGAT TTGATCCACC GTCAGGCCGG GGAGCTGGAG
CTGGAACGTT ATGTCCGACA ACACTCTGCG GGTATCCGCA GCAACGGCAT TGAGAAAGTG
CTCGCCGGAG AAACCTCTCT CGATGAAGTT CTGCGGGTAA CCATGGAGGC GTAA
 
Protein sequence
MVPVAQETTA NTVRLPYSFS RRFSLVAWCE ASLEILHVHP LSLSVLQELQ RGLNAPFTLR 
QIDEAEFEQR LNAVWQRDSS EARQLMEDLG SAEDFFTLAE ELPETEELLE SDDDAPIIKL
INAMLAEAIK EGASDIHIET FEKSLVIRFR VDGTLHEMLR PGRKLASLLV SRIKVMARLD
IAEKRVPQDG RIALLLGGRA IDVRVSTMPS AWGERVVLRL LDKNQARLTL ERLGLSQQLT
AQLRQLLHKP HGIFLVTGPT GSGKSTTLYA GLQELNNHSR NILTVEDPIE YMIEGIGQTQ
VNTRVGMTFA RGLRAILRQD PDVVMVGEIR DTETAEIAVQ ASLTGHLVLS TLHTNTAVGA
ITRLQDMGVE PFLLSSSLTG VMAQRLVRTL CPDCRQPAPA TDEEKRLLGI TDARTVTLYH
PQGCPACNHK GFRGRTAIHE LIVVDATLRD LIHRQAGELE LERYVRQHSA GIRSNGIEKV
LAGETSLDEV LRVTMEA