Gene EcHS_A3520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3520 
SymbolgspE1 
ID5593287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3502682 
End bp3504163 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content53% 
IMG OID640922637 
Productgeneral secretory pathway protein E 
Protein accessionYP_001460118 
Protein GI157162800 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02533] general secretory pathway protein E 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value0.189411 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATTC ACTCACCGTA CCCCGCCAGT TGGGCGCTGG CACAACGAAT TGGTTATCTC 
TATTCAGAGG GCGAGATTAT TTATCTCGCC GATACGCCAT TCGAGCGGTT ACTCGATATT
CAACGTCAGG TTGGTCAGTG CCAGACCATG ACCAGTTTGT CACAGGCTGA TTTCGAAGCC
CGGCTGGAAG CGGTGTTCCA TCAGAATACC GGTGAGTCGC AACAGATTGC GCAGGATATC
GATCAATCCG TCGATCTTCT CTCGCTTTCG GAAGAGATGC CCGCAAATGA AGATCTCCTG
AATGAAGATT CAGCGGCACC GGTTATCCGC TTGATCAATG CGATTTTGAG TGAGGCCATC
AAAGAAACCG CCTCTGATAT CCACATTGAA ACCTATGAAA AAACAATGTC GATCCGTTTT
CGCATCGACG GCGTTTTGCG GACAATTTTA CAGCCAAACA AAAAACTGGC GGCACTGCTT
ATCTCCCGAA TTAAGGTCAT GGCTCGTCTT GATATCGCCG AAAAACGTAT TCCACAGGAT
GGAAGAATTA GTTTGCGTAT CGGGCGACGT AACATAGATG TCCGCGTATC CACACTGCCG
TCCATCTATG GTGAACGCGC CGTACTCCGC CTGCTGGATA AAAACAGCCT CCAGCTTTCA
TTGAACAACC TGGGGATGAC GGCAGCGGAT AAGCAGGATT TAGAAAATCT CATTCAGCTT
CCGCACGGTA TTATCCTGGT GACAGGGCCG ACAGGCTCCG GTAAAAGCAC CACGCTCTAC
GCCATCCTTT CGGCGCTGAA TACTCCCGGC CGCAATATTC TGACGGTAGA AGATCCCGTG
GAATATGAGC TGGAAGGCAT TGGGCAAACG CAGGTGAATA CCCGTGTGGA TATGTCTTTC
GCTCGCGGCC TGCGCGCCAT ACTTCGCCAG GACCCGGATG TCGTCATGGT GGGGGAAATT
CGTGATACAG AAACCGCGCA GATTGCGGTT CAGGCCTCGC TCACCGGCCA TCTGGTACTC
TCAACACTCC ACACTAACAG TGCATCAGGC GCAGTGACCC GGCTCCGCGA CATGGGCGTC
GAATCATTCC TGCTTTCGTC TTCCCTGGCA GGGATTATCG CGCAACGTCT GGTTCGTCGC
CTGTGTCCGC AATGCCGACA ATTCACGCCC GTATCACCCC AACAAGCGCA GATGTTTAAA
TATCATCAGC TCGCGGTGAC AACAATTGGC ACTCCCGTAG GCTGCCCTCA TTGCCATCAA
TCCGGCTATC AGGGGCGCAT GGCGATCCAC GAAATGATGG TGGTGACGCC GGAATTACGG
GCCGCTATTC ATGAAAATGT GGATGAACAA GCACTGGAGC GACTAGTCCG GCAACAACAC
AAGGCCTTAA TCAAAAATGG CCTGCAAAAA GTGATAAGCG GTGACACCTC CTGGGATGAG
GTTATGCGCG TCGCCAGTGC CACGCTGGAG AGCGAAGCAT GA
 
Protein sequence
MRIHSPYPAS WALAQRIGYL YSEGEIIYLA DTPFERLLDI QRQVGQCQTM TSLSQADFEA 
RLEAVFHQNT GESQQIAQDI DQSVDLLSLS EEMPANEDLL NEDSAAPVIR LINAILSEAI
KETASDIHIE TYEKTMSIRF RIDGVLRTIL QPNKKLAALL ISRIKVMARL DIAEKRIPQD
GRISLRIGRR NIDVRVSTLP SIYGERAVLR LLDKNSLQLS LNNLGMTAAD KQDLENLIQL
PHGIILVTGP TGSGKSTTLY AILSALNTPG RNILTVEDPV EYELEGIGQT QVNTRVDMSF
ARGLRAILRQ DPDVVMVGEI RDTETAQIAV QASLTGHLVL STLHTNSASG AVTRLRDMGV
ESFLLSSSLA GIIAQRLVRR LCPQCRQFTP VSPQQAQMFK YHQLAVTTIG TPVGCPHCHQ
SGYQGRMAIH EMMVVTPELR AAIHENVDEQ ALERLVRQQH KALIKNGLQK VISGDTSWDE
VMRVASATLE SEA