Gene ECH74115_1189 details


Gene Information

Locus tag: ECH74115_1189
Symbol:
ID: 6969939
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1198642
End bp: 1200147
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 60%
IMG OID: 643385186
Product: head-tail preconnector protein GP5
Protein accession: YP_002269682
Protein GI: 209398436
COG category: [O] Posttranslational modification, protein turnover, chaperones
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0616] Periplasmic serine proteases (ClpP class)
TIGRFAM ID: [TIGR00706] signal peptide peptidase SppA, 36K type
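
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: the variable names are made up for this example, and it assumes the start and end coordinates are 1-based and inclusive and that the CDS length includes the stop codon.

```python
# Values copied from the Gene Information fields above.
start_bp = 1198642
end_bp = 1200147
gene_length_bp = 1506
protein_length_aa = 501

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
# Assumption: the final codon is the stop codon and is not counted in the protein.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # 1506 0 501
```

Under those assumptions the three fields agree with one another: 1506 bp is 502 codons, one of which is the stop codon.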


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGACGTA ATCTTTCACA CATTATTGCC GCAGCATTCA ATGAACCGCT GCTTCTGGAG 
CCCGCCTATG CGCGGGTTTT CTTTTGCGCG CTCGGGCGCG AGATGGGGGC AGCAAGTCTT
TCGGTACCAC AGCAGCAGGT ACAGCTTGAT GCTCCCGGAA TGCTGGCTGA AACGGACGAG
TACATGGCCG GAGGTAAACG ACCGGCCCGT GTTTACCGGG TGGTGAACGG TATTGCTGTA
CTGCCGGTGA CCGGCACGCT GGTGCACCGG CTGGGGGGTA TGCGGCCATT TTCCGGAATG
ACAGGCTATG ACGGCATTGT CGCCTGTCTT CAGCAGGCAA TGGCGGATAG CCAGGTGCGG
GGCGTACTGC TGGACATTGA CAGTCCGGGC GGGCAGGCCG CCGGCGCGTT TGACTGCGCT
GACATGATTT ACCGCCTCCG TCAGCAGAAG CCGGTCTGGG CACTGTGCAA TGACACGGCC
TGTTCTGCAG CCATGCTGCT GGCGTCGGCC TGCTCCCGAC GGCTGGTTAC CCAGACATCC
CGTATCGGCT CCATTGGCGT GATGATGAGC CATGTCAGCT ATGCCGGTCA TCTGGCGCAG
GCCGGTGTGG ATATCACGCT GATTTACTCA GGGGCGCACA AGGTGGATGG CAATCAGTTT
GAAGCCTTAC CGGCAGAGGT TCGCCAGAAC ATGCAGCAGC GCATTGATGC GGCGCGCCGG
ATGTTTGCCG AAAAAGTGGC CATGTTTACC GGTCTGTCTG TTGATGCCGT CACGGGAACA
GAGGCCGCCG TTTTTGAAGG TCAGTCCGGC ATTGATGCCG GGCTGGCGGA TGAATTAGTC
AATGCGTCGG ATGCCATCAG TGTGATGGCC ACGGCGCTGA ACAGTAATGT CAGAGGAGGC
ACTATGCCGC AATTAACTGC AACGGAAGCC GCCGCGCAGG AGAACCAGCG AGTGATGGGG
ATCCTGACAT GCCAGGAAGC GAAAGGACGT GAACAGCTTG CCACGATGCT GGCAGGACAA
CAGGGCATGA GCGTTGAACA GGCCCGGGCG ATTCTGGCCG CGGCGGCACC GCAGCAGCCG
GTGGCATCCA CGCAGAGTGA AGCCGATCGC ATTATGGCGT GTGAAGAAGC GAACGGTCGT
GAACAACTGG CGGCAACGCT GGCGGCGATG CCGGAGATGA CGGTGGAAAA AGCCCGCCCG
ATCCTGGCTG CTTCACCGCA GGCGGATGCC GGACCCTCAC TCCGTGATCA GATCATGGCA
CTGGATGAGG CAAAAGGGGC TGAGGCGCAG GCTGAACAGC TGGCTGCCTG CCCGGGAATG
ACTGTGGAGA GCGCCCGGGC TGTGCTGGCT GCGGGATCAG GTAAGGCAGA ACCGGTCTCT
GCATCCACAA CCGCCCTGTT TGAACGCATC ATGGCGAACC ATTCACCGGC TGCGGTACAG
GGTGGCGTGC CACAGACGTC AGCAGACGGT GATGCGGACG TGAAAATGCT CATGGCCATG
CCATGA
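
The gene sequence is printed in blocks of ten bases, so it has to be joined before any computation. The following is a minimal sketch of how the blocks could be joined and a GC percentage computed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database rounds its reported GC content is not stated in this record, so the result may differ slightly from the figure shown above.

```python
def clean_sequence(blocked_text):
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked_text.split()).upper()


def gc_percent(sequence):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First printed line of the gene sequence, used here only as a short demonstration.
first_line = "GTGAGACGTA ATCTTTCACA CATTATTGCC GCAGCATTCA ATGAACCGCT GCTTCTGGAG"
seq = clean_sequence(first_line)
print(len(seq), seq[:3])  # 60 GTG
```

Passing the full listing through `clean_sequence` and then `gc_percent` would give the GC content of the whole gene.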
 
Protein sequence
MRRNLSHIIA AAFNEPLLLE PAYARVFFCA LGREMGAASL SVPQQQVQLD APGMLAETDE 
YMAGGKRPAR VYRVVNGIAV LPVTGTLVHR LGGMRPFSGM TGYDGIVACL QQAMADSQVR
GVLLDIDSPG GQAAGAFDCA DMIYRLRQQK PVWALCNDTA CSAAMLLASA CSRRLVTQTS
RIGSIGVMMS HVSYAGHLAQ AGVDITLIYS GAHKVDGNQF EALPAEVRQN MQQRIDAARR
MFAEKVAMFT GLSVDAVTGT EAAVFEGQSG IDAGLADELV NASDAISVMA TALNSNVRGG
TMPQLTATEA AAQENQRVMG ILTCQEAKGR EQLATMLAGQ QGMSVEQARA ILAAAAPQQP
VASTQSEADR IMACEEANGR EQLAATLAAM PEMTVEKARP ILAASPQADA GPSLRDQIMA
LDEAKGAEAQ AEQLAACPGM TVESARAVLA AGSGKAEPVS ASTTALFERI MANHSPAAVQ
GGVPQTSADG DADVKMLMAM P
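
The gene sequence starts with GTG while the protein starts with M, which fits translation table 11 treating GTG as an alternative start codon. Below is a hedged sketch of that translation step. The codon table and start-codon set are transcribed from memory of NCBI translation table 11 and should be checked against the official table, and the function name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon order T, C, A, G with the third base varying fastest (NCBI layout).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: start codons listed for table 11; each is read as M when initiating.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(cds):
    """Translate a CDS, reading an initiating start codon as M and stopping at '*'."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for index in range(0, len(cds), 3):
        codon = cds[index:index + 3]
        aa = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAGACGTA ATCTTTCACA CATTATTGCC"))  # MRRNLSHIIA
```

The ten residues produced match the start of the protein sequence listed above.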