Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2774 |
Symbol | |
ID | 6966746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2591565 |
End bp | 2593070 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643386629 |
Product | head-tail preconnector protein GP5 |
Protein accession | YP_002271108 |
Protein GI | 209395709 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.821156 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGACGTA ATCTTTCACA CATTATTGCC GCAGCATTCA ATGAACCGCT GCTTCTGGAG CCCGCCTATG CGCGGGTTTT CTTTTGCGCG CTCGGGCGCG AGATGGGGGC AGCAAGTCTT TCGGTACCAC AGCAGCAGGT ACAGCTTGAT GCTCCCGGAA TGCTGGCTGA AACGGACGAG TACATGGCCG GAGGTAAACG ACCGGCCCGT GTTTACCGGG TGGTGAACGG TATTGCTGTA CTGCCGGTGA CCGGCACGCT GGTGCACCGG CTGGGGGGTA TGCGGCCATT TTCCGGAATG ACAGGCTATG ACGGCATTGT CGCCTGTCTT CAGCAGGCAA TGGCGGATAG CCAGGTGCGG GGCGTACTGC TGGACATTGA CAGTCCGGGC GGGCAGGCCG CCGGCGCGTT TGACTGCGCT GACATGATTT ACCGCCTCCG TCAGCAGAAG CCGGTCTGGG CACTGTGCAA TGACACGGCC TGTTCTGCAG CCATGCTGCT GGCGTCGGCC TGCTCCCGAC GGCTGGTTAC CCAGACATCC CGTATCGGCT CCATTGGCGT GATGATGAGC CATGTCAGCT ATGCCGGTCA TCTGGCGCAG GCCGGTGTGG ATATCACGCT GATTTACTCA GGGGCGCACA AGGTGGATGG CAATCAGTTT GAAGCCTTAC CGGCAGAGGT TCGCCAGAAC ATGCAGCAGC GCATTGATGC GGCGCGCCGG ATGTTTGCCG AAAAAGTGGC CATGTTTACC GGTCTGTCTG TTGATGCCGT CACGGGAACA GAGGCCGCCG TTTTTGAAGG TCAGTCCGGC ATTGATGCCG GGCTGGCGGA TGAATTAGTC AATGCGTCGG ATGCCATCAG TGTGATGGCC ACGGCGCTGA ACAGTAATGT CAGAGGAGGC ACTATGCCGC AATTAACTGC AACGGAAGCC GCCGCGCAGG AGAACCAGCG AGTGATGGGG ATCCTGACAT GCCAGGAAGC GAAAGGACGT GAACAGCTTG CCACGATGCT GGCAGGACAA CAGGGCATGA GCGTTGAACA GGCCCGGGCG ATTCTGGCCG CGGCGGCACC GCAGCAGCCG GTGGCATCCA CGCAGAGTGA AGCCGATCGC ATTATGGCGT GTGAAGAAGC GAACGGTCGT GAACAACTGG CGGCAACGCT GGCGGCGATG CCGGAGATGA CGGTGGAAAA AGCCCGCCCG ATCCTGGCTG CTTCACCGCA GGCGGATGCC GGACCCTCAC TCCGTGATCA GATCATGGCA CTGGATGAGG CAAAAGGGGC TGAGGCGCAG GCTGAACAGC TGGCTGCCTG CCCGGGAATG ACTGTGGAGA GCGCCCGGGC TGTGCTGGCT GCGGGATCAG GTAAGGCAGA ACCGGTCTCT GCATCCACAA CCGCCCTGTT TGAACGCATC ATGGCGAACC ATTCACCGGC TGCGGTACAG GGTGGCGTGC CACAGACGTC AGCAGACGGT GATGCGGACG TGAAAATGCT CATGGCCATG CCATGA
|
Protein sequence | MRRNLSHIIA AAFNEPLLLE PAYARVFFCA LGREMGAASL SVPQQQVQLD APGMLAETDE YMAGGKRPAR VYRVVNGIAV LPVTGTLVHR LGGMRPFSGM TGYDGIVACL QQAMADSQVR GVLLDIDSPG GQAAGAFDCA DMIYRLRQQK PVWALCNDTA CSAAMLLASA CSRRLVTQTS RIGSIGVMMS HVSYAGHLAQ AGVDITLIYS GAHKVDGNQF EALPAEVRQN MQQRIDAARR MFAEKVAMFT GLSVDAVTGT EAAVFEGQSG IDAGLADELV NASDAISVMA TALNSNVRGG TMPQLTATEA AAQENQRVMG ILTCQEAKGR EQLATMLAGQ QGMSVEQARA ILAAAAPQQP VASTQSEADR IMACEEANGR EQLAATLAAM PEMTVEKARP ILAASPQADA GPSLRDQIMA LDEAKGAEAQ AEQLAACPGM TVESARAVLA AGSGKAEPVS ASTTALFERI MANHSPAAVQ GGVPQTSADG DADVKMLMAM P
|
| |