Gene Information |
Locus tag | ECH74115_1539 |
Symbol | |
ID | 6972176 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1505321 |
End bp | 1507258 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643385509 |
Product | putative prohead protease |
Protein accession | YP_002270003 |
Protein GI | 209400333 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01543] phage prohead protease, HK97 family |
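Note: the length fields above follow from the coordinates. The sketch below assumes 1-based, inclusive coordinates and a CDS that includes its stop codon. Both assumptions are consistent with the values in this record but are not stated by it, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1505321, 1507258)
print(length_bp, protein_length(length_bp))
```

With the Start bp and End bp values above, this reproduces the listed Gene Length (1938 bp) and Protein Length (645 aa).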
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGACTCTTA AACGGGCCTG TTCCCTGCTG ACGGTGAAAT CCTTCAGTGA GGATGAACGG GTGATCACCG GGATTGCGTC AACGCCTTCT CCGGATCGGG ATGGTGACAT CCTGGAGCCG GAGGGCGCGG AGTTTGGCAG TGCGATCCCG TTTCTCTGGC AGCATGACCA TTCCCGCCCG GTGGGGCAGT GTACGGTGCG CCGGGTCAGC GAAGGGCTGG AAATCACGGC AACACTGGTG AAGCCCGTAC CGGATATGCC GTCGCAACTG GCTGCCCGGC TGGATGAGGT CTGGGCGGCC ATTAAGACCG GGCTGGTCAG GGGGCTGTCC GTGGGCTTCC GTCCCCATGA ATACACCTTT CTGGACGGAG GCGGACTGCA TTTTCTGCGC TGGGAACTGA TGGAGGTGTC TGCCGTCACC GTGCCCGCGA ATGCGGAATG CACCATCCGG ACCATTAAAT CTTACGACCG CCCGTTTTCT GCCGCGTCCG GCAACCGGAA ACCGGTGGTG AAAATCGCAT CTTCTGCCGG CGCTGCGGCA CAGTCAACAA CCGTTTTTCA TAAGGAAAAG ACCATAATGA ATATTGGCGA ACAGATTAAA AGTTTTGAAA ACAAGCGTGC AGCGCTGGCA GCCTCCCTTG AGGAGGTCAT GACCAAAGCC GCAGAGGAAG GGCGCACGCT GGATGTGGAG GAGGAAGAGC ATTACGACAA CACCGCAGCG GAAATCCGTC AGGTGGATGC GCACCTGAAG CGCCTGCGTG AACTGGAAGC CGGTAAGGCC GCCACGGCGC AGCCGGTGAA ACAGGCCGGT AACGGGAATG TGGCCGCGGT GGCTTCTGCG CCGGTGATCC GTGTGGAGCA GAAACTGGAT AAGGGGATTG GCTTCGCCCG CTTTGCCAAA TCGCTGGCTG CGGCTAAAGG CGTCCGATCT GAAGCCCTGG AAGTGGCCCG TCGTCAGTAT CCGGATGACA GTCGTCTGCA TCATGTCCTG AAATCGGCAG TGGGCGCGGG GACCACCACG GATCCGCAGT GGGCAGGCAG CCTGTCTGAA TATCAGGAAT ACGCACAGGA CTTTATTGAT TACCTGCGTC CGCAGACCAT TATCGGGCGA TTTGGTCAGG GCGGGATCCC TGCACTTCGT CAGGTGCCGT TCAATATCCG TGTGCACGCC CAGGTGTCCG GCGGTGCTGC CGGCTGGGTG GGTGAGGGTA AGGCAAAACC CCTGACGAAG TTTGATTTTG AATCCATCAC CTTCAGTCAT GCGAAGGTGT CGGCCATTGC GGTACTGACG GAAGAGTTGA TCCGTTTTTC CAGTCCGGCT GCTGATGCAC TGGTCCGTAA TGCGCTGGCG GAAGCGGTGG TGGCGCGTCT GGATACAGAC TTTGTGGACC CGAAAAAAGC CGCAGTGGCA GATGTCTCCC CGGCGTCCAT CACCCATGAT GTGAAGGGCA CGGCATCAAC CGGTAACCCG GATGCGGATG CCGAGGCTGC GTTTGGACAG TTTGTGGCAG CAAACCTGCA GCCCACCGGT GCGGTCTGGC TGATGTCCAG CACCAATGCC CTGGCACTGT CCATGCGTAA AAATGCGCTG GGTCAGAAGG AATACCCGGA CATGACCCTG CTGGGTGGCT CCTTCCAGGG GCTGCCGGTG ATTGTCTCCC AGTACGTGGG TGACCAGCTG GTGCTGGTGA ATGCCCCGGA TATTTATCTG GCGGATGACG GCGGCGTGGC AGTGGATATG TCCCGCGAGG CATCACTGGA AATGCAGTCT GAGCCGGGCG GCGACAGTAC CACGCCGTCC 
CCGGTGGAGC TGGTTTCCAT GTTCCAGACA GGCAGCGTGG CCATCCGTGC GGAGCGCTGG ATCAACTGGC GTCGTCGCCG TACTGCGGCG GTGGCGGTGA TCACCGGAGT GAACTACGGC AGTGCGTCCG GCGGCTGA
Protein sequence | MTLKRACSLL TVKSFSEDER VITGIASTPS PDRDGDILEP EGAEFGSAIP FLWQHDHSRP VGQCTVRRVS EGLEITATLV KPVPDMPSQL AARLDEVWAA IKTGLVRGLS VGFRPHEYTF LDGGGLHFLR WELMEVSAVT VPANAECTIR TIKSYDRPFS AASGNRKPVV KIASSAGAAA QSTTVFHKEK TIMNIGEQIK SFENKRAALA ASLEEVMTKA AEEGRTLDVE EEEHYDNTAA EIRQVDAHLK RLRELEAGKA ATAQPVKQAG NGNVAAVASA PVIRVEQKLD KGIGFARFAK SLAAAKGVRS EALEVARRQY PDDSRLHHVL KSAVGAGTTT DPQWAGSLSE YQEYAQDFID YLRPQTIIGR FGQGGIPALR QVPFNIRVHA QVSGGAAGWV GEGKAKPLTK FDFESITFSH AKVSAIAVLT EELIRFSSPA ADALVRNALA EAVVARLDTD FVDPKKAAVA DVSPASITHD VKGTASTGNP DADAEAAFGQ FVAANLQPTG AVWLMSSTNA LALSMRKNAL GQKEYPDMTL LGGSFQGLPV IVSQYVGDQL VLVNAPDIYL ADDGGVAVDM SREASLEMQS EPGGDSTTPS PVELVSMFQT GSVAIRAERW INWRRRRTAA VAVITGVNYG SASGG
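Note: the sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that grouping, compute GC content, and translate the CDS with translation table 11. The helper names are hypothetical. The amino-acid string is the table 11 assignment in TCAG codon order, and reading the first codon as Met regardless of its identity is a simplifying assumption (GTG, used here, is an accepted start codon in table 11).

```python
BASES = "TCAG"
# Amino acids for translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS up to the first stop codon.

    Assumption: the first codon is the start codon and is read as M.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record
print(translate_cds("GTGACTCTTA AACGGGCCTG TTCCCTGCTG"))
```

Applied to the first three blocks of the gene sequence, this gives MTLKRACSLL, which matches the start of the protein sequence listed above.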