Gene ECH74115_2246 details

Gene Information

Locus tag: ECH74115_2246
Symbol:
ID: 6970471
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2133972
End bp: 2135909
Gene Length: 1938 bp
Protein Length: 645 aa
Translation table: 11
GC content: 60%
IMG OID: 643386131
Product: putative prohead protease
Protein accession: YP_002270618
Protein GI: 209397565
COG category:
COG ID:
TIGRFAM ID: [TIGR01543] phage prohead protease, HK97 family
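
The coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2133972
end_bp = 2135909

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1938
print(protein_length)  # 645
```

Both results agree with the Gene Length (1938 bp) and Protein Length (645 aa) fields.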


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0000101798
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACTCTTA AACGGGCCTG TTCCCTGCTG ACGGTGAAAT CCTTCAGTGA GGATGAACGG 
GTGATCACCG GGATTGCGTC AACGCCTTCT CCGGATCGGG ATGGTGACAT CCTGGAGCCG
GAGGGCGCGG AGTTTGGCAG TGCGATCCCG TTTCTCTGGC AGCATGACCA TTCCCGCCCG
GTGGGGCAGT GTACGGTGCG CCGGGTCAGC GAAGGGCTGG AAATCACGGC AACACTGGTG
AAGCCCGTAC CGGATATGCC GTCGCAACTG GCTGCCCGGC TGGATGAGGT CTGGGCGGCC
ATTAAGACCG GGCTGGTCAG GGGGCTGTCC GTGGGCTTCC GTCCCCATGA ATACACCTTT
CTGGACGGAG GCGGACTGCA TTTTCTGCGC TGGGAACTGA TGGAGGTGTC TGCCGTCACC
GTGCCCGCGA ATGCGGAATG CACCATCCGG ACCATTAAAT CTTACGACCG CCCGTTTTCT
GCCGCGTCCG GCAACCGGAA ACCGGTGGTG AAAATCGCAT CTTCTGCCGG CGCTGCGGCA
CAGTCAACAA CCGTTTTTCA TAAGGAAAAG ACCATAATGA ATATTGGCGA ACAGATTAAA
AGTTTTGAAA ACAAGCGTGC AGCGCTGGCA GCCTCCCTTG AGGAGGTCAT GACCAAAGCC
GCAGAGGAAG GGCGCACGCT GGATGTGGAG GAGGAAGAGC ATTACGACAA CACCGCAGCG
GAAATCCGTC AGGTGGATGC GCACCTGAAG CGCCTGCGTG AACTGGAAGC CGGTAAGGCC
GCCACGGCGC AGCCGGTGAA ACAGGCCGGT AACGGGAATG TGGCCGCGGT GGCTTCTGCG
CCGGTGATCC GTGTGGAGCA GAAACTGGAT AAGGGGATTG GCTTCGCCCG CTTTGCCAAA
TCGCTGGCTG CGGCTAAAGG CGTCCGATCT GAAGCCCTGG AAGTGGCCCG TCGTCAGTAT
CCGGATGACA GTCGTCTGCA TCATGTCCTG AAATCGGCAG TGGGCGCGGG GACCACCACG
GATCCGCAGT GGGCAGGCAG CCTGTCTGAA TATCAGGAAT ACGCACAGGA CTTTATTGAT
TACCTGCGTC CGCAGACCAT TATCGGGCGA TTTGGTCAGG GCGGGATCCC TGCACTTCGT
CAGGTGCCAT TCAATATCCG TGTGCACGCC CAGGTGTCCG GCGGTGCTGC CGGCTGGGTG
GGTGAGGGTA AGGCAAAACC CCTGACGAAG TTTGATTTTG AATCCATCAC CTTCAGTCAT
GCGAAGGTGT CGGCCATTGC GGTACTGACG GAAGAATTGA TCCGTTTTTC CAGTCCGGCT
GCTGATGCAC TGGTCCGTAA TGCGCTGGCG GAAGCGGTGG TGGCGCGTCT GGATACAGAC
TTTGTGGACC CGAAAAAAGC CGCAGTGGCA GATGTCTCCC CGGCGTCCAT CACCCATGAT
GTGAAGGGCA CGGCATCAAC CGGTAACCCG GATGCGGATG CCGAGGCTGC GTTTGGACAG
TTTGTGGCAG CAAACCTGCA GCCCACCGGT GCGGTCTGGC TGATGTCCAG CACCAATGCC
CTGGCACTGT CCATGCGTAA AAATGCGCTG GGTCAGAAGG AATACCCGGA CATGACCCTG
CTGGGTGGCT CCTTCCAGGG GCTGCCGGTG ATTGTCTCCC AGTACGTGGG TGACCAGCTG
GTGCTGGTGA ATGCCCCGGA TATTTATCTG GCGGATGACG GCGGCGTGGC AGTGGATATG
TCCCGCGAGG CATCACTGGA AATGCAGTCT GAGCCGGGCG GCGACAGTAC CACGCCGTCC
CCGGTGGAGC TGGTTTCCAT GTTCCAGACA GGCAGCGTGG CCATCCGTGC GGAGCGCTGG
ATCAACTGGC GTCGTCGCCG TACTGCGGCG GTGGCGGTGA TCACCGGAGT GAACTACGGC
AGTGCGTCCG GCGGCTGA
 
Protein sequence
MTLKRACSLL TVKSFSEDER VITGIASTPS PDRDGDILEP EGAEFGSAIP FLWQHDHSRP 
VGQCTVRRVS EGLEITATLV KPVPDMPSQL AARLDEVWAA IKTGLVRGLS VGFRPHEYTF
LDGGGLHFLR WELMEVSAVT VPANAECTIR TIKSYDRPFS AASGNRKPVV KIASSAGAAA
QSTTVFHKEK TIMNIGEQIK SFENKRAALA ASLEEVMTKA AEEGRTLDVE EEEHYDNTAA
EIRQVDAHLK RLRELEAGKA ATAQPVKQAG NGNVAAVASA PVIRVEQKLD KGIGFARFAK
SLAAAKGVRS EALEVARRQY PDDSRLHHVL KSAVGAGTTT DPQWAGSLSE YQEYAQDFID
YLRPQTIIGR FGQGGIPALR QVPFNIRVHA QVSGGAAGWV GEGKAKPLTK FDFESITFSH
AKVSAIAVLT EELIRFSSPA ADALVRNALA EAVVARLDTD FVDPKKAAVA DVSPASITHD
VKGTASTGNP DADAEAAFGQ FVAANLQPTG AVWLMSSTNA LALSMRKNAL GQKEYPDMTL
LGGSFQGLPV IVSQYVGDQL VLVNAPDIYL ADDGGVAVDM SREASLEMQS EPGGDSTTPS
PVELVSMFQT GSVAIRAERW INWRRRRTAA VAVITGVNYG SASGG
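
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions, not the method used to generate the record: it assumes that translation table 11 assigns the same amino acids to codons as the standard code, and that an initiating GTG (as in this gene) is read as methionine. The `translate` helper and the `START_CODONS` subset are hypothetical names introduced for illustration.

```python
# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: only a subset of the table 11 start codons is listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna, is_cds_start=True):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing between 10-mers
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        # An initiating start codon is read as Met, whatever it normally encodes.
        if pos == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_30_nt = "GTGACTCTTA AACGGGCCTG TTCCCTGCTG"
last_18_nt = "AGTGCGTCCG GCGGCTGA"

print(translate(first_30_nt))                     # MTLKRACSLL
print(translate(last_18_nt, is_cds_start=False))  # SASGG
```

The two fragments reproduce the first ten and last five residues of the protein sequence above; the final TGA is the stop codon and is not translated.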