Gene ECH74115_0913 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0913 
Symbol 
ID6969975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp922079 
End bp925492 
Gene Length3414 bp 
Protein Length1137 aa 
Translation table11 
GC content56% 
IMG OID643384935 
Producthypothetical protein 
Protein accessionYP_002269435 
Protein GI209399520 
COG category[S] Function unknown 
COG ID[COG4733] Phage-related protein, tail component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAAAG GAAGCAGTAA GGGGCATACC CCGCGCGAAG CGAAGGACAA CCTGAAGTCC 
ACGCAGCTGC TGAGTGTGAT CGATGCCATC AGCGAAGGAC CGATTGAAGG TCCGGTGGAT
GGATTAAAAA GCGTGCTGCT GAACAGTACG CCGGTGCTGG ACAGTGAAGG TAATACCAAC
ATCTCCGGTG TCACGGTGGT GTTCCGGGCC GGTGAGCAGG AGCAGACACC GCCGGAGGGA
TTTGAATCCT CCGGTTCTGA GACGGTGCTG GGTACGGAAG TGAAATACGA CACGCCGATC
ACCCGCACCA TCACGTCGGC AAACATCGAT CGTCTGCGCT TTACTTTCGG TGTGCAGGCA
CTGCGGGAAA CCACCTCAAA GGGGGACCGG AATCCGTCGG AAGTCCGCCT GCTGGTTCAG
ATACAGCGTA ATGGTGGCTG GGTGACGGAA AAAGACATCA CCATTAAGGG CAAAACCACG
TCGCAGTATC TGGCCTCGGT GGTGGTGGAT AACCTGCCGC CGCGCCCGTT TAATATCCGG
ATGCGCAGGA TGACGCCGGA CAGCACCACA GACCAGCTGC AGAACAAAAC GCTCTGGTCG
TCATACACCG AAATCATCGA TGTGAAACAG TGCTACCCGA ACACGGCACT GGTTGGCGTG
CAGGTGGACT CGGAGCAGTT TGGCAGTCAG CAGGTGAGCC GTAATTATCA TCTTCGCGGG
CGCATTCTGC AGGTGCCGTC AAACTATGAT CCGGAAAAAC GCACTTACAG CGGCATCTGG
GACGGAACGT TAAAACCGGC ATACAGCAAC AACATGGCCT GGTGTCTGTG GGATATGCTG
ACCCACCCGC GCTACGGCAT GGGGAAACGT CTTGGTGCGG CGGATGTGGA TAAATGGGCG
CTGTATGTCA TCGGCCAGTA CTGCGACCAG TCAGTACCGG ACGGCTTTGG CGGCACGGAG
CCGCGCATCA CCTGTAATGC GTACCTGACC ACGCAGCGCA AGGCGTGGGA TGTGCTCAGT
GATTTCTGCT CGGCGATGCG CTGTATGCCG GTATGGAACG GGCAGACGCT GACGTTCGTG
CAGGACCGGC CGTCGGATAA GGTGTGGACC TATAACCGCA GTAATGTGGT GATGCCGGAT
GATGGCGCGC CGTTCCGCTA CAGCTTTAGC GCCCTGAAAG ACCGCCATAA TGCCGTTGAG
GTGAACTGGA CTGACCCGGA CAACGGCTGG GAGACGGCGA CAGAGCTTGT GGAGGACACG
CAGGCCATTG CCCGTTACGG TCGTAACGTC ACGAAGATGG ATGCCTTTGG CTGTACCAGT
CGGGGGCAGG CACACCGTGC CGGGCTGTGG CTGATTAAAA CGGAACTGCT GGAAACGCAG
ACCGTGGACT TCAGCGTGGG CGCAGAAGGG CTTCGCCATG TACCGGGCGA TGTCATTGAA
ATCTGTGATG ATGACTATGC GGGGATCAGC ATCGGTGGGC GTGTGCTGGC GGTGAACAGC
CAGACCCGGA CGCTGACGCT CGACCGTGAA ATCACGCTGC CATCCTCCGG CACCACGCTG
ATAAGCCTGG TTGACGGAAG TGGCAATCCG GTCAGCGTGG AGGTTCAGTC CGTCACCGAC
GGCGTGAAGG TAAAAGTGAG CCGGGTTCCT GACGGTGTTG CTGAATACAG CGTGTGGGGG
CTGAAGCTGC CGACGCTGCG CCAGCGCCTG TTCCGCTGCG TGAGTATCCG TGAGAACGAC
GACGGCACGT ATGCCATCAC TGCCGTGCAG CATGTGCCGG AGAAAGAGGG CATCGTGGAT
AACGGGGCGC ACTTTGACGG TGACCAGAGC AGCACGGTGA ATGGTGTCAC GCCGCCAGCG
GTGCAGCACC TGACCGCCGA AGTCTCCGCA GACAGCGGGG AATATCAGGT GCTGGCGCGA
TGGGACACGC CGAAGGTGGT GAAGGGTGTG AGCTTCCTGC TTCGCCTGAC CGTGGCAGCG
GATGACGGCA GTGAGCGGCT GGTCAGCACG GCCCGGACGA CGGAAACCAC ATACCGCTTC
ACGCAGCTGG CGCTGGGGAA CTACAGGCTG ACAGTCCGGG CGGTAAATGC GTGGGGACAG
CAGGGCGATC CGGCGTCGGT ATCGTTCCGG ATTGCCGCAC CGGCAGCGCC GTCACAGATT
GAGCTGACAC CGGGCTATTT TCAGATAACC GCCACGCCGC ATCTTGCGGT TTATGATCCG
ACGGTACAGT TTGAGTTCTG GTTCTCGGAA ACGCGGATTG CGGATATCAG GCAGGTTGAA
ACCAGCGCGC GTTATCTTGG TACGGCGCTG TACTGGATAG CCGCCAGTAT CAATATCAAA
CCGGGCCATG ATTATTACTT TTATATCCGC AGTGTGAACA CCGTTGGCAA ATCGGCATTC
GTGGAGGCCG TCGGTCGGGC GAGCGATGAT GCGGAAGGTT ATCTGGATTT TTTCAAAGGC
AAGATAACCG AATCTCATCT CGGCAAGGAG CTGCTGGAAA AAGTCGATCT GACGGAGGAT
AACGCCAGCA GACTGGATGA GTTTTCGAAA GAGTGGAAGG ACGCTAACGA TAAATGGAAT
GCCATGTGGG GCGTCAAAAT TGAGCAGACC AAAGACGGCA AACATTATGT CGCGGGTATT
GGCCTCAGCA TGGAGGACAC GGAGGAAGGC AAGCTGAGCC AGTTTCTGGT TGCCGCCAAT
CGTATCGCGT TTATTGACCC GGCAAACGGG AATGAAACGC CGATGTTTGT GGCGCAGGGC
AACCAGATAT TCATGAACGA CGTGTTCCTG AAGCGCCTGA CGGCCCCCAC CATTACCAGC
GGTGGAAATC CACCGGTATT TTCCCTGACA TCAGACGGAA AGCTGACCGC TAAAAATGCG
GATATCAGTG GCAGTGTGAA TGCGAACTCC GGGACGCTCA ACAACGTCAC GGTAAATGAA
AACTGTACGA TTAAGGGCAT GCTGGAGGCG ACTCAGGTCA GAGGTGACTT CGTTAAAGCT
GTATCCAAAT CATTTCCGAA ACAGGCTGGT ACGTGGGGTA ACACGGAAAC ACCAAACGGG
ACGGTTACAG TCACCATCAG CGATGATCAT AACTTTGACC GTCAAATCAT TATTCCGCCC
ATTATCTTTA ACGGAATAGC GTATAGCGAT CCGGGAAGTG GTAATAACCC GGGAGGTACA
AGATACACGG GTTATGGTTT TGAAGTTCGC AAAAACGGTG TATTAATCGC ATCCAGAGAA
ACTAAAGGGG CCATTCCCGG TAGCTACAGT GCGGTTATTG ATATGCCGAG TGGCAGGGGA
AGCGTCACTC TGGAGTTTAA GGTTTTCCAT AAAGGCAATC AGCGGGCAGG TAATATCACC
GACTGTACGG TGATTGTGAC CAAAAAAGCG GCTTCCGGCA TCAGTATCCG TTGA
 
Protein sequence
MGKGSSKGHT PREAKDNLKS TQLLSVIDAI SEGPIEGPVD GLKSVLLNST PVLDSEGNTN 
ISGVTVVFRA GEQEQTPPEG FESSGSETVL GTEVKYDTPI TRTITSANID RLRFTFGVQA
LRETTSKGDR NPSEVRLLVQ IQRNGGWVTE KDITIKGKTT SQYLASVVVD NLPPRPFNIR
MRRMTPDSTT DQLQNKTLWS SYTEIIDVKQ CYPNTALVGV QVDSEQFGSQ QVSRNYHLRG
RILQVPSNYD PEKRTYSGIW DGTLKPAYSN NMAWCLWDML THPRYGMGKR LGAADVDKWA
LYVIGQYCDQ SVPDGFGGTE PRITCNAYLT TQRKAWDVLS DFCSAMRCMP VWNGQTLTFV
QDRPSDKVWT YNRSNVVMPD DGAPFRYSFS ALKDRHNAVE VNWTDPDNGW ETATELVEDT
QAIARYGRNV TKMDAFGCTS RGQAHRAGLW LIKTELLETQ TVDFSVGAEG LRHVPGDVIE
ICDDDYAGIS IGGRVLAVNS QTRTLTLDRE ITLPSSGTTL ISLVDGSGNP VSVEVQSVTD
GVKVKVSRVP DGVAEYSVWG LKLPTLRQRL FRCVSIREND DGTYAITAVQ HVPEKEGIVD
NGAHFDGDQS STVNGVTPPA VQHLTAEVSA DSGEYQVLAR WDTPKVVKGV SFLLRLTVAA
DDGSERLVST ARTTETTYRF TQLALGNYRL TVRAVNAWGQ QGDPASVSFR IAAPAAPSQI
ELTPGYFQIT ATPHLAVYDP TVQFEFWFSE TRIADIRQVE TSARYLGTAL YWIAASINIK
PGHDYYFYIR SVNTVGKSAF VEAVGRASDD AEGYLDFFKG KITESHLGKE LLEKVDLTED
NASRLDEFSK EWKDANDKWN AMWGVKIEQT KDGKHYVAGI GLSMEDTEEG KLSQFLVAAN
RIAFIDPANG NETPMFVAQG NQIFMNDVFL KRLTAPTITS GGNPPVFSLT SDGKLTAKNA
DISGSVNANS GTLNNVTVNE NCTIKGMLEA TQVRGDFVKA VSKSFPKQAG TWGNTETPNG
TVTVTISDDH NFDRQIIIPP IIFNGIAYSD PGSGNNPGGT RYTGYGFEVR KNGVLIASRE
TKGAIPGSYS AVIDMPSGRG SVTLEFKVFH KGNQRAGNIT DCTVIVTKKA ASGISIR