Gene ECH74115_2762 details


Gene Information

Locus tag: ECH74115_2762
Symbol:
ID: 6968579
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2580617
End bp: 2584096
Gene Length: 3480 bp
Protein Length: 1159 aa
Translation table: 11
GC content: 58%
IMG OID: 643386617
Product: hypothetical protein
Protein accession: YP_002271096
Protein GI: 209398947
COG category: [S] Function unknown
COG ID: [COG4733] Phage-related protein, tail component
TIGRFAM ID:
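
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2580617
END_BP = 2584096
GENE_LENGTH_BP = 3480
PROTEIN_LENGTH_AA = 1159


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acid count for a CDS that ends in one uncounted stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2584096 - 2580617 + 1 gives 3480 bp, and 3480 / 3 - 1 gives 1159 aa, matching the Gene Length and Protein Length fields.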


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.619674
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGTAAAG GGGGCGGCAA GGGGCACACG CCGGTAGAGG CAAAGGACAA TCTTAAGTCC 
ACGCAGATGA TGAGCGTGAT TGACGCCATT GGTGAAGGGC CGATTGAAGG TCCGGTGAAG
GGGCTGCAGA GTATTCTGGT GAACAAAACC CCGCTGACGG ACACGGACGG TAATCCCGTG
ATACACGGTG TGACTGCGGT CTGGCGTGCC GGGGAGCAGG AGCAGACACC ACCGGAAGGC
TTTGAGTCCT CCGGAGCTGA AACCGCACTG GGCGTGGAAG TGACGAAGGC AAAGCCGGTG
ACGCGCACCA TTACGTCCGC GAACATTGAC CGCCTGCGGG TCACCTTCGG GGTGCAGTCA
CTGGTGCAGA CCACCTCACA GGGTGACCGT AACCCGGCAT CCGTCCGACT GCTGATTCAG
TTGCAGCGTA ACGGTAACTG GGTGACGGAA AAGGATGTCA CCATTAACGG CAAGACCACC
TCGCAGTTCC TGGCGTCGGT GATTCTGGAT AATCTGCCGC CCCGGCCCTT TAACATCCGG
ATGGTCAGGG AGACGGCGGA CAGCACCACG GACCAGCTGC AGAACAGAAC GCTGTGGTCG
TCATACACCG AAATCATCGA TGTGAAACAG TGCTACCCGA ACACGGCCAT TGTGGGGCTG
CAGGTGGATG CGGAGCAGTT TGGCGGTCAG CAGATGACGG TGAACTACCA TATCCGCGGT
CGCATCATCC AGGTGCCGTC AAACTATGAC CCGGAAAAAC GCACGTACAG CGGCATCTGG
GACGGCAGCC TGAAACCGGC ATACAGCAAC AACCCTGCCT GGTGCCTGTG GGACATGCTG
ACCCACCCGC GCTACGGAAT GGGAAAACGC CTGGGGGCGG CGGATGTGGA CAAGTGGGCG
CTGTATGCCA TTGCGCAGTA CTGCGACCAG ACGGTCCCGG ATGGTTTCGG GGGCACAGAG
CCGCGGATGA CTTTCAATGC GTACCTGTCA CAACAGCGTA AGGCGTGGGA CGTTCTCAGT
GATTTCTGCT CGGCGATGCG CTGTATGCCG GTATGGAACG GGCAGACGCT GACGTTTGTG
CAGGACCGTC CGTCAGATGT GGTGTGGCCC TACACCAGCA GTGATGTGGT GGTGGATGAT
AACGGCGTGG GGTTTCGCTA CAGCTTCAGC GCCCTGAAGG ACCGCCACAC GGCGGTGGAG
GTGAATTACA CCGACCCGCA GAACGGCTGG CAGACCTCCA CGGAACTGGT GGAAGACCCG
GAAGCCATAC TGCGCTACGG GCGCAACCTG CTGAAGATGG ATGCGTTCGG CTGTACCAGT
CGCGGTCAGG CCCACCGTGC CGGGCTGTGG GTGATAAAGA CCGGACTGCT GGAAACGCAG
ACGGTGGATT TCACGCTCGG GTCACAGGGG CTGCGTCACA CACCCGGTGA CATTATTGAA
ATCTGTGATA ACGACTATGC CGGGACCATG ACCGGCGGAC GTATCCTGTC CATCGATGCC
GCCAGCCGCA CCCTGACACT GGACCGTGAG GTGACCCTGC CGGAGACCGG TGCCGCCACG
GTGAACCTGA TTAACGGCAG CGGTAAGCCG GTGAGCGTGG CCATCACTGC ACACCCCGCG
CCGGACCGGA TACAGGTCAG CACCCTGCCG GATGGCGTGG AGACATACGG TGTGTGGGGG
CTCTCCCTGC CGTCACTGCG TCGTCGCCTG TTCCGCTGTG TCTCCATCCG GGAAAACACG
GACGGCACCT TTGCCATCAC GGCAGTGCAG CACGTACCGG AAAAAGAAGC CATCGTGGAT
AACGGGGCGC ACTTTGACGG CGACCAGAGC GGCACCCTGA ACAGCGTCAT CCCTCCGGCA
GTGCAGCACC TGACGGTGGA GGTGAGTGCA GCTGACAGCC AGTATCTGGC GCAGGCGAAA
TGGGACACGC CGCGGGTGGT GAAGGGCGTG CGCTTCAGTC TGCGCCTGAC CAGTGGAAGC
GGTCAGGACA GCCGTCTGGT GACCACCGCC ATCACTGCGG ATACAGAGCA TCGTTTCAGT
GGTCTGCCGC TCGGGGAATA CACCCTGACA GTCAGGGCAA TTAACAGTTA TGGCCAGCAG
GGCGAACCGG CCACCACCAC CTTCCGGATT AACGCGCCAG CAAAACCCGC CACCATTGAA
CTGACGCCGG GGTATTTTCA GATAACGGCG GTACCGGTGC TGGCGGTGTA TGACCCGACG
GTGCAGTTTG AGTTCTGGTT TTCGGAAAAA CGCATCACGA ACACGGCACA GGTGGAAAAA
TCTGCCCGTT ATCTGGGGAG CGGCAGTCAG TGGACTGTCC AGGGAAGCCG GATTAAGCCG
GGGACGGATT TCTGGTTTTA CGTGCGCAGC GTCAACCTGG TGGGGAAATC TGCGTTTGTG
GAAGTCAGCG GGCAGCCCAG CAATGATGGT GAAGGGTATC TGGAATTTTT CCGGGAAAAA
ATAGGAAAAC TGCATCTGGC TCAGGGGCTA TGGGAGCTGA TAGACAACAG CCAGCTTGCG
GATGAGATGG CGGAGATGAA GACCACCATC ACGGAAACCC GCAATGAAAT CACACAGACG
GTCAGTAAAA CGCTGGAGAA CCAGAGCGCC ACTATACAGC AGATACAGCG CGTGCAGAAG
GACACAAATG ATGACCTGGC TGCGCTGTAC ATGCTGAAGG TTCAAAAAAC GAAAGACGGC
ATTCCCTATG TGGCCGGGAT TGGTGCAGGG ATTGAGGATA CTGATGGCCA GCCACTGAGC
AACATACTGC TGCTGGCTGA CCGTATCGCG ATGATAAATC CGGAGAGCGG CAACAGCACG
CCGTTATTTG TGGCGCAGGG GAATCAGCTG TTCATGAACG ACGTGTTCCT GAAGCGACTG
TTTGCGGTGA GTATCACCTC GTCCGGCAAT CCCCCGACGT TTTCCCTGAC GCCGGACGGG
CGACTGACGG CGAAAAATGC GGATATCAGT GGCAGTGTGA ATGCGAACTC AGGGACGCTC
AACAACGTCA CGATTAATGA GAACTGTCAG ATTAAGGGGA AACTGTCAGC CAACCAGATT
GAAGGCGATA TTGTCAAAAC GGTCAGCAAG TCTTTCCCCC GCACGAGCAC TTATGCCAGT
GGCACCATCA CGGTAAGAAT CAGTGATGAT CAGAAGTTTG ACCGGCAGGT CATGATACCG
CCAGTGTTAT TCCGCGGTGG TAAGCATGAG AATTTCAACA GTAATAACCA ACAGTCATAC
TGGTATTCAA CCTGCCGGTT AAGAGTGACC CGCAATGGTC AGGAGATTTT TAATCAGTCC
ACGACGGATG CTCAGGGCGT ATTTTCCTCA GTTATAGATA TGCCTGCCGG ACAGGGGACG
CTGACACTGA CATTCACCGT ATCTTCATCA GGAGCGAATA ACTGGACACC AACAACCAGT
ATCAGCGATC TGCTGGTTGT GGTGATGAAG AAAGCCACCG CAGGCATCAG TATCAGCTGA
 
Protein sequence
MGKGGGKGHT PVEAKDNLKS TQMMSVIDAI GEGPIEGPVK GLQSILVNKT PLTDTDGNPV 
IHGVTAVWRA GEQEQTPPEG FESSGAETAL GVEVTKAKPV TRTITSANID RLRVTFGVQS
LVQTTSQGDR NPASVRLLIQ LQRNGNWVTE KDVTINGKTT SQFLASVILD NLPPRPFNIR
MVRETADSTT DQLQNRTLWS SYTEIIDVKQ CYPNTAIVGL QVDAEQFGGQ QMTVNYHIRG
RIIQVPSNYD PEKRTYSGIW DGSLKPAYSN NPAWCLWDML THPRYGMGKR LGAADVDKWA
LYAIAQYCDQ TVPDGFGGTE PRMTFNAYLS QQRKAWDVLS DFCSAMRCMP VWNGQTLTFV
QDRPSDVVWP YTSSDVVVDD NGVGFRYSFS ALKDRHTAVE VNYTDPQNGW QTSTELVEDP
EAILRYGRNL LKMDAFGCTS RGQAHRAGLW VIKTGLLETQ TVDFTLGSQG LRHTPGDIIE
ICDNDYAGTM TGGRILSIDA ASRTLTLDRE VTLPETGAAT VNLINGSGKP VSVAITAHPA
PDRIQVSTLP DGVETYGVWG LSLPSLRRRL FRCVSIRENT DGTFAITAVQ HVPEKEAIVD
NGAHFDGDQS GTLNSVIPPA VQHLTVEVSA ADSQYLAQAK WDTPRVVKGV RFSLRLTSGS
GQDSRLVTTA ITADTEHRFS GLPLGEYTLT VRAINSYGQQ GEPATTTFRI NAPAKPATIE
LTPGYFQITA VPVLAVYDPT VQFEFWFSEK RITNTAQVEK SARYLGSGSQ WTVQGSRIKP
GTDFWFYVRS VNLVGKSAFV EVSGQPSNDG EGYLEFFREK IGKLHLAQGL WELIDNSQLA
DEMAEMKTTI TETRNEITQT VSKTLENQSA TIQQIQRVQK DTNDDLAALY MLKVQKTKDG
IPYVAGIGAG IEDTDGQPLS NILLLADRIA MINPESGNST PLFVAQGNQL FMNDVFLKRL
FAVSITSSGN PPTFSLTPDG RLTAKNADIS GSVNANSGTL NNVTINENCQ IKGKLSANQI
EGDIVKTVSK SFPRTSTYAS GTITVRISDD QKFDRQVMIP PVLFRGGKHE NFNSNNQQSY
WYSTCRLRVT RNGQEIFNQS TTDAQGVFSS VIDMPAGQGT LTLTFTVSSS GANNWTPTTS
ISDLLVVVMK KATAGISIS
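
The gene sequence begins with GTG while the protein sequence begins with M. A minimal sketch of how that relationship can be reproduced is given below. It assumes the standard codon-to-amino-acid assignments (which translation table 11 shares), treats ATG, GTG and TTG as initiation codons read as methionine in the first position (a subset chosen for illustration), and stops at the first stop codon. The function name `translate_cds` is hypothetical, and this is not the pipeline that produced the record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11 uses
# the same assignments and differs mainly in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a common subset of bacterial initiation codons, for illustration.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace used to group bases into blocks."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record (60 bases, 20 codons).
FIRST_LINE = (
    "GTGGGTAAAG GGGGCGGCAA GGGGCACACG CCGGTAGAGG CAAAGGACAA TCTTAAGTCC"
)

if __name__ == "__main__":
    print(translate_cds(FIRST_LINE))
```

Applied to the first line of the gene sequence, the sketch yields the first 20 residues of the protein sequence listed above (MGKGGGKGHT PVEAKDNLKS with the block spacing removed).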