Gene ECH74115_2873 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2873 
Symbol 
ID6967166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2668207 
End bp2671686 
Gene Length3480 bp 
Protein Length1159 aa 
Translation table11 
GC content57% 
IMG OID643386719 
Producthypothetical protein 
Protein accessionYP_002271190 
Protein GI209399897 
COG category[S] Function unknown 
COG ID[COG4733] Phage-related protein, tail component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.00489128 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGCAAAG GTGGTGGCAA GGCGCACACG CCGGTTGAGG CAAAGGACAA TCTTAAGTCC 
ACGCAGATGA TGAGCGTGAT TGATGCGATT GGTGAAGGGC CGATTGAAGG TCCGGTGAAG
GGGCTGCAGA GTATCCTGGT GAACAAAACC CCGCTGACGG ACACGGACGG TAATCCTGTG
ATACATGGTG TGACAGCGGT CTGGCGCGCC GGGGAGCAGG AGCAGACACC ACCTGAAGGC
TTTGAGTCCT CCGGAGCTGA AACCGCACTG GGCGTGGAAG TGACGAAGGC AAAGCCGGTG
ACGCGCACAA TTACGTCCGC GAACATTGAC CGCCTGCGGG TCACCTTCGG GGTGCAGTCA
CTGGTGCAGA CCACCTCACA GGGTGACCGT AACCCGGCAT CCGTCCGACT GCTGATTCAG
TTGCAGCGTA ACGGTAACTG GGTGACGGAA AAGGATGTCA CCATTAACGG CAAGACCACC
TCGCAGTTCC TGGCGTCGGT GATTCTGGAT AATCTGCCGC CCCGGCCCTT TAACATCCGG
ATGGTCAGGG AGACGGCGGA CAGCACCACG GACCAGCTGC AGAACAGAAC GCTGTGGTCG
TCATACACCG AAATCATCGA TGTGAAACAG TGCTACCCGA ACACGGCCAT TGTGGGGCTG
CAGGTGGATG CGGAGCAGTT TGGCGGTCAG CAGATGACGG TGAACTACCA TATCCGCGGT
CGCATCATCC AGGTGCCGTC AAACTATGAC CCGGAAAAAC GCACGTACAG CGGCATCTGG
GACGGCAGCC TGAAACCGGC ATACAGCAAC AACCCTGCCT GGTGCCTGTG GGACATGCTG
ACCCACCCGC GCTACGGAAT GGGAAAACGC CTGGGGGCGG CGGATGTGGA CAAGTGGGCG
CTGTATGCCA TTGCGCAGTA CTGCGACCAG ACGGTCCCGG ATGGTTTCGG GGGCACAGAG
CCGCGGATGA CTTTCAATGC GTACCTGTCA CAACAGCGTA AGGCGTGGGA CGTTCTCAGT
GATTTCTGCT CGGCGATGCG CTGTATGCCG GTATGGAACG GGCAGACGCT GACGTTTGTG
CAGGACCGTC CGTCAGATGT GGTGTGGCCC TACACCAGCA GTGATGTGGT GGTGGATGAT
AACGGCGTGG GGTTTCGCTA CAGCTTCAGC GCCCTGAAGG ACCGCCACAC GGCGGTGGAG
GTGAATTACA CCGACCCGCA GAACGGCTGG CAGACCTCCA CGGAACTGGT GGAAGACCCG
GAAGCCATAC TGCGCTACGG GCGCAACCTG CTGAAGATGG ATGCGTTCGG CTGCACCAGT
CGCGGTCAGG CCCACCGTGC CGGGCTGTGG GTGATAAAGA CCGGACTGCT GGAAACGCAG
ACGGTGGATT TCACGCTCGG GTCTCAGGGG CTGCGTCACA CACCCGGTGA CATTATTGAA
ATCTGTGATA ACGACTATGC CGGGACCATG ACCGGCGGAC GTGTCCTGTC CATCGATGCC
GCCAGCCGCA CCCTGACACT GGACCGTGAG GTGACCCTGC CGGAGACAGG TACACCGACG
GTGAACCTGA TTAACGGCAG CGGTAAGCCG GTGAGCGTGG CCATCACTGC ACACCCCGCG
CCGGACCGGA TACAGGTCAG CACCCTGCCG GATGGCGTGG AGACATACGG TGTATGGGGA
CTCTCCCTGC CGTCACTGCG TCGTCGCCTG TTCCGCTGTG TCTCCATCCG GGAAAACACG
GACGGCACCT TTGCCATCAC GGCAGTGCAG CACGTACCGG AAAAAGAAGC CATCGTGGAT
AACGGGGCGC ACTTTGACGG CGACCAGAGC GGCACCCTGA ACAGCGTCAT CCCTCCGGCA
GTGCAGCACC TGACGGTGGA GGTGAGTGCA GCTGACAGCC AGTATCTGGC GCAGGCGAAA
TGGGACACGC CGCGGGTGGT GAAGGGCGTG CGCTTCAGTC TGCGCCTGAC CAGTGGAAGC
GGTCAGGACA GCCGTCTGGT GACCACCGCC ATCACTGCGG ATACAGAGCA TCGTTTCAGT
GGTCTGCCGC TCGGGGAATA CACCCTGACA GTCAGGGCAA TTAACAGTTA TGGCCAGCAG
GGCGAACCGG CCACCACCAC CTTCCGGATT AACGCGCCAG CAAAACCCGC CACCATTGAA
CTGACGCCGG GGTATTTTCA GATAACGGCG GTACCGGTGC TGGCGGTGTA TGACCCGACG
GTGCAGTTTG AGTTCTGGTT TTCGGAAAAA CGCATCACGA ACACGGCACA GGTGGAAAAA
TCTGCCCGTT ATCTGGGGAG CGGCAGTCAG TGGACTGTCC AGGGAAGCCG GATTAAGCCG
GGGACGGATT TCTGGTTTTA CGTGCGCAGC GTCAACCTGG TGGGGAAATC TGCGTTTGTG
GAAGTCAGCG GGCAGCCCAG CAATGATGGT GAAGGGTATC TGGAATTTTT CCGGGAAAAA
ATAGGAAAAC TGCATCTGGC TCAGGGGCTA TGGGAGCTGA TAGACAACAG CCAGCTTGCG
GATGAGATGG CGGAGATGAA GACCACCATC ACGGAAACCC GCAATGAAAT CACACAGACG
GTCAGTAAAA CGCTGGAGAA CCAGAGCGCC ACTATACAGC AGATACAGCG CGTGCAGAAG
GACACAAATG ATGACCTGGC TGCGCTGTAC ATGCTGAAGG TTCAAAAAAC GAAAGACGGC
ATTCCCTATG TGGCCGGGAT TGGTGCAGGG ATTGAGGATA CTGATGGCCA GCCACTGAGC
AACATACTGC TGCTGGCTGA CCGTATCGCG ATGATAAATC CGGAGAGCGG CAACAGCACG
CCGTTATTTG TGGCGCAGGG GAATCAGCTG TTCATGAACG ACGTGTTCCT GAAGCGACTG
TTTGCGGTGA GTATCACCTC GTCCGGCAAT CCCCCGACGT TTTCCCTGAC GCCGGACGGG
CGACTGACGG CGAAAAATGC GGATATCAGT GGCAGTGTGA ATGCGAACTC AGGGACGCTC
AACAACGTCA CGATTAATGA GAACTGTCAG ATTAAGGGGA AACTGTCAGC CAACCAGATT
GAAGGCGATA TTGTCAAAAC GGTCAGCAAG TCTTTCCCCC GCACGAGCAC TTATGCCAGT
GGCACCATCA CGGTAAGAAT CAGTGATGAT CAGAAGTTTG ACCGGCAGGT CATGATACCG
CCAGTGTTAT TCCGCGGTGG TAAGCATGAG AATTTCAACA GTAATAACCA ACAGTCATAC
TGGTATTCAA CCTGCCGGTT AAGAGTGACC CGCAATGGTC AGGAGATTTT TAATCAGTCC
ACGACGGATG CTCAGGGCGT ATTTTCCTCA GTTATAGATA TGCCTGCCGG ACAGGGGACG
CTGACACTGA CATTCACCGT ATCTTCATCA GGAGCGAATA ACTGGACACC AACAACCAGT
ATCAGCGATC TGCTGGTTGT GGTGATGAAA AAATCCACAG CAGGTATCAG TATCAGCTGA
 
Protein sequence
MGKGGGKAHT PVEAKDNLKS TQMMSVIDAI GEGPIEGPVK GLQSILVNKT PLTDTDGNPV 
IHGVTAVWRA GEQEQTPPEG FESSGAETAL GVEVTKAKPV TRTITSANID RLRVTFGVQS
LVQTTSQGDR NPASVRLLIQ LQRNGNWVTE KDVTINGKTT SQFLASVILD NLPPRPFNIR
MVRETADSTT DQLQNRTLWS SYTEIIDVKQ CYPNTAIVGL QVDAEQFGGQ QMTVNYHIRG
RIIQVPSNYD PEKRTYSGIW DGSLKPAYSN NPAWCLWDML THPRYGMGKR LGAADVDKWA
LYAIAQYCDQ TVPDGFGGTE PRMTFNAYLS QQRKAWDVLS DFCSAMRCMP VWNGQTLTFV
QDRPSDVVWP YTSSDVVVDD NGVGFRYSFS ALKDRHTAVE VNYTDPQNGW QTSTELVEDP
EAILRYGRNL LKMDAFGCTS RGQAHRAGLW VIKTGLLETQ TVDFTLGSQG LRHTPGDIIE
ICDNDYAGTM TGGRVLSIDA ASRTLTLDRE VTLPETGTPT VNLINGSGKP VSVAITAHPA
PDRIQVSTLP DGVETYGVWG LSLPSLRRRL FRCVSIRENT DGTFAITAVQ HVPEKEAIVD
NGAHFDGDQS GTLNSVIPPA VQHLTVEVSA ADSQYLAQAK WDTPRVVKGV RFSLRLTSGS
GQDSRLVTTA ITADTEHRFS GLPLGEYTLT VRAINSYGQQ GEPATTTFRI NAPAKPATIE
LTPGYFQITA VPVLAVYDPT VQFEFWFSEK RITNTAQVEK SARYLGSGSQ WTVQGSRIKP
GTDFWFYVRS VNLVGKSAFV EVSGQPSNDG EGYLEFFREK IGKLHLAQGL WELIDNSQLA
DEMAEMKTTI TETRNEITQT VSKTLENQSA TIQQIQRVQK DTNDDLAALY MLKVQKTKDG
IPYVAGIGAG IEDTDGQPLS NILLLADRIA MINPESGNST PLFVAQGNQL FMNDVFLKRL
FAVSITSSGN PPTFSLTPDG RLTAKNADIS GSVNANSGTL NNVTINENCQ IKGKLSANQI
EGDIVKTVSK SFPRTSTYAS GTITVRISDD QKFDRQVMIP PVLFRGGKHE NFNSNNQQSY
WYSTCRLRVT RNGQEIFNQS TTDAQGVFSS VIDMPAGQGT LTLTFTVSSS GANNWTPTTS
ISDLLVVVMK KSTAGISIS