Gene ECH74115_3120 details


Gene Information

Locus tag: ECH74115_3120
Symbol:
ID: 6971393
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2894581
End bp: 2898054
Gene Length: 3474 bp
Protein Length: 1157 aa
Translation table: 11
GC content: 57%
IMG OID: 643386946
Product: hypothetical protein
Protein accession: YP_002271414
Protein GI: 209400993
COG category: [S] Function unknown
COG ID: [COG4733] Phage-related protein, tail component
TIGRFAM ID:
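
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2894581
end_bp = 2898054
gene_length_bp = 3474
protein_length_aa = 1157

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute a residue.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print("span (bp):", span_bp)
print("codons:", codon_count)
print("expected protein length (aa):", expected_protein_length)
print("consistent with record:",
      span_bp == gene_length_bp and expected_protein_length == protein_length_aa)
```

Under these assumptions the span, the gene length, and the protein length agree with one another.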


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.0703076
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGTAAAG GGGGCGGCAA GGGGCACACG CCGGTAGAGG CAAAGGACAA TCTTAAGTCC 
ACGCAGATGA TGAGCGTGAT TGACGCCATT GGTGAAGGGC CGATTGAAGG TCCGGTGAAG
GGGCTGCAGA GTATTCTGGT GAACAAAACC CCACTGACGG ACACGGACGG CAATCCCGTG
ATACACGGTG TGACCGCGGT CTGGCGCGCC GGGGAGCAGG AGCAGACACC GCCGGAAGGT
TTTGAGTCAT CCGGCTCTGA AACCGCACTG GGCGTGGAAG TGACGAAGGC AAAGCCGGTG
ACGCGCACCA TTACGTCCGC GAACATTGAC CGCCTGCGGG TTACCTTCGG GGTGCAGTCA
CTGGTGCAGA CCACCTCACA GGGTGACCGT AACCCGGCAT CCGTCCGCCT GCTGATTCAG
CTGCAGCGTA ACGGTAACTG GGTGACGGAA AAGGATGTCA CCATTAACGG CAAGACCACC
TCGCAGTTCC TGGCGTCGGT GATTCTGGAT AATCTGCCGC CCCGGCCCTT TAACATCCGG
ATGGTCAGGG AGACGGCGGA CAGCACCACG GACCAGCTGC AGAACAGAAC GCTGTGGTCG
TCATACACCG AAATCATCGA TGTGAAACAG TGCTACCCGA ACACGGCCAT TGTGGGGCTG
CAGGTGGATG CGGAGCAGTT TGGCGGTCAG CAGATGACGG TGAACTACCA TATCCGCGGT
CGCATCATCC AGGTGCCGTC AAACTATGAC CCGGAAAAAC GCACGTACAG CGGCATCTGG
GACGGCAGCC TGAAACCGGC ATACAGCAAC AACCCTGCCT GGTGCCTGTG GGACATGCTG
ACCCACCCGC GCTACGGAAT GGGAAAACGC CTGGGGGCGG CGGATGTGGA CAAGTGGGCG
CTGTATGCCA TTGCGCAGTA CTGCGACCAG ACGGTCCCGG ATGGTTTCGG GGGCACAGAG
CCGCGGATGA CTTTCAATGC GTACCTGTCA CAACAGCGTA AGGCGTGGGA CGTTCTCAGT
GATTTCTGCT CGGCGATGCG CTGTATGCCG GTATGGAACG GGCAGACGCT GACGTTTGTG
CAGGACCGTC CGTCAGATGT GGTGTGGCCC TACACCAGCA GTGATGTGGT GGTGGATGAT
AACGGCGTGG GGTTTCGCTA CAGCTTCAGC GCCCTGAAGG ACCGCCACAC GGCGGTGGAG
GTGAATTACA CCGACCCGCA GAACGGCTGG CAGACCTCCA CGGAACTGGT GGAAGACCCG
GAAGCCATAC TGCGCTACGG GCGCAACCTG CTGAAGATGG ATGCGTTCGG CTGCACCAGT
CGCGGTCAGG CCCACCGTGC CGGGCTGTGG GTGATAAAGA CCGGACTGCT GGAAACGCAG
ACGGTGGATT TCACGCTCGG GTCTCAGGGG CTGCGTCACA CACCCGGTGA CATTATTGAA
ATCTGTGATA ACGACTATGC CGGGACCATG ACCGGCGGAC GTGTCCTGTC CATCGATGCC
GCCAGCCGCA CCCTGACACT GGACCGTGAG GTGACCCTGC CGGAGACAGG TACACCGACG
GTGAACCTGA TTAACGGCAG CGGTAAGCCG GTGAGCGTGG CCATCACTGC ACACCCCGCG
CCGGACCGGA TACAGGTCAG CACCCTGCCG GATGGCGTGG AGACATACGG TGTATGGGGA
CTCTCCCTGC CGTCACTGCG TCGTCGCCTG TTCCGCTGTG TCTCCATCCG GGAAAACACG
GACGGCACCT TTGCCATCAC GGCAGTGCAG CACGTACCGG AAAAAGAAGC CATTGTGGAT
AACGGGGCCA GCTTTGAGCC ACTGTCCGGT TCGCTGAACA GCGTCATCCC GCCGGCTGTG
CAGCACCTGA CGGTGGAGGT GAGCGCGGCT GACGGTCAGT ATCTGGCACA GGCGAAATGG
GACACGCCGC GGGTGGTGAA GGGCGTGCGC TTCAGTCTGC GCCTGACCAG TGGTAAGGGA
ACGGATGCCA GACTGGTGAC CACCGCCATC ACCGCAGACA CGGAGCACCG TTTCAGCGGC
CTGCCGCTCG GGGAATACAC CCTGACGGTG CGGGCGATAA ACAGCTATGG CCAGCAGGGT
GAACCTGCCA CCACCACCTT CCGGATTGCC GCACCGGCAG CACCGTCGCG GATTGAGCTG
ACGCCGGGCT ATTTTCAGAT AACCGCCACG CCGCATCTTG CCGTTTATGA CCCGACGGTA
CAGTTTGAGT TCTGGTTTTC GGAAAAGCGG ATTGCGGATA TCAGGCAGGT TGAAACCGCA
GCCCGCTATC TTGGCTCGGC GCTGTACTGG ATAGCTGCCA GTATCAATAT CAAACCGGGC
CATGATTATT ATTTTTATAT CCGCAGTGTG AATACTGTTG GCAAATCGGC ATTCGTGGAG
GCTGTCGGTC GGGCGAGCGA TGATGCGGAA GGTTACCTGG ATTTTTTCAA AGGAGAAATC
GGGAAAACAC ATCTGGCCCA GGAGCTGTGG ACGCAGATTG ATAACGGTCA GCTTGCGCCG
GACCTGGCTG AAATCAGGAC GTCCATTACG AATGTCAGCA ATGAAATCAC GCAGACCGTC
AATAAAAAAC TGGAAAATCA GAGTGCGGCA ATCCAGCAGA TACAGAAAGT TCAGGTTGAT
ACAAATAATA ACCTGAACAG CATGTGGGCC GTGAAACTGC AGCAGATGCA GGACGGACGC
CTTTATATTG CGGGTATCGG TGCCGGTATT GAGAATACGC CAGCAGGAAT GCAGAGTCAG
GTGCTGCTGG CGGCAGACAG GATTGCGATG ATTAATCCTG CGAATGGCAA CACAAAGCCG
ATGTTTGTTG GTCAGGGCGA TCAGATATTT ATGAATGAAG TGTTCCTGAA ATATCTGACG
GCTCCCACCA TTACCAGCGG CGGTAATCCT CCGGCATTTT CCCTGACACC GGACGGGCGG
CTGACGGCGA AAAATGCCGA TATCAGCGGT AACGTGAATG CGAACTCCGG GACGCTCAAC
AACGTCACGA TTAACGAGAA CTGTCGGGTT CTGGGAAAAT TGTCCGCGAA CCAGATTGAA
GGCGATCTCG TTAAAACAGT GGGCAAAGCT TTCCCCCGGG ACTCCCGTGC ACCGGAGCGG
TGGCCATCAG GAACCATTAC CGTCAGGGTT TATGACGATC AGCCGTTTGA CCGGCAGATT
GTTATTCCGG CGGTGGCATT CAGCGGCGCT AAACATGAGA AAGAGCATAC TGATATTTAC
TCCTCATGCC GTCTGATAGT GCGGAAAAAC GGTGCTGAAA TTTATAACCG TACCGCGCTG
GATAATACGC TGATTTACAG TGGTGTTATT GATATGCCTG CCGGTCACGG TCACATGACA
CTGGAGTTTT CGGTGTCAGC ATGGCTGGTA AATAACTGGT ATCCCACAGC AAGTATCAGC
GATTTGCTGG TTGTGGTGAT GAAGAAAGCC ACTGCAGGCA TCACGATTAG CTGA
 
Protein sequence
MGKGGGKGHT PVEAKDNLKS TQMMSVIDAI GEGPIEGPVK GLQSILVNKT PLTDTDGNPV 
IHGVTAVWRA GEQEQTPPEG FESSGSETAL GVEVTKAKPV TRTITSANID RLRVTFGVQS
LVQTTSQGDR NPASVRLLIQ LQRNGNWVTE KDVTINGKTT SQFLASVILD NLPPRPFNIR
MVRETADSTT DQLQNRTLWS SYTEIIDVKQ CYPNTAIVGL QVDAEQFGGQ QMTVNYHIRG
RIIQVPSNYD PEKRTYSGIW DGSLKPAYSN NPAWCLWDML THPRYGMGKR LGAADVDKWA
LYAIAQYCDQ TVPDGFGGTE PRMTFNAYLS QQRKAWDVLS DFCSAMRCMP VWNGQTLTFV
QDRPSDVVWP YTSSDVVVDD NGVGFRYSFS ALKDRHTAVE VNYTDPQNGW QTSTELVEDP
EAILRYGRNL LKMDAFGCTS RGQAHRAGLW VIKTGLLETQ TVDFTLGSQG LRHTPGDIIE
ICDNDYAGTM TGGRVLSIDA ASRTLTLDRE VTLPETGTPT VNLINGSGKP VSVAITAHPA
PDRIQVSTLP DGVETYGVWG LSLPSLRRRL FRCVSIRENT DGTFAITAVQ HVPEKEAIVD
NGASFEPLSG SLNSVIPPAV QHLTVEVSAA DGQYLAQAKW DTPRVVKGVR FSLRLTSGKG
TDARLVTTAI TADTEHRFSG LPLGEYTLTV RAINSYGQQG EPATTTFRIA APAAPSRIEL
TPGYFQITAT PHLAVYDPTV QFEFWFSEKR IADIRQVETA ARYLGSALYW IAASINIKPG
HDYYFYIRSV NTVGKSAFVE AVGRASDDAE GYLDFFKGEI GKTHLAQELW TQIDNGQLAP
DLAEIRTSIT NVSNEITQTV NKKLENQSAA IQQIQKVQVD TNNNLNSMWA VKLQQMQDGR
LYIAGIGAGI ENTPAGMQSQ VLLAADRIAM INPANGNTKP MFVGQGDQIF MNEVFLKYLT
APTITSGGNP PAFSLTPDGR LTAKNADISG NVNANSGTLN NVTINENCRV LGKLSANQIE
GDLVKTVGKA FPRDSRAPER WPSGTITVRV YDDQPFDRQI VIPAVAFSGA KHEKEHTDIY
SSCRLIVRKN GAEIYNRTAL DNTLIYSGVI DMPAGHGHMT LEFSVSAWLV NNWYPTASIS
DLLVVVMKKA TAGITIS
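
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the alternative initiation codons and is read as methionine at the start of a coding sequence. The sketch below illustrates this on the first 60 nucleotides only. The `translate` helper and the `START_CODONS` set are illustrative assumptions, not part of the record, and the set covers only a subset of the initiation codons that table 11 allows.

```python
# First 60 nt of the gene sequence and first 20 aa of the protein, copied from the record.
gene_prefix = "GTGGGTAAAG GGGGCGGCAA GGGGCACACG CCGGTAGAGG CAAAGGACAA TCTTAAGTCC".replace(" ", "")
protein_prefix = "MGKGGGKGHT PVEAKDNLKS".replace(" ", "")

# Codon table in the conventional TCAG order. Table 11 assigns the same
# amino acids as the standard code; it differs in which start codons it allows.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Simplification: only a subset of the table 11 initiation codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a coding sequence, reading a recognised start codon as M."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


translated = translate(gene_prefix)
print(translated)
print("matches listed protein prefix:", translated == protein_prefix)
```

Applying the same helper to the full gene sequence would also yield a trailing `*` for the final TGA stop codon, which is not shown in the listed protein sequence.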