Gene ECH74115_1201 details

Gene Information

Locus tag: ECH74115_1201
Symbol:
ID: 6966837
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1208230
End bp: 1211703
Gene Length: 3474 bp
Protein Length: 1157 aa
Translation table: 11
GC content: 57%
IMG OID: 643385198
Product: hypothetical protein
Protein accession: YP_002269694
Protein GI: 209397821
COG category: [S] Function unknown
COG ID: [COG4733] Phage-related protein, tail component
TIGRFAM ID:
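
The length fields above follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that reproduces the listed 3474 bp) and that the final codon is a stop codon, the arithmetic can be checked like this. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive on both ends.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and encodes no residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


print(cds_lengths(1208230, 1211703))  # (3474, 1157)
```

Both values agree with the Gene Length and Protein Length fields of this record.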


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTAAAG GTGGCGGCAG GGCGCACACG CCGGTTGAGG CAAAGGACAA TCTTAAGTCC 
ACGCAGATGA TGAGCGTGAT TGATGCCATT GGTGAAGGGC CGATTGAAGG TCCGGTGAAG
GGGCTGCAGA GTATCCTGGT GAACAAAACC CCGCTGACGG ACACGGACGG TAATCCTGTG
ATACATGGTG TGACAGCGGT CTGGCGCGCC GGGGAGCAGG AGCAGACACC ACCTGAAGGC
TTTGAGTCTT CCGGAGCTGA AACCGCACTG GGCGTGGAAG TGACGAAGGC AAAGCCGGTG
ACGCGCACCA TTACATCCGC GAACATTGAC CGCCTGCGGG TCACCTTCGG GGTGCAGTCA
CTGTTGGAGA CCACCTCAAA GGGCGACCGT AATCCCTCTT CTGTCCGACT GCTGATTCAG
TTGCAGCGTA ACGGTAACTG GGTGACGGAA AAGGATGTCA CCATTAACGG CAAGACCACC
TCGCAGTTTC TGGCGTCGGT GATTCTGGAT AATCTGCCTC CCCGTCCTTT TAACATCCGG
ATGGTCCGGG AGACGGCGGA CAGCACCTCG GACCAGCTGC AGAATAAGAC GCTCTGGTCG
TCATACACCG AAATCATCGA TGTGAAACAG TGCTACCCGA ACACGGCGAT TGTGGGGCTG
CAGGTGGATG CGGAGCAGTT TGGTGGCCAG CAGATGACGG TGAACTACCA TATCCGAGGT
CGCATCATCC AGGTGCCGTC AAACTATGAC CCGGAAAAAC GCACGTACAG CGGCATCTGG
GACGGCAGCC TGAAACCGGC ATACAGCAAC AACCCGGCCT GGTGCCTGTG GGACATGCTG
ACTCACCCGC GCTACGGCAT GGGAAAACGC CTGGGGGCGG CGGATGTGGA CAAGTGGGCG
CTGTATGCCA TTGCGCAGTA CTGCGACCAG ATGGTCCCTG ATGGTTTCGG GGGCACAGAG
CCGCGGATGA CCTTTAATGC GTACCTGTCA CAACAGCGTA AGGCGTGGGA CGTTCTCAGT
GATTTCTGCT CGGCGATGCG CTGTATGCCG GTATGGAACG GCCAGACGCT GACGTTCGTT
CAGGACAGCC CGTCGGATGT GGTGTGGCCG TACACCAACA GTGATGTGGT GGTGGATGAT
AACGGCGTGG GGTTTCGCTA CAGCTTCAGC GCCCTGAAGG ACCGCCACAC GGCGGTGGAG
GTGAATTACA CCGACCCGCA GAACGGCTGG CAGACCTCCA CGGAACTGGT GGAAGACCCG
GAAGCCATAC TGCGCTACGG ACGCAACCTG CTGAAGATGG ATGCGTTCGG CTGCACCAGT
CGCGGTCAGG CCCACCGTGC CGGGCTGTGG GTGATAAAGA CCGGACTGCT GGAAACGCAG
ACGGTGGATT TCACGCTCGG GTCACAGGGG CTGCGTCACA CACCCGGTGA CATTATTGAA
ATCTGTGATA ACGACTATGC CGGGACCATG ACCGGCGGAC GTGTCCTGTC CATCGATGCC
GCCAGCCGCA CCCTGACGCT GGACCGTGAG GTGACACTGC CGGAGACCGG TGCCGCCACG
GTGAACCTGA TTAACGGCAG CGGTAAGCCG GTGAGTGTGG ACATCACCGC ACACCCCGCG
CCGGACCGGA TACAGGTCAG CACCCTGCCT GATGGTGTGG AGACATACGG TGTATGGGGA
CTATCCCTGC CGTCACTGCG CCGTCGCCTG TTCCGCTGTG TCTCCGTCCG GGAAAACACG
GACGGCACCT TTGCCATCAC GGCGGTGCAG CACGTACCGG AAAAAGAAGC CATCGTGGAT
AACGGTGCCC GCTTTGAGCC GCAGTCAGGC ACCCTGAACA GCGTCATCCC TCCGGCAGTG
CAGCACCTGA CGGTGGAGGT GAGCGCAGCT GACGGCCAGT ATCTGGCGCA GGTGAAATGG
GACACGCCGC GGGTGGTGAA GGGCGTGCGC TTCAGTCTGC GCCTGACCAG CGGAAGCGGA
GAAGGCAGCC GTCTGGTGAC CACCGCCATC ACTGCGGATA CAGAGCATCG TTCCAGTGGT
CTGCCGCCGG GGGAATACAC CCTGACGGTC AGGGCGATTA ACAGTTATGG CCAGCAGGGG
GAACCGGCCA CCACCACGTT CAGGATTAAT GCACCTGCGG TACCCGCCAC GATTGAGCTG
ACACCGGGCT ATTTTCAGAT AACAGCGGTC CCGCGTCTTG CGGTGTATGA CCCGACGGTA
CAGTTTGAGT TCTGGTTTTC GGAGACAAAA ATCGCAGATA CATCTCAGGT GGAAACCTCT
GCCCGTTATC TGGGGACCGG CAGTCAGTGG ACTGTCCAGG GAAGCCGGAT TAAGCCGGGG
ACGGATTTCT GGTTTTACGT GCGCAGCGTC AACCTGGTGG GGAAATCTGC GTTTGTGGAA
GTCAGCGGGC AGCCCAGCAA TGATGGTGAA GGGTATCTGG AATTTTTCCG GGAAAAAATA
GGAAAACTGC ATCTGGCTCA GGGGCTATGG GAGCTGATAG ACAACAGCCA GCTTGCGGAT
GAGATGGCGG AGATGAAGAC CACCATCACG GAAACCCGCA ATGAAATCAC ACAGACGGTC
AGTAAAACGC TGGAGAACCA GAGCGCCATC ATACAGCAGA TACAGCGCGT GCAGAAGGAC
ACAAATGATG ACCTTGCTGC ACTTTACATG CTGAAGGTAC AGAAAACAAA AAATGGCATA
CCCTATGTTG CCGGTATTGG AGCGGGGATT GAGGATACTG ATGGCCAGCC CCTGAGCAAC
ATACTGCTGC TGGCTGACCG TATTGCGATG ATTAACCCGG AGGACGGCAA CACCACGCCG
TTATTTGTGG CGCAGGGGAA TCAGTTGTTC ATGAACGATG TGTTCCTGAA ACGACTGTTT
GCGGTGAGTA TCACGTCATC CGCCAATCCC CCGACGTTTT CCCTGACGCC GGAGGGCAGG
CTGACCGCAA GAAATGCTGA TATCAGCGGT AACGTGAATG CGAATTCCGG GACGCTCAAC
AACGTCACGA TTAACGAGAA CTGTCGGGTT CTGGGAAAAC TGTCCGCGAA CCAGATTGAA
GGCGATCTCG TTAAAACAGT GGGCAAAGCT TTCCCCCGGG ATTCCCGTGC ACCGGAGCGG
TGGCCATCAG GGACCATTAC CGTCAGGGTT TATGACGATC AGCCGTTTGA CCGGCAGATT
GTTATTCCGG CGGTGGCATT CAGCGGCGCT AAACATGAGA GAGAGCATAC TGATATTTAC
TCCTCATGCC GTCTGATAGT GCGGAAAAAC GGTGCTGAAA TTTATAACCG TACCGCGCTG
GATAATACGC TGATTTACAG TGGCGTTATT GATATGCCTG CCGGTCACGG TCACATGACG
CTGGAGTTTT CGGTGTCAGC ATGGCTGGTG AATAACTGGT ATCCCACAGC AAGTATCAGC
GATTTGCTGG TTGTGGTGAT GAAGAAAGCC ACCGCAGGCA TCAGTATCAG CTGA
 
Protein sequence
MGKGGGRAHT PVEAKDNLKS TQMMSVIDAI GEGPIEGPVK GLQSILVNKT PLTDTDGNPV 
IHGVTAVWRA GEQEQTPPEG FESSGAETAL GVEVTKAKPV TRTITSANID RLRVTFGVQS
LLETTSKGDR NPSSVRLLIQ LQRNGNWVTE KDVTINGKTT SQFLASVILD NLPPRPFNIR
MVRETADSTS DQLQNKTLWS SYTEIIDVKQ CYPNTAIVGL QVDAEQFGGQ QMTVNYHIRG
RIIQVPSNYD PEKRTYSGIW DGSLKPAYSN NPAWCLWDML THPRYGMGKR LGAADVDKWA
LYAIAQYCDQ MVPDGFGGTE PRMTFNAYLS QQRKAWDVLS DFCSAMRCMP VWNGQTLTFV
QDSPSDVVWP YTNSDVVVDD NGVGFRYSFS ALKDRHTAVE VNYTDPQNGW QTSTELVEDP
EAILRYGRNL LKMDAFGCTS RGQAHRAGLW VIKTGLLETQ TVDFTLGSQG LRHTPGDIIE
ICDNDYAGTM TGGRVLSIDA ASRTLTLDRE VTLPETGAAT VNLINGSGKP VSVDITAHPA
PDRIQVSTLP DGVETYGVWG LSLPSLRRRL FRCVSVRENT DGTFAITAVQ HVPEKEAIVD
NGARFEPQSG TLNSVIPPAV QHLTVEVSAA DGQYLAQVKW DTPRVVKGVR FSLRLTSGSG
EGSRLVTTAI TADTEHRSSG LPPGEYTLTV RAINSYGQQG EPATTTFRIN APAVPATIEL
TPGYFQITAV PRLAVYDPTV QFEFWFSETK IADTSQVETS ARYLGTGSQW TVQGSRIKPG
TDFWFYVRSV NLVGKSAFVE VSGQPSNDGE GYLEFFREKI GKLHLAQGLW ELIDNSQLAD
EMAEMKTTIT ETRNEITQTV SKTLENQSAI IQQIQRVQKD TNDDLAALYM LKVQKTKNGI
PYVAGIGAGI EDTDGQPLSN ILLLADRIAM INPEDGNTTP LFVAQGNQLF MNDVFLKRLF
AVSITSSANP PTFSLTPEGR LTARNADISG NVNANSGTLN NVTINENCRV LGKLSANQIE
GDLVKTVGKA FPRDSRAPER WPSGTITVRV YDDQPFDRQI VIPAVAFSGA KHEREHTDIY
SSCRLIVRKN GAEIYNRTAL DNTLIYSGVI DMPAGHGHMT LEFSVSAWLV NNWYPTASIS
DLLVVVMKKA TAGISIS
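
The two listings above are related by translation table 11, and the GC content field is derived from the gene sequence. The following is a minimal sketch of how the blocked text could be cleaned, its GC fraction computed, and the gene translated. It assumes the sequence is pasted in as a plain string. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Table 11 uses the same codon-to-residue assignments as the standard code and differs only in its permitted start codons. This gene starts with ATG, so no special handling of the first codon is needed here.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listed in this record.
first_line = clean("ATGGGTAAAG GTGGCGGCAG GGCGCACACG CCGGTTGAGG CAAAGGACAA TCTTAAGTCC")
print(translate(first_line))  # MGKGGGRAHTPVEAKDNLKS
```

The printed residues match the first 20 residues of the protein sequence above. Running `gc_fraction` on the full cleaned gene sequence gives a value to compare against the GC content field.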