Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1201 |
Symbol | |
ID | 6966837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1208230 |
End bp | 1211703 |
Gene Length | 3474 bp |
Protein Length | 1157 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643385198 |
Product | hypothetical protein |
Protein accession | YP_002269694 |
Protein GI | 209397821 |
COG category | [S] Function unknown |
COG ID | [COG4733] Phage-related protein, tail component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAAAG GTGGCGGCAG GGCGCACACG CCGGTTGAGG CAAAGGACAA TCTTAAGTCC ACGCAGATGA TGAGCGTGAT TGATGCCATT GGTGAAGGGC CGATTGAAGG TCCGGTGAAG GGGCTGCAGA GTATCCTGGT GAACAAAACC CCGCTGACGG ACACGGACGG TAATCCTGTG ATACATGGTG TGACAGCGGT CTGGCGCGCC GGGGAGCAGG AGCAGACACC ACCTGAAGGC TTTGAGTCTT CCGGAGCTGA AACCGCACTG GGCGTGGAAG TGACGAAGGC AAAGCCGGTG ACGCGCACCA TTACATCCGC GAACATTGAC CGCCTGCGGG TCACCTTCGG GGTGCAGTCA CTGTTGGAGA CCACCTCAAA GGGCGACCGT AATCCCTCTT CTGTCCGACT GCTGATTCAG TTGCAGCGTA ACGGTAACTG GGTGACGGAA AAGGATGTCA CCATTAACGG CAAGACCACC TCGCAGTTTC TGGCGTCGGT GATTCTGGAT AATCTGCCTC CCCGTCCTTT TAACATCCGG ATGGTCCGGG AGACGGCGGA CAGCACCTCG GACCAGCTGC AGAATAAGAC GCTCTGGTCG TCATACACCG AAATCATCGA TGTGAAACAG TGCTACCCGA ACACGGCGAT TGTGGGGCTG CAGGTGGATG CGGAGCAGTT TGGTGGCCAG CAGATGACGG TGAACTACCA TATCCGAGGT CGCATCATCC AGGTGCCGTC AAACTATGAC CCGGAAAAAC GCACGTACAG CGGCATCTGG GACGGCAGCC TGAAACCGGC ATACAGCAAC AACCCGGCCT GGTGCCTGTG GGACATGCTG ACTCACCCGC GCTACGGCAT GGGAAAACGC CTGGGGGCGG CGGATGTGGA CAAGTGGGCG CTGTATGCCA TTGCGCAGTA CTGCGACCAG ATGGTCCCTG ATGGTTTCGG GGGCACAGAG CCGCGGATGA CCTTTAATGC GTACCTGTCA CAACAGCGTA AGGCGTGGGA CGTTCTCAGT GATTTCTGCT CGGCGATGCG CTGTATGCCG GTATGGAACG GCCAGACGCT GACGTTCGTT CAGGACAGCC CGTCGGATGT GGTGTGGCCG TACACCAACA GTGATGTGGT GGTGGATGAT AACGGCGTGG GGTTTCGCTA CAGCTTCAGC GCCCTGAAGG ACCGCCACAC GGCGGTGGAG GTGAATTACA CCGACCCGCA GAACGGCTGG CAGACCTCCA CGGAACTGGT GGAAGACCCG GAAGCCATAC TGCGCTACGG ACGCAACCTG CTGAAGATGG ATGCGTTCGG CTGCACCAGT CGCGGTCAGG CCCACCGTGC CGGGCTGTGG GTGATAAAGA CCGGACTGCT GGAAACGCAG ACGGTGGATT TCACGCTCGG GTCACAGGGG CTGCGTCACA CACCCGGTGA CATTATTGAA ATCTGTGATA ACGACTATGC CGGGACCATG ACCGGCGGAC GTGTCCTGTC CATCGATGCC GCCAGCCGCA CCCTGACGCT GGACCGTGAG GTGACACTGC CGGAGACCGG TGCCGCCACG GTGAACCTGA TTAACGGCAG CGGTAAGCCG GTGAGTGTGG ACATCACCGC ACACCCCGCG CCGGACCGGA TACAGGTCAG CACCCTGCCT GATGGTGTGG AGACATACGG TGTATGGGGA CTATCCCTGC CGTCACTGCG CCGTCGCCTG TTCCGCTGTG TCTCCGTCCG GGAAAACACG GACGGCACCT TTGCCATCAC GGCGGTGCAG CACGTACCGG AAAAAGAAGC CATCGTGGAT AACGGTGCCC GCTTTGAGCC GCAGTCAGGC ACCCTGAACA GCGTCATCCC TCCGGCAGTG CAGCACCTGA CGGTGGAGGT GAGCGCAGCT GACGGCCAGT ATCTGGCGCA GGTGAAATGG GACACGCCGC GGGTGGTGAA GGGCGTGCGC TTCAGTCTGC GCCTGACCAG CGGAAGCGGA GAAGGCAGCC GTCTGGTGAC CACCGCCATC ACTGCGGATA CAGAGCATCG TTCCAGTGGT CTGCCGCCGG GGGAATACAC CCTGACGGTC AGGGCGATTA ACAGTTATGG CCAGCAGGGG GAACCGGCCA CCACCACGTT CAGGATTAAT GCACCTGCGG TACCCGCCAC GATTGAGCTG ACACCGGGCT ATTTTCAGAT AACAGCGGTC CCGCGTCTTG CGGTGTATGA CCCGACGGTA CAGTTTGAGT TCTGGTTTTC GGAGACAAAA ATCGCAGATA CATCTCAGGT GGAAACCTCT GCCCGTTATC TGGGGACCGG CAGTCAGTGG ACTGTCCAGG GAAGCCGGAT TAAGCCGGGG ACGGATTTCT GGTTTTACGT GCGCAGCGTC AACCTGGTGG GGAAATCTGC GTTTGTGGAA GTCAGCGGGC AGCCCAGCAA TGATGGTGAA GGGTATCTGG AATTTTTCCG GGAAAAAATA GGAAAACTGC ATCTGGCTCA GGGGCTATGG GAGCTGATAG ACAACAGCCA GCTTGCGGAT GAGATGGCGG AGATGAAGAC CACCATCACG GAAACCCGCA ATGAAATCAC ACAGACGGTC AGTAAAACGC TGGAGAACCA GAGCGCCATC ATACAGCAGA TACAGCGCGT GCAGAAGGAC ACAAATGATG ACCTTGCTGC ACTTTACATG CTGAAGGTAC AGAAAACAAA AAATGGCATA CCCTATGTTG CCGGTATTGG AGCGGGGATT GAGGATACTG ATGGCCAGCC CCTGAGCAAC ATACTGCTGC TGGCTGACCG TATTGCGATG ATTAACCCGG AGGACGGCAA CACCACGCCG TTATTTGTGG CGCAGGGGAA TCAGTTGTTC ATGAACGATG TGTTCCTGAA ACGACTGTTT GCGGTGAGTA TCACGTCATC CGCCAATCCC CCGACGTTTT CCCTGACGCC GGAGGGCAGG CTGACCGCAA GAAATGCTGA TATCAGCGGT AACGTGAATG CGAATTCCGG GACGCTCAAC AACGTCACGA TTAACGAGAA CTGTCGGGTT CTGGGAAAAC TGTCCGCGAA CCAGATTGAA GGCGATCTCG TTAAAACAGT GGGCAAAGCT TTCCCCCGGG ATTCCCGTGC ACCGGAGCGG TGGCCATCAG GGACCATTAC CGTCAGGGTT TATGACGATC AGCCGTTTGA CCGGCAGATT GTTATTCCGG CGGTGGCATT CAGCGGCGCT AAACATGAGA GAGAGCATAC TGATATTTAC TCCTCATGCC GTCTGATAGT GCGGAAAAAC GGTGCTGAAA TTTATAACCG TACCGCGCTG GATAATACGC TGATTTACAG TGGCGTTATT GATATGCCTG CCGGTCACGG TCACATGACG CTGGAGTTTT CGGTGTCAGC ATGGCTGGTG AATAACTGGT ATCCCACAGC AAGTATCAGC GATTTGCTGG TTGTGGTGAT GAAGAAAGCC ACCGCAGGCA TCAGTATCAG CTGA
|
Protein sequence | MGKGGGRAHT PVEAKDNLKS TQMMSVIDAI GEGPIEGPVK GLQSILVNKT PLTDTDGNPV IHGVTAVWRA GEQEQTPPEG FESSGAETAL GVEVTKAKPV TRTITSANID RLRVTFGVQS LLETTSKGDR NPSSVRLLIQ LQRNGNWVTE KDVTINGKTT SQFLASVILD NLPPRPFNIR MVRETADSTS DQLQNKTLWS SYTEIIDVKQ CYPNTAIVGL QVDAEQFGGQ QMTVNYHIRG RIIQVPSNYD PEKRTYSGIW DGSLKPAYSN NPAWCLWDML THPRYGMGKR LGAADVDKWA LYAIAQYCDQ MVPDGFGGTE PRMTFNAYLS QQRKAWDVLS DFCSAMRCMP VWNGQTLTFV QDSPSDVVWP YTNSDVVVDD NGVGFRYSFS ALKDRHTAVE VNYTDPQNGW QTSTELVEDP EAILRYGRNL LKMDAFGCTS RGQAHRAGLW VIKTGLLETQ TVDFTLGSQG LRHTPGDIIE ICDNDYAGTM TGGRVLSIDA ASRTLTLDRE VTLPETGAAT VNLINGSGKP VSVDITAHPA PDRIQVSTLP DGVETYGVWG LSLPSLRRRL FRCVSVRENT DGTFAITAVQ HVPEKEAIVD NGARFEPQSG TLNSVIPPAV QHLTVEVSAA DGQYLAQVKW DTPRVVKGVR FSLRLTSGSG EGSRLVTTAI TADTEHRSSG LPPGEYTLTV RAINSYGQQG EPATTTFRIN APAVPATIEL TPGYFQITAV PRLAVYDPTV QFEFWFSETK IADTSQVETS ARYLGTGSQW TVQGSRIKPG TDFWFYVRSV NLVGKSAFVE VSGQPSNDGE GYLEFFREKI GKLHLAQGLW ELIDNSQLAD EMAEMKTTIT ETRNEITQTV SKTLENQSAI IQQIQRVQKD TNDDLAALYM LKVQKTKNGI PYVAGIGAGI EDTDGQPLSN ILLLADRIAM INPEDGNTTP LFVAQGNQLF MNDVFLKRLF AVSITSSANP PTFSLTPEGR LTARNADISG NVNANSGTLN NVTINENCRV LGKLSANQIE GDLVKTVGKA FPRDSRAPER WPSGTITVRV YDDQPFDRQI VIPAVAFSGA KHEREHTDIY SSCRLIVRKN GAEIYNRTAL DNTLIYSGVI DMPAGHGHMT LEFSVSAWLV NNWYPTASIS DLLVVVMKKA TAGISIS
|
| |