Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2873 |
Symbol | |
ID | 6967166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2668207 |
End bp | 2671686 |
Gene Length | 3480 bp |
Protein Length | 1159 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643386719 |
Product | hypothetical protein |
Protein accession | YP_002271190 |
Protein GI | 209399897 |
COG category | [S] Function unknown |
COG ID | [COG4733] Phage-related protein, tail component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.00489128 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGCAAAG GTGGTGGCAA GGCGCACACG CCGGTTGAGG CAAAGGACAA TCTTAAGTCC ACGCAGATGA TGAGCGTGAT TGATGCGATT GGTGAAGGGC CGATTGAAGG TCCGGTGAAG GGGCTGCAGA GTATCCTGGT GAACAAAACC CCGCTGACGG ACACGGACGG TAATCCTGTG ATACATGGTG TGACAGCGGT CTGGCGCGCC GGGGAGCAGG AGCAGACACC ACCTGAAGGC TTTGAGTCCT CCGGAGCTGA AACCGCACTG GGCGTGGAAG TGACGAAGGC AAAGCCGGTG ACGCGCACAA TTACGTCCGC GAACATTGAC CGCCTGCGGG TCACCTTCGG GGTGCAGTCA CTGGTGCAGA CCACCTCACA GGGTGACCGT AACCCGGCAT CCGTCCGACT GCTGATTCAG TTGCAGCGTA ACGGTAACTG GGTGACGGAA AAGGATGTCA CCATTAACGG CAAGACCACC TCGCAGTTCC TGGCGTCGGT GATTCTGGAT AATCTGCCGC CCCGGCCCTT TAACATCCGG ATGGTCAGGG AGACGGCGGA CAGCACCACG GACCAGCTGC AGAACAGAAC GCTGTGGTCG TCATACACCG AAATCATCGA TGTGAAACAG TGCTACCCGA ACACGGCCAT TGTGGGGCTG CAGGTGGATG CGGAGCAGTT TGGCGGTCAG CAGATGACGG TGAACTACCA TATCCGCGGT CGCATCATCC AGGTGCCGTC AAACTATGAC CCGGAAAAAC GCACGTACAG CGGCATCTGG GACGGCAGCC TGAAACCGGC ATACAGCAAC AACCCTGCCT GGTGCCTGTG GGACATGCTG ACCCACCCGC GCTACGGAAT GGGAAAACGC CTGGGGGCGG CGGATGTGGA CAAGTGGGCG CTGTATGCCA TTGCGCAGTA CTGCGACCAG ACGGTCCCGG ATGGTTTCGG GGGCACAGAG CCGCGGATGA CTTTCAATGC GTACCTGTCA CAACAGCGTA AGGCGTGGGA CGTTCTCAGT GATTTCTGCT CGGCGATGCG CTGTATGCCG GTATGGAACG GGCAGACGCT GACGTTTGTG CAGGACCGTC CGTCAGATGT GGTGTGGCCC TACACCAGCA GTGATGTGGT GGTGGATGAT AACGGCGTGG GGTTTCGCTA CAGCTTCAGC GCCCTGAAGG ACCGCCACAC GGCGGTGGAG GTGAATTACA CCGACCCGCA GAACGGCTGG CAGACCTCCA CGGAACTGGT GGAAGACCCG GAAGCCATAC TGCGCTACGG GCGCAACCTG CTGAAGATGG ATGCGTTCGG CTGCACCAGT CGCGGTCAGG CCCACCGTGC CGGGCTGTGG GTGATAAAGA CCGGACTGCT GGAAACGCAG ACGGTGGATT TCACGCTCGG GTCTCAGGGG CTGCGTCACA CACCCGGTGA CATTATTGAA ATCTGTGATA ACGACTATGC CGGGACCATG ACCGGCGGAC GTGTCCTGTC CATCGATGCC GCCAGCCGCA CCCTGACACT GGACCGTGAG GTGACCCTGC CGGAGACAGG TACACCGACG GTGAACCTGA TTAACGGCAG CGGTAAGCCG GTGAGCGTGG CCATCACTGC ACACCCCGCG CCGGACCGGA TACAGGTCAG CACCCTGCCG GATGGCGTGG AGACATACGG TGTATGGGGA CTCTCCCTGC CGTCACTGCG TCGTCGCCTG TTCCGCTGTG TCTCCATCCG GGAAAACACG GACGGCACCT TTGCCATCAC GGCAGTGCAG CACGTACCGG AAAAAGAAGC CATCGTGGAT AACGGGGCGC ACTTTGACGG CGACCAGAGC GGCACCCTGA ACAGCGTCAT CCCTCCGGCA GTGCAGCACC TGACGGTGGA GGTGAGTGCA GCTGACAGCC AGTATCTGGC GCAGGCGAAA TGGGACACGC CGCGGGTGGT GAAGGGCGTG CGCTTCAGTC TGCGCCTGAC CAGTGGAAGC GGTCAGGACA GCCGTCTGGT GACCACCGCC ATCACTGCGG ATACAGAGCA TCGTTTCAGT GGTCTGCCGC TCGGGGAATA CACCCTGACA GTCAGGGCAA TTAACAGTTA TGGCCAGCAG GGCGAACCGG CCACCACCAC CTTCCGGATT AACGCGCCAG CAAAACCCGC CACCATTGAA CTGACGCCGG GGTATTTTCA GATAACGGCG GTACCGGTGC TGGCGGTGTA TGACCCGACG GTGCAGTTTG AGTTCTGGTT TTCGGAAAAA CGCATCACGA ACACGGCACA GGTGGAAAAA TCTGCCCGTT ATCTGGGGAG CGGCAGTCAG TGGACTGTCC AGGGAAGCCG GATTAAGCCG GGGACGGATT TCTGGTTTTA CGTGCGCAGC GTCAACCTGG TGGGGAAATC TGCGTTTGTG GAAGTCAGCG GGCAGCCCAG CAATGATGGT GAAGGGTATC TGGAATTTTT CCGGGAAAAA ATAGGAAAAC TGCATCTGGC TCAGGGGCTA TGGGAGCTGA TAGACAACAG CCAGCTTGCG GATGAGATGG CGGAGATGAA GACCACCATC ACGGAAACCC GCAATGAAAT CACACAGACG GTCAGTAAAA CGCTGGAGAA CCAGAGCGCC ACTATACAGC AGATACAGCG CGTGCAGAAG GACACAAATG ATGACCTGGC TGCGCTGTAC ATGCTGAAGG TTCAAAAAAC GAAAGACGGC ATTCCCTATG TGGCCGGGAT TGGTGCAGGG ATTGAGGATA CTGATGGCCA GCCACTGAGC AACATACTGC TGCTGGCTGA CCGTATCGCG ATGATAAATC CGGAGAGCGG CAACAGCACG CCGTTATTTG TGGCGCAGGG GAATCAGCTG TTCATGAACG ACGTGTTCCT GAAGCGACTG TTTGCGGTGA GTATCACCTC GTCCGGCAAT CCCCCGACGT TTTCCCTGAC GCCGGACGGG CGACTGACGG CGAAAAATGC GGATATCAGT GGCAGTGTGA ATGCGAACTC AGGGACGCTC AACAACGTCA CGATTAATGA GAACTGTCAG ATTAAGGGGA AACTGTCAGC CAACCAGATT GAAGGCGATA TTGTCAAAAC GGTCAGCAAG TCTTTCCCCC GCACGAGCAC TTATGCCAGT GGCACCATCA CGGTAAGAAT CAGTGATGAT CAGAAGTTTG ACCGGCAGGT CATGATACCG CCAGTGTTAT TCCGCGGTGG TAAGCATGAG AATTTCAACA GTAATAACCA ACAGTCATAC TGGTATTCAA CCTGCCGGTT AAGAGTGACC CGCAATGGTC AGGAGATTTT TAATCAGTCC ACGACGGATG CTCAGGGCGT ATTTTCCTCA GTTATAGATA TGCCTGCCGG ACAGGGGACG CTGACACTGA CATTCACCGT ATCTTCATCA GGAGCGAATA ACTGGACACC AACAACCAGT ATCAGCGATC TGCTGGTTGT GGTGATGAAA AAATCCACAG CAGGTATCAG TATCAGCTGA
|
Protein sequence | MGKGGGKAHT PVEAKDNLKS TQMMSVIDAI GEGPIEGPVK GLQSILVNKT PLTDTDGNPV IHGVTAVWRA GEQEQTPPEG FESSGAETAL GVEVTKAKPV TRTITSANID RLRVTFGVQS LVQTTSQGDR NPASVRLLIQ LQRNGNWVTE KDVTINGKTT SQFLASVILD NLPPRPFNIR MVRETADSTT DQLQNRTLWS SYTEIIDVKQ CYPNTAIVGL QVDAEQFGGQ QMTVNYHIRG RIIQVPSNYD PEKRTYSGIW DGSLKPAYSN NPAWCLWDML THPRYGMGKR LGAADVDKWA LYAIAQYCDQ TVPDGFGGTE PRMTFNAYLS QQRKAWDVLS DFCSAMRCMP VWNGQTLTFV QDRPSDVVWP YTSSDVVVDD NGVGFRYSFS ALKDRHTAVE VNYTDPQNGW QTSTELVEDP EAILRYGRNL LKMDAFGCTS RGQAHRAGLW VIKTGLLETQ TVDFTLGSQG LRHTPGDIIE ICDNDYAGTM TGGRVLSIDA ASRTLTLDRE VTLPETGTPT VNLINGSGKP VSVAITAHPA PDRIQVSTLP DGVETYGVWG LSLPSLRRRL FRCVSIRENT DGTFAITAVQ HVPEKEAIVD NGAHFDGDQS GTLNSVIPPA VQHLTVEVSA ADSQYLAQAK WDTPRVVKGV RFSLRLTSGS GQDSRLVTTA ITADTEHRFS GLPLGEYTLT VRAINSYGQQ GEPATTTFRI NAPAKPATIE LTPGYFQITA VPVLAVYDPT VQFEFWFSEK RITNTAQVEK SARYLGSGSQ WTVQGSRIKP GTDFWFYVRS VNLVGKSAFV EVSGQPSNDG EGYLEFFREK IGKLHLAQGL WELIDNSQLA DEMAEMKTTI TETRNEITQT VSKTLENQSA TIQQIQRVQK DTNDDLAALY MLKVQKTKDG IPYVAGIGAG IEDTDGQPLS NILLLADRIA MINPESGNST PLFVAQGNQL FMNDVFLKRL FAVSITSSGN PPTFSLTPDG RLTAKNADIS GSVNANSGTL NNVTINENCQ IKGKLSANQI EGDIVKTVSK SFPRTSTYAS GTITVRISDD QKFDRQVMIP PVLFRGGKHE NFNSNNQQSY WYSTCRLRVT RNGQEIFNQS TTDAQGVFSS VIDMPAGQGT LTLTFTVSSS GANNWTPTTS ISDLLVVVMK KSTAGISIS
|
| |