Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0913 |
Symbol | |
ID | 6969975 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 922079 |
End bp | 925492 |
Gene Length | 3414 bp |
Protein Length | 1137 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643384935 |
Product | hypothetical protein |
Protein accession | YP_002269435 |
Protein GI | 209399520 |
COG category | [S] Function unknown |
COG ID | [COG4733] Phage-related protein, tail component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAAAG GAAGCAGTAA GGGGCATACC CCGCGCGAAG CGAAGGACAA CCTGAAGTCC ACGCAGCTGC TGAGTGTGAT CGATGCCATC AGCGAAGGAC CGATTGAAGG TCCGGTGGAT GGATTAAAAA GCGTGCTGCT GAACAGTACG CCGGTGCTGG ACAGTGAAGG TAATACCAAC ATCTCCGGTG TCACGGTGGT GTTCCGGGCC GGTGAGCAGG AGCAGACACC GCCGGAGGGA TTTGAATCCT CCGGTTCTGA GACGGTGCTG GGTACGGAAG TGAAATACGA CACGCCGATC ACCCGCACCA TCACGTCGGC AAACATCGAT CGTCTGCGCT TTACTTTCGG TGTGCAGGCA CTGCGGGAAA CCACCTCAAA GGGGGACCGG AATCCGTCGG AAGTCCGCCT GCTGGTTCAG ATACAGCGTA ATGGTGGCTG GGTGACGGAA AAAGACATCA CCATTAAGGG CAAAACCACG TCGCAGTATC TGGCCTCGGT GGTGGTGGAT AACCTGCCGC CGCGCCCGTT TAATATCCGG ATGCGCAGGA TGACGCCGGA CAGCACCACA GACCAGCTGC AGAACAAAAC GCTCTGGTCG TCATACACCG AAATCATCGA TGTGAAACAG TGCTACCCGA ACACGGCACT GGTTGGCGTG CAGGTGGACT CGGAGCAGTT TGGCAGTCAG CAGGTGAGCC GTAATTATCA TCTTCGCGGG CGCATTCTGC AGGTGCCGTC AAACTATGAT CCGGAAAAAC GCACTTACAG CGGCATCTGG GACGGAACGT TAAAACCGGC ATACAGCAAC AACATGGCCT GGTGTCTGTG GGATATGCTG ACCCACCCGC GCTACGGCAT GGGGAAACGT CTTGGTGCGG CGGATGTGGA TAAATGGGCG CTGTATGTCA TCGGCCAGTA CTGCGACCAG TCAGTACCGG ACGGCTTTGG CGGCACGGAG CCGCGCATCA CCTGTAATGC GTACCTGACC ACGCAGCGCA AGGCGTGGGA TGTGCTCAGT GATTTCTGCT CGGCGATGCG CTGTATGCCG GTATGGAACG GGCAGACGCT GACGTTCGTG CAGGACCGGC CGTCGGATAA GGTGTGGACC TATAACCGCA GTAATGTGGT GATGCCGGAT GATGGCGCGC CGTTCCGCTA CAGCTTTAGC GCCCTGAAAG ACCGCCATAA TGCCGTTGAG GTGAACTGGA CTGACCCGGA CAACGGCTGG GAGACGGCGA CAGAGCTTGT GGAGGACACG CAGGCCATTG CCCGTTACGG TCGTAACGTC ACGAAGATGG ATGCCTTTGG CTGTACCAGT CGGGGGCAGG CACACCGTGC CGGGCTGTGG CTGATTAAAA CGGAACTGCT GGAAACGCAG ACCGTGGACT TCAGCGTGGG CGCAGAAGGG CTTCGCCATG TACCGGGCGA TGTCATTGAA ATCTGTGATG ATGACTATGC GGGGATCAGC ATCGGTGGGC GTGTGCTGGC GGTGAACAGC CAGACCCGGA CGCTGACGCT CGACCGTGAA ATCACGCTGC CATCCTCCGG CACCACGCTG ATAAGCCTGG TTGACGGAAG TGGCAATCCG GTCAGCGTGG AGGTTCAGTC CGTCACCGAC GGCGTGAAGG TAAAAGTGAG CCGGGTTCCT GACGGTGTTG CTGAATACAG CGTGTGGGGG CTGAAGCTGC CGACGCTGCG CCAGCGCCTG TTCCGCTGCG TGAGTATCCG TGAGAACGAC GACGGCACGT ATGCCATCAC TGCCGTGCAG CATGTGCCGG AGAAAGAGGG CATCGTGGAT AACGGGGCGC ACTTTGACGG TGACCAGAGC AGCACGGTGA ATGGTGTCAC GCCGCCAGCG GTGCAGCACC TGACCGCCGA AGTCTCCGCA GACAGCGGGG AATATCAGGT GCTGGCGCGA TGGGACACGC CGAAGGTGGT GAAGGGTGTG AGCTTCCTGC TTCGCCTGAC CGTGGCAGCG GATGACGGCA GTGAGCGGCT GGTCAGCACG GCCCGGACGA CGGAAACCAC ATACCGCTTC ACGCAGCTGG CGCTGGGGAA CTACAGGCTG ACAGTCCGGG CGGTAAATGC GTGGGGACAG CAGGGCGATC CGGCGTCGGT ATCGTTCCGG ATTGCCGCAC CGGCAGCGCC GTCACAGATT GAGCTGACAC CGGGCTATTT TCAGATAACC GCCACGCCGC ATCTTGCGGT TTATGATCCG ACGGTACAGT TTGAGTTCTG GTTCTCGGAA ACGCGGATTG CGGATATCAG GCAGGTTGAA ACCAGCGCGC GTTATCTTGG TACGGCGCTG TACTGGATAG CCGCCAGTAT CAATATCAAA CCGGGCCATG ATTATTACTT TTATATCCGC AGTGTGAACA CCGTTGGCAA ATCGGCATTC GTGGAGGCCG TCGGTCGGGC GAGCGATGAT GCGGAAGGTT ATCTGGATTT TTTCAAAGGC AAGATAACCG AATCTCATCT CGGCAAGGAG CTGCTGGAAA AAGTCGATCT GACGGAGGAT AACGCCAGCA GACTGGATGA GTTTTCGAAA GAGTGGAAGG ACGCTAACGA TAAATGGAAT GCCATGTGGG GCGTCAAAAT TGAGCAGACC AAAGACGGCA AACATTATGT CGCGGGTATT GGCCTCAGCA TGGAGGACAC GGAGGAAGGC AAGCTGAGCC AGTTTCTGGT TGCCGCCAAT CGTATCGCGT TTATTGACCC GGCAAACGGG AATGAAACGC CGATGTTTGT GGCGCAGGGC AACCAGATAT TCATGAACGA CGTGTTCCTG AAGCGCCTGA CGGCCCCCAC CATTACCAGC GGTGGAAATC CACCGGTATT TTCCCTGACA TCAGACGGAA AGCTGACCGC TAAAAATGCG GATATCAGTG GCAGTGTGAA TGCGAACTCC GGGACGCTCA ACAACGTCAC GGTAAATGAA AACTGTACGA TTAAGGGCAT GCTGGAGGCG ACTCAGGTCA GAGGTGACTT CGTTAAAGCT GTATCCAAAT CATTTCCGAA ACAGGCTGGT ACGTGGGGTA ACACGGAAAC ACCAAACGGG ACGGTTACAG TCACCATCAG CGATGATCAT AACTTTGACC GTCAAATCAT TATTCCGCCC ATTATCTTTA ACGGAATAGC GTATAGCGAT CCGGGAAGTG GTAATAACCC GGGAGGTACA AGATACACGG GTTATGGTTT TGAAGTTCGC AAAAACGGTG TATTAATCGC ATCCAGAGAA ACTAAAGGGG CCATTCCCGG TAGCTACAGT GCGGTTATTG ATATGCCGAG TGGCAGGGGA AGCGTCACTC TGGAGTTTAA GGTTTTCCAT AAAGGCAATC AGCGGGCAGG TAATATCACC GACTGTACGG TGATTGTGAC CAAAAAAGCG GCTTCCGGCA TCAGTATCCG TTGA
|
Protein sequence | MGKGSSKGHT PREAKDNLKS TQLLSVIDAI SEGPIEGPVD GLKSVLLNST PVLDSEGNTN ISGVTVVFRA GEQEQTPPEG FESSGSETVL GTEVKYDTPI TRTITSANID RLRFTFGVQA LRETTSKGDR NPSEVRLLVQ IQRNGGWVTE KDITIKGKTT SQYLASVVVD NLPPRPFNIR MRRMTPDSTT DQLQNKTLWS SYTEIIDVKQ CYPNTALVGV QVDSEQFGSQ QVSRNYHLRG RILQVPSNYD PEKRTYSGIW DGTLKPAYSN NMAWCLWDML THPRYGMGKR LGAADVDKWA LYVIGQYCDQ SVPDGFGGTE PRITCNAYLT TQRKAWDVLS DFCSAMRCMP VWNGQTLTFV QDRPSDKVWT YNRSNVVMPD DGAPFRYSFS ALKDRHNAVE VNWTDPDNGW ETATELVEDT QAIARYGRNV TKMDAFGCTS RGQAHRAGLW LIKTELLETQ TVDFSVGAEG LRHVPGDVIE ICDDDYAGIS IGGRVLAVNS QTRTLTLDRE ITLPSSGTTL ISLVDGSGNP VSVEVQSVTD GVKVKVSRVP DGVAEYSVWG LKLPTLRQRL FRCVSIREND DGTYAITAVQ HVPEKEGIVD NGAHFDGDQS STVNGVTPPA VQHLTAEVSA DSGEYQVLAR WDTPKVVKGV SFLLRLTVAA DDGSERLVST ARTTETTYRF TQLALGNYRL TVRAVNAWGQ QGDPASVSFR IAAPAAPSQI ELTPGYFQIT ATPHLAVYDP TVQFEFWFSE TRIADIRQVE TSARYLGTAL YWIAASINIK PGHDYYFYIR SVNTVGKSAF VEAVGRASDD AEGYLDFFKG KITESHLGKE LLEKVDLTED NASRLDEFSK EWKDANDKWN AMWGVKIEQT KDGKHYVAGI GLSMEDTEEG KLSQFLVAAN RIAFIDPANG NETPMFVAQG NQIFMNDVFL KRLTAPTITS GGNPPVFSLT SDGKLTAKNA DISGSVNANS GTLNNVTVNE NCTIKGMLEA TQVRGDFVKA VSKSFPKQAG TWGNTETPNG TVTVTISDDH NFDRQIIIPP IIFNGIAYSD PGSGNNPGGT RYTGYGFEVR KNGVLIASRE TKGAIPGSYS AVIDMPSGRG SVTLEFKVFH KGNQRAGNIT DCTVIVTKKA ASGISIR
|
| |