Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1802 |
Symbol | |
ID | 6971212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1716856 |
End bp | 1719711 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643385747 |
Product | hypothetical protein |
Protein accession | YP_002270237 |
Protein GI | 209395920 |
COG category | [S] Function unknown |
COG ID | [COG4733] Phage-related protein, tail component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.0582533 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGTAAAG GGGGCGGCAA GGGGCACACG CCGGTAGAGG CAAAGGACAA TCTTAAGTCC ACGCAGATGA TGAGCGTGAT TGACGCCATT GGTGAAGGGC CGATTGAAGG TCCGGTGAAG GGGCTGCAGA GTATTCTGGT GAACAAAACC CCGCTGACGG ACACGGACGG TAATCCCGTG ATACACGGTG TGACTGCGGT CTGGCGTGCC GGGGAGCAGG AGCAGACACC ACCGGAAGGC TTTGAGTCCT CCGGAGCTGA AACCGCACTG GGCGTGGAAG TGACGAAGGC AAAGCCGGTG ACGCGCACCA TTACGTCCGC GAACATTGAC CGCCTGCGGG TCACCTTCGG GGTGCAGTCA CTGTTGGAGA CCACCTCAAA GGGCGACCGT AATCACTCTT CTGTCCGACT GCTGATTCAG TTGCAGCGTA ACGGTAACTG GGTGACGGAA AAGGATGTCA CCATTAACGG CAAGACCACC TCGCAGTTCC TGGCGTCGGT GATTCTGGAT AATCTGCCGC CCCGGCCCTT TAACATCCGG ATGGTCAGGG AGACGGCGGA CAGCACCACG GACCAGCTGC AGAACAGAAC GCTGTGGTCG TCATACACCG AAATCATCGA TGTGAAACAG TGCTACCCGA ACACGGCCAT TGTGGGGCTG CAGGTGGATG CGGAGCAGTT TGGCGGTCAG CAGATGACGG TGAACTACCA TATCCGCGGT CGCATCATCC AGGTGCCGTC AAACTATGAC CCGGAAAAAC GCACGTACAG CGGCATCTGG GACGGCAGCC TGAAACCGGC ATACAGCAAC AACCCTGCCT GGTGCCTGTG GGACATGCTG ACCCACCCGC GCTACGGAAT GGGAAAACGC CTGGGGGCGG CGGATGTGGA CAAGTGGGCG CTGTATGCCA TTGTGACACT GCCGGAGACC GGTGCCGCCA CGGTGAACCT GATTAACGGC AGCGGTAAGC CGGTGAGTGT GGACATCACC GCACACCCCG CGCCGGACCG GATACAGGTC AGTACCCTGC CTGATGGTGT GGAGACATAC GGGGTGTGGG GACTCTCCCT GCCGTCACTG CGCCGTCGCC TGTTCCGCTG TGTCTCCGTC CGGGAAAACA CGGACGGCAC CTTTGCCATC ACGGCGGTGC AGCACGTACC GGAAAAAGAA GCCATCGTGG ATAACGGTGC CCGCTTTGAG CCGCAGTCAG GTTCCCTGAA CAGCGTCATC CCACCGGCAG TACAGCACCT GACGGTGGAG GTGAGTGCAG CTGACGGCCA GTATCTGGCG CAGGCTAAAT GGGACACGCC GCGGGTGGTG AAGGGCGTGC GCTTCAGTCT GCGCCTGACC AGTGGTAAGG GAACGGATGC CAGACTGGTG ACCACCGCCA TCACCGCAGA CACGGAGCAC CGTTTCAGCG GCCTGCCGCT CGGGGAATAC ACCCTGACGG TGCGGGCGAT AAACAGCTAT GGCCAGCAGG GTGAACCTGC CACCACCACC TTCCGGATTG CCGCACCGGC AGCACCGTCG CGGATTGAGC TGACGCCGGG CTATTTTCAG ATAACCGCCA CGCCGCATCT TGCCGTTTAT GACCCGACGG TACAGTTTGA GTTCTGGTTC TCGGAAAAGC GGATTGCGGA TATCAGGCAG GTTGAAACCG CAGCCCGCTA TCTTGGCTCG GCGCTGTACT GGATAGCTGC CAGTATCAAT ATCAAACCGG GCCATGATTA TTATTTTTAT ATCCGCAGTG TGAATACTGT TGGCAAATCG GCATTCGTGG AGGCTGTCGG TCGGGCGAGC GATGATGCGG AAGGTTACCT GGATTTTTTC AAAGGAGAAA TCGGGAAAAC ACATCTGGCC CAGGAGCTGT GGACGCAGAT TGATAACGGT CAGCTTGCGC CGGACCTGGC TGAAATCAGG ACGTCCATTA CGAATGTCAG CAATGAAATC ACGCAGACCG TCAATAAAAA ACTGGAAAAT CAGAGTGCGG CAATCCAGCA GATACAGAAA GTTCAGGTTG ATACAAATAA TAACCTGAAC AGCATGTGGG CCGTGAAACT GCAGCAGATG CAGGACGGAC GCCTTTATAT TGCGGGTATC GGTGCCGGTA TTGAGAATAC GCCAGCAGGA ATGCAGAGTC AGGTGCTGCT GGCGGCAGAC AGGATTGCGA TGATTAATCC TGCGAATGGC AACACAAAGC CGATGTTTGT TGGTCAGGGC GATCAGATAT TTATGAATGA AGTGTTCCTG AAATATCTGA CGGCTCCCAC CATTACCAGC GGCGGTAATC CTCCGGCATT TTCCCTGACA CCGGACGGGC GGCTGACGGC GAAAAATGCC GATATCAGCG GTAACGTGAA TGCGAACTCC GGGACGCTCA ACAACGTCAC GATTAACGAG AACTGTCGGG TTCTGGGAAA ATTGTCCGCG AACCAGATTG AAGGCGATCT CGTTAAAACA GTGGGCAAAG CTTTCCCCCG GGACTCCCGT GCACCGGAGC GGTGGCCATC AGGAACCATT ACCGTCAGGG TTTATGACGA TCAGCCGTTT GACCGGCAGA TTGTTATTCC GGCGGTGGCA TTCAGCGGCG CTAAACATGA GAAAGAGCAT ACTGATATTT ACTCCTCATG CCGTCTGATA GTGCGGAAAA ACGGTGCTGA AATTTATAAC CGTACCGCGC TGGATAATAC GCTGATTTAC AGTGGTGTTA TTGATATGCC TGCCGGTCAC GGTCACATGA CACTGGAGTT TTCGGTGTCA GCATGGCTGG TAAATAACTG GTATCCCACA GCAAGTATCA GCGATTTGCT GGTTGTGGTG ATGAAGAAAG CCACTGCAGG CATCACGATT AGCTGA
|
Protein sequence | MGKGGGKGHT PVEAKDNLKS TQMMSVIDAI GEGPIEGPVK GLQSILVNKT PLTDTDGNPV IHGVTAVWRA GEQEQTPPEG FESSGAETAL GVEVTKAKPV TRTITSANID RLRVTFGVQS LLETTSKGDR NHSSVRLLIQ LQRNGNWVTE KDVTINGKTT SQFLASVILD NLPPRPFNIR MVRETADSTT DQLQNRTLWS SYTEIIDVKQ CYPNTAIVGL QVDAEQFGGQ QMTVNYHIRG RIIQVPSNYD PEKRTYSGIW DGSLKPAYSN NPAWCLWDML THPRYGMGKR LGAADVDKWA LYAIVTLPET GAATVNLING SGKPVSVDIT AHPAPDRIQV STLPDGVETY GVWGLSLPSL RRRLFRCVSV RENTDGTFAI TAVQHVPEKE AIVDNGARFE PQSGSLNSVI PPAVQHLTVE VSAADGQYLA QAKWDTPRVV KGVRFSLRLT SGKGTDARLV TTAITADTEH RFSGLPLGEY TLTVRAINSY GQQGEPATTT FRIAAPAAPS RIELTPGYFQ ITATPHLAVY DPTVQFEFWF SEKRIADIRQ VETAARYLGS ALYWIAASIN IKPGHDYYFY IRSVNTVGKS AFVEAVGRAS DDAEGYLDFF KGEIGKTHLA QELWTQIDNG QLAPDLAEIR TSITNVSNEI TQTVNKKLEN QSAAIQQIQK VQVDTNNNLN SMWAVKLQQM QDGRLYIAGI GAGIENTPAG MQSQVLLAAD RIAMINPANG NTKPMFVGQG DQIFMNEVFL KYLTAPTITS GGNPPAFSLT PDGRLTAKNA DISGNVNANS GTLNNVTINE NCRVLGKLSA NQIEGDLVKT VGKAFPRDSR APERWPSGTI TVRVYDDQPF DRQIVIPAVA FSGAKHEKEH TDIYSSCRLI VRKNGAEIYN RTALDNTLIY SGVIDMPAGH GHMTLEFSVS AWLVNNWYPT ASISDLLVVV MKKATAGITI S
|
| |