Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0908 |
Symbol | |
ID | 6966762 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 916622 |
End bp | 919687 |
Gene Length | 3066 bp |
Protein Length | 1021 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643384930 |
Product | putative prophage tail length tape measure protein |
Protein accession | YP_002269430 |
Protein GI | 209396645 |
COG category | [S] Function unknown |
COG ID | [COG5281] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01541] phage tail tape measure protein, lambda family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCAGA TAGCCAACCT GGTCATTGAT TTGGGGATTG ATGCGGCAGA GTTTAAAAAT GAAATTCCCC GTATCAAAAA CCTTCTGAAT GGTGCAGCCA GCGATGCAGA ACGGTCTTCT GCCCGTATGC AGCGTTTTAT GGAGCGTCAG ACTCAGGCCG CCCGGCAGAC AATGCAGGCG GCTTCTTCGG CTGCAACAGC CGCATCCGTC CATGCGCAGA CGGTGGAGAA GAGCGCACAG GCTCATGAAC GCATGGCCCG CGAGGTGGAG CAAACCCGCC AGCGTATGGA GGCACTGAGC CAGAAAATGC GCGAGGAACA GGCGCAGGCC ATGGCTCTGG CGGAGGCTCA GGATAAAGCG GCTGCTGCGT TTTATCGTCA GATTGACAGT GTGAAACAGG CCGGTGCGGG GCTGCAGGAA TTACAGCGTA TTCAGCAACA GATCCGACAG GCCAGAAACA GTGGCGGGAT TGGTCAGCAG GATTATCTGG CGCTGATTTC TGAGGTTACG GCGAAAACCC GTGTTCTTAC ACAGGCTGAG GAAGAGGCTA CCCGACAGAA AGTGGCGTTT ATCCGTCAGC TTAAAGAGCA GGCAACCCGC CAGAATCTTT CTTCTTCTGA GTTGCTTCGT GCTAAGGCTG CCCAGCTGGG GGTAAGCAGT GCTGCAGAAG TGTATATCCG CAAAATGGAG CAGGCAGGAA AAGCCACGCA TTCGCTGGGT CTGAAAAGTG CAGCAGCCCG TCAGGAGATA GGCGTTCTGA TAGGTGAACT GGCCCGCGGC AATTTAGGTG CGCTGAGGGG ATCCGGAATA ACGCTGGCTA ACCGTGCCGG GTGGATAGAC ACACTGATGT CACCGAAAGG CATGATGCCA GGAGCGGTTA TTGGCGGTAT TGCCGCGGCT GTCTATGGTC TGGGTAAAGC CTGGTATGAC GGTCAGAAGG AGGGGGAAGA ATTTAACCGC CAGCTGTCGC TGACGGGGCA TTATGCCGGA GTCACTGCCG GGCAGCTGTG GACGCTCAGT CGTGCTATTT CCGGGAATGG TATCACGCAA CATGCTGCAG CCGGTGCGCT GGCTCAGGTG GTGGGGAGTG GTGCATTTCG TGGAAACGAT ATCGGTATGG TGGCGAGAGC TGCCGCACAG ATGGAGCGAT CGGTTGGCCA GTCGGTCAGC GATACCATAA ATCAGTTTAA GCGGCTGAAG GATGATCCTG TAAATGCCGC GAAGGCTCTG GACAATGAGC TGCATTTTCT TACTGCCACT CAGCTTGAGC AGATACGCGT CCTTGGGGAG CAGGGGCGGT CCAGTGATGC TGCACGGATA GCCATGTCTG CACTGGCAGA GGAAACCGGT CGGCGTACTG CGGATATTGA TAATAACCTC AATGCGCTTG GCAGTACGCT GAAGTATCTG TCTGATTTGT GGAGTCGTTT CTGGGATGCG GCCATGAATA TTGGTCGTGA AGACTCGCTG GATGAACAGA TTGCCGCTTT ACAGGAGAAA GTGTCGCGGG CGAAAAGACT CCCCTGGACG GCATCATCTT CTCAGGTTGA ATACGATCAG CAGCGTCTTA ACGATCTTCA GGAGAAAAAA CGCCAGAAGG ATTTGCAGGA TGCAAAAGAG CAGGCAGAGC GGAATTATCA GGAGCAACAG AAACGCCGTA ATGCTGAAAA TGCTGCACTG AACCGGATGA ATGAAACGGA AGCAGCACGA CATCAGCGTG AAATAGCGCG TATTAATGCC ATGCAGTACG CCGATCAGGC TGTCAGGGAT GCGGCGATAC AACGTGAAAA TGAACGTTAC GAGAAAGCCC TGGCATCCGG TAAGAAAAAA ACACGCGAAA CCCGTAATGA TGAGGCCACC CGGTTATTGC TGCAGTACAG TCAGCAACAG GCACAGGTGG AAGGACAGAT TGCTGCTGCC AGACAGTCAG CAGGCATTGC CACGGAAAGG ATGACAGAAG CGCGTAAACA GCTTCTGGCT CTGCAGCAGC GCATCAGCGA CCTGGACGGG AAAAAACTGA CGGCAGATGA AAAGAGTGTG CTGGCCCGTA AAGATGAACT GATTCAGGCA CTGACGCTGC TGGATGTAAA ACAGCAGGAG CTTCAGAAAC AGACGGCACT CAACGAGCTG AAGAAAAAAA CAATTCAGCT GACCAGTCAA CTGGCTGAAG AAGAGCGCGC TCAGCGTCAG CAACATGACC TGGATATCGC CACGGTGGGT ATGGGTGATC AGCAGCGACA GCGATATCAG GTACAACTGA GTCTTCGCCA GAAATACCAG CAACAGCTGG AGCAGTTGAG GCGGGATAGT GAGCAGAAAG GGACATATAA CACGGATGAC TACAGAAAGG CCGAGCAGGC GCTGACGGAG AGCCTGAACC GACAACTGAA TGAGAATCGC CGTTACTGGC AACAGCTTGA AGTTGTACAG GGTAACTGGA AAAACGGAGT CCTGCGTGCA TTTCAGGATT TTACCGTGGA TGCAGATAAT ACGGCAGAAA CAGCAGAACA GGTGTTCTCG TCAGCCTTCA GCAACATGGG AAATGGCCTG GCAACTTTTG TCACTACCGG CAAACTCAAT TTCAAATCCT TCACCTCTTC TGTGCTGTCA GATATGGCGA AAATCCTGGC GCAGGCAACC ATGATGAAAT CGATAAAAGG GATTGGCAGT GTACTGGGAT TTGATCTCAG CAGCCTTTCC CTGAATGCCA ATGGGGGGAT TTATCAGTCT GCTGATTTGA GTCGTTACAG TGGCACGGTG GTTAACCGTC CGACGTTTTT TGCTTTTGCA AAAGGCGCGG GTGTGATGGG GGAAGCGGGA CCTGAAGCCA TTCTGCCATT GCGTCGTGGT GCTGACGGTA AGCTGGGGGT TGTGGCGGAT ATTGGGGGTT CAGGTATGGC GATGTTTTCC CCGCAGTACA ACATCGAGAT CAATAACGAT GGCACGAACG GGCAGATAGG TCCGGCTGCC CTGAAGGCGG TTTATGACCT CGGGAAAAAA GCGGCAGCGG ACTTTATGCA ACAGCAGGCC CGTGATGGTG GTCGGTTAAG TGGAGCATAT CGGTAA
|
Protein sequence | MDQIANLVID LGIDAAEFKN EIPRIKNLLN GAASDAERSS ARMQRFMERQ TQAARQTMQA ASSAATAASV HAQTVEKSAQ AHERMAREVE QTRQRMEALS QKMREEQAQA MALAEAQDKA AAAFYRQIDS VKQAGAGLQE LQRIQQQIRQ ARNSGGIGQQ DYLALISEVT AKTRVLTQAE EEATRQKVAF IRQLKEQATR QNLSSSELLR AKAAQLGVSS AAEVYIRKME QAGKATHSLG LKSAAARQEI GVLIGELARG NLGALRGSGI TLANRAGWID TLMSPKGMMP GAVIGGIAAA VYGLGKAWYD GQKEGEEFNR QLSLTGHYAG VTAGQLWTLS RAISGNGITQ HAAAGALAQV VGSGAFRGND IGMVARAAAQ MERSVGQSVS DTINQFKRLK DDPVNAAKAL DNELHFLTAT QLEQIRVLGE QGRSSDAARI AMSALAEETG RRTADIDNNL NALGSTLKYL SDLWSRFWDA AMNIGREDSL DEQIAALQEK VSRAKRLPWT ASSSQVEYDQ QRLNDLQEKK RQKDLQDAKE QAERNYQEQQ KRRNAENAAL NRMNETEAAR HQREIARINA MQYADQAVRD AAIQRENERY EKALASGKKK TRETRNDEAT RLLLQYSQQQ AQVEGQIAAA RQSAGIATER MTEARKQLLA LQQRISDLDG KKLTADEKSV LARKDELIQA LTLLDVKQQE LQKQTALNEL KKKTIQLTSQ LAEEERAQRQ QHDLDIATVG MGDQQRQRYQ VQLSLRQKYQ QQLEQLRRDS EQKGTYNTDD YRKAEQALTE SLNRQLNENR RYWQQLEVVQ GNWKNGVLRA FQDFTVDADN TAETAEQVFS SAFSNMGNGL ATFVTTGKLN FKSFTSSVLS DMAKILAQAT MMKSIKGIGS VLGFDLSSLS LNANGGIYQS ADLSRYSGTV VNRPTFFAFA KGAGVMGEAG PEAILPLRRG ADGKLGVVAD IGGSGMAMFS PQYNIEINND GTNGQIGPAA LKAVYDLGKK AAADFMQQQA RDGGRLSGAY R
|
| |