Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1872 |
Symbol | |
ID | 6967657 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1768960 |
End bp | 1771605 |
Gene Length | 2646 bp |
Protein Length | 881 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643385808 |
Product | putative prophage tail length tape measure protein |
Protein accession | YP_002270297 |
Protein GI | 209396157 |
COG category | [S] Function unknown |
COG ID | [COG5281] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01541] phage tail tape measure protein, lambda family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.179958 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGGTA ATTTTGCCGA TCTGACAGCC GTGCTGACAC TGGATTCTGC CCGTTTTTCT GAAGAGGCAG CGCGGGTAAA AAAAGAGCTG GGTGAAACCA GTGCGCTTGC TGATTTGATG TCCGGGAAAG TCAGTCAGTC TTTCAGAAAA CAGGCTGATG CTGCTGAGCA GAGTCTGAGC CGACAGGCGC TGGCTGCACA AAAAGCCGGG ATATCAGTCG GACAGTATAA GGCTGCCATG CGCACACTGC CCGCACAGTT CACGGATATT GTCACTCAGC TTGCTGGTGG TCAGAATCCC TTCCTTATCA TGCTGCAGCA GGGGGGGCAG ATCAGCGATT CATTCGGTGG ACCGCTCAGC CTGCTTACCC TGCTGAAGGA GGAACTTCTC GGGATCAGGG ATGCCTCTGA ATCATCAGAG GAGTCGCTGT CAGATACGGC AAATGCACTG GCTGAAAATG CCCGGAATGC CGGTGAGCTG GGACGATTTA TGTCGGTGGC CCGTGTGGCG GCAGGTGGCG GGGTTGCCGT ACTGGCCGCG CTTGCTGCCG CCGCCTGGCA GGCAGAGCAG GCTGATCGGG CCTTATTGCG TTCACTGATC CTGACCGGAG GGGCGGCTGC CACCACAACG GCAGAATTGT GGAAAATGGC CGGGGTGATC AGCGATGAAG CCGGTGGTGG TATCAGACAG GCGGCAGAAA ATCTGGCCCG TCTGGCAGAA AGCGGGAAAT ATACCGCCGG GCAGCTACGG ATCATGGGGG AAACCTCTCA GAGATGGCTG CAGACGGTGG GGGACGATGC CGGGAAGGTG GAAAAAGCCT TTGAAGGGAT TGCAGCAGAT CCGGTGAAGG CGCTGGCCTC CCTGAATCAG CAGTATAACT TCCTGAGCGT TTCCCAGTTA CGCCATATTG ATGAGCTTGA GCGCACGAAA GGTAAACAGG CTGCGGTGAC GGAGGCGATG TCCCTGTTTG CGGATGTCAT GAATGCACGT CTGGAGCAAC TTGATAAAGC GGCCACGCCG GTGGAAAAAA TCTGGGACGA TGTTAAAACC TGGACTTCTG ACGCATGGGC ATGGATAGGT GATCATACAC TGGGGGCACT CAGTCTGATC ACTGACGTGG TGGCCGGAAC CGTTGAACAA GTGAAGCTGC TGCTTGTGCA GGGGGATCTG GCGCTGGCTG AATTTATTCA GTCAGCCTGG GAAACGACAA AGAATGTGCC CGGCGTTGGT GCGTTGTTTG GTGAACTGGC AGAAGAGAAC CGCGTATTTA TTGAGAAAAC AAAACGCGAT GAACTGGCGC TGAGAAAATC CATTGCGGAA CGGGATGCGC GTATACGCCA GGGGGAAATA GGGTACATCA ACCGCTCGCG TGCAACAGGC GTCAGCAAAG GTCCAGGGCA GCAGGAAGCC GTCAGCCGTC TGGCTGAAGA GCTGACAGGT AAAAAGCATA CATCACCGAA AACGCGCTCT GCCGGGGAGA GGGAAGAGGA GCAGGCAAGA GAGGCTCTGC TTGCCCTTGA AGCTGAGCTC AGGACGCTGG AAAAACACAG CGGTGCGAAT GAGAAAATTA GCCGGCAGCG CCGTGATTTA TGGAAGGCGG AAAGTCAGTA TGCGGTCCTG AAAGAGGCTG CCACGAAACG GCAGTTATCC TGGCAGGAAA AATCCCTGCT GGCCCATGAG AAAGAGACGC TGGAGTACAA ACGCCAGCTG GCTGACCTGG GCGACAAGGT TGAACACCAG AAACGGCTGA ATGAGCTGGC ACAGCAGGCT GCGCGGTTTG AGCAGCAGCA GGGCGCGAAG CAGGCGGCAA TCAGTGCCCA GGCGCGGGGC CTCACCGACC GTCAGGCGCA GCGGGAGTCG GAAGAGCAGC GCCTTCGTGA CGTGTACGGT GATAATCCGG ATGCGCTGGC GAAGGCCACA TCGGCACTGA AGAACACCTG GTCTGCGGAG GAGCAGCTTC GTGGAAGCTG GATGGCCGGT CTGAAGTCCG GCTGGGGCGA GTGGGCAGAA AGTGCGACGG ACAGTTTTTC GCAGGTTAAA AGCGTGGCCA CGCAGACCTT TGACGGTATT GCACAGAATA TGGCAGCGAT GCTGACCGGC AGCGAACAGA ACTGGCGTGG TTTCACCCGT TCTGTGCTCT CCATGCTGAC AGAGATTTTT CTGAAGCAGG CGATGGTGGG GATAGTCGGG AGTATCGGCA GCGCCATTGG CGGTGCTTTC GGTGGTGGTG CATCTGCTTC CTCGGGGACG GCCATTGAGG CTGCGGCGGC GAACTTCCAT TTCGCGACCG GAGGATTTAC GGGGACGGGC GGCAAATATG AGCCTGCGGG GATAGTTCAC CGCGGGGAGT TTGTTTTCAC GAAAGAGGCA ACCAGCCGGA TAGGTGTGGG GAATCTTTAC CGTCTGATGC GCGGCTATGC GGAAGGTGGT TATGTGGGTG GTGCCGGAAG TCCGGCGCAG ATGCGGCGGG CGGAAGGTAT TAATTTTAAT CAGAACAATC ACGTGGTGAT TCAGAACGAC GGCACCAACG GACAGGCGGG GCCGCAGCTG ATGAAGGCGG TGTATGACAT GGCCCGCAAG GGGGCGCAGG ATGAGCTCCG GCTGCAGTTG CGTGATGGCG GTATGTTATC GGGGAGCGGG CGATGA
|
Protein sequence | MAGNFADLTA VLTLDSARFS EEAARVKKEL GETSALADLM SGKVSQSFRK QADAAEQSLS RQALAAQKAG ISVGQYKAAM RTLPAQFTDI VTQLAGGQNP FLIMLQQGGQ ISDSFGGPLS LLTLLKEELL GIRDASESSE ESLSDTANAL AENARNAGEL GRFMSVARVA AGGGVAVLAA LAAAAWQAEQ ADRALLRSLI LTGGAAATTT AELWKMAGVI SDEAGGGIRQ AAENLARLAE SGKYTAGQLR IMGETSQRWL QTVGDDAGKV EKAFEGIAAD PVKALASLNQ QYNFLSVSQL RHIDELERTK GKQAAVTEAM SLFADVMNAR LEQLDKAATP VEKIWDDVKT WTSDAWAWIG DHTLGALSLI TDVVAGTVEQ VKLLLVQGDL ALAEFIQSAW ETTKNVPGVG ALFGELAEEN RVFIEKTKRD ELALRKSIAE RDARIRQGEI GYINRSRATG VSKGPGQQEA VSRLAEELTG KKHTSPKTRS AGEREEEQAR EALLALEAEL RTLEKHSGAN EKISRQRRDL WKAESQYAVL KEAATKRQLS WQEKSLLAHE KETLEYKRQL ADLGDKVEHQ KRLNELAQQA ARFEQQQGAK QAAISAQARG LTDRQAQRES EEQRLRDVYG DNPDALAKAT SALKNTWSAE EQLRGSWMAG LKSGWGEWAE SATDSFSQVK SVATQTFDGI AQNMAAMLTG SEQNWRGFTR SVLSMLTEIF LKQAMVGIVG SIGSAIGGAF GGGASASSGT AIEAAAANFH FATGGFTGTG GKYEPAGIVH RGEFVFTKEA TSRIGVGNLY RLMRGYAEGG YVGGAGSPAQ MRRAEGINFN QNNHVVIQND GTNGQAGPQL MKAVYDMARK GAQDELRLQL RDGGMLSGSG R
|
| |