Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1796 |
Symbol | |
ID | 6972210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1711617 |
End bp | 1714262 |
Gene Length | 2646 bp |
Protein Length | 881 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643385742 |
Product | putative prophage tail length tape measure protein |
Protein accession | YP_002270232 |
Protein GI | 209395830 |
COG category | [S] Function unknown |
COG ID | [COG5281] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01541] phage tail tape measure protein, lambda family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.132418 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGGTA ATTTTGCCGA TCTGACAGCC GTGCTGACAC TGGATTCTGC CCGTTTTTCT GAAGAGGCAG CGCGGGTAAA AAAAGAGCTG GGTGAAACCA GTGCGCTTGC TGATTTGATG TCCGGGAAAG TCAGTCAGTC TTTCAGAAAA CAGGCTGATG CTGCTGAGCA GAGTCTGAGC CGACAGGCGC TGGCTGCACA AAAAGCCGGG ATATCAGTCG GACAGTATAA GGCTGCCATG CGCACACTGC CCGCACAGTT CACGGATATT GTCACTCAGC TTGCCGGTGG TCAGAATCCC TTCCTTATCA TGCTGCAGCA GGGGGGGCAG ATCAGCGATT CATTCGGTGG ACCGCTCAGC CTGCTTACCC TGCTGAAGGA GGAACTTCTC GGGATCAGGG ATGCCTCTGA ATCATCAGAG GAGTCGCTGT CAGATACGGC AAATGCACTG GCTGAAAATG CCCGGAATGC CGGTGAGCTG GGACGATTTA TGTCGGTGGC CCGTGTGGCG GCAGGTGGCG GGGTTGCCGT ACTGGCCGCG CTTGCTGCCG CCGCCTGGCA GGCAGAGCAG GCTGACCGGG CCTTATTGCG TTCACTGACC CTGACCGGAG GGGCTGCTGC CACCACAACG GCAGAATTGT GGAAAATGGC CGGGGTGATC AGCGATGAAG CCGGTGGTGG TATCAGACAG GCGGCAGAAA ATCTGGCCCG TCTGGCAGAA AGCGGGAAAT ATACCGCCGG GCAGCTACGG ATCATGGGGG AAACCTCTCA GAGATGGCTG CAGACGGTGG GGGACGATGC CGGGAAGGTG GAAAAAGCCT TTGAAGGGAT TGCAGCAGAT CCGGTGAAGG CGCTGGCCTC CCTGAATCAG CAGTATAACT TCCTGAGCGT TTCCCAGTTA CGCCATATTG ATGAGCTTGA GCGCACGAAA GGTAAACAGG CTGCGGTGAC GGAGGCGATG TCCCTGTTTG CGGATGTCAT GAATGCACGT CTGGAGCAAC TTGATAAAGC GGCCACGCCG GTGGAAAAAA TCTGGGACGA TGTTAAAACC TGGACTTCTG ACGCATGGGC ATGGATAGGT GATCATACAC TGGGGGCACT CAGTCTGATC ACTGACGTGG TGGCAGGAAC CGTTGAACAG GTGAAGCTGC TGCTTGTGCA GGGGGATCTG GCGCTGGCTG AATTTATTCA GTCAGCCTGG GAAACGACAA AGAATGTGCC CGGCGTTGGT GCGTTGTTTG GTGAACTGGC AGAAGAGAAC CGCGTATTTA TTGAGAAAAC AAAACGCGAT GAACTGGCGC TGAGAAAATC CATTGCGGAA CGGGATGCGC GTATACGCCA GGGGGAAATG GGGTACATCA ACCGCTCGCG TGCAACAGGC GTCAGCAAAG GTCCGGGGCA GCAGGAAGCC GTCAGCCGTC TGGCTGAAGA GCTGACAGGT AAAAAGCATA CATCACCGAA AACGCGCTCT GCCGGGGAGA GGGAAGAGGA GCAGGCAAGA GAGGCTCTGC TTGCCCTTGA AGCTGAGCTC AGGACGCTGG AAAAACACAG CGGTGCGAAT GAGAAAATCA GCCGGCAGCG CCGTGATTTA TGGAAAGCGG AAAATCAGTA TGCGGTCCTG AAAGAGGCTG CCACGAAACG GCAGTTATCT GAGCAGGAAA AATCCCTGCT GGCGCATAAA GACGAGACGC TGGAGTACAA ACGCCAGCTG GCTGAGCTGG GCGACAAGGT TGAATACCAG AAACGCCTGA ATGAGCTGGC ACAGCAGGCG GTGCGGTTTG AAGAGCAGCA GAGCGCGAAG CAGGCCGCCA TCAGCGCAAA AGCCCGCGGT CTCACTGACC GTCAGGCGCA GCGGGAGTCT GAAGCGCAGC GTCTTCGGGA CGTGTACGGT GATAATCCGG CTGCGCTGGC GAAGGCCACA TCGGCACTGA AGAACACCAG GTCTGCGGAG GAGCAGCTTC GTGGAAGCTG GATGGCCGGG CTGAAGTCCG GCTGGGGCGA GTGGGCGGAA AGTGCGACGG ACAGTTTTTC GCAGGTTAAA AGTGCTGCCA CGCAGACCTT TGACGGTATT GCACAGAATA TGGCGGCGAT GCTGACCGGT GCAGAGGCAG ACTGGCGGGG ATTCACCCGT TCGGTGCTGT CCATGATGAC AGAAATCCTG CTTAAACAGG CCATGGTGGG CATTGTCGGG CGTATCGGCA GCGCCATTGG CGGTGCTTTC GGTGGTGGTG CATCTGCTTC CTCGGGGACG GCCATTCAGG CTGCGGCGGC GAACTTCCAT TTCGCGACCG GAGGATTTAC GGGGACGGGC GGCAAATATG AGCCTGCGGG GATAGTTCAC CGCGGGGAGT TTGTTTTCAC GAAAGAGGCA ACCAGCCGGA TAGGTGTGGG GAATCTTTAC CGTCTGATGC GCGGCTATGC GGAAGGTGGT TATGTGGGTG GTGCCGGAAG TCCGGCGCAG ATGCGGCGGA CGGAAGGCAT TAATTTTAAT CAGAACAATC ACGTGGTGAT TCAGAACGAC GGCTCCAACG GACAGGCGGG GCCGCAGCTG ATGAAGGCGG TGTATGACAT GGCCCGCAAG GGGGCGCAGG ATGAGATTCA GGCGCAGATG CGTGATGGCG GCGTCTTTTC CGGAGGCAGG CGATGA
|
Protein sequence | MAGNFADLTA VLTLDSARFS EEAARVKKEL GETSALADLM SGKVSQSFRK QADAAEQSLS RQALAAQKAG ISVGQYKAAM RTLPAQFTDI VTQLAGGQNP FLIMLQQGGQ ISDSFGGPLS LLTLLKEELL GIRDASESSE ESLSDTANAL AENARNAGEL GRFMSVARVA AGGGVAVLAA LAAAAWQAEQ ADRALLRSLT LTGGAAATTT AELWKMAGVI SDEAGGGIRQ AAENLARLAE SGKYTAGQLR IMGETSQRWL QTVGDDAGKV EKAFEGIAAD PVKALASLNQ QYNFLSVSQL RHIDELERTK GKQAAVTEAM SLFADVMNAR LEQLDKAATP VEKIWDDVKT WTSDAWAWIG DHTLGALSLI TDVVAGTVEQ VKLLLVQGDL ALAEFIQSAW ETTKNVPGVG ALFGELAEEN RVFIEKTKRD ELALRKSIAE RDARIRQGEM GYINRSRATG VSKGPGQQEA VSRLAEELTG KKHTSPKTRS AGEREEEQAR EALLALEAEL RTLEKHSGAN EKISRQRRDL WKAENQYAVL KEAATKRQLS EQEKSLLAHK DETLEYKRQL AELGDKVEYQ KRLNELAQQA VRFEEQQSAK QAAISAKARG LTDRQAQRES EAQRLRDVYG DNPAALAKAT SALKNTRSAE EQLRGSWMAG LKSGWGEWAE SATDSFSQVK SAATQTFDGI AQNMAAMLTG AEADWRGFTR SVLSMMTEIL LKQAMVGIVG RIGSAIGGAF GGGASASSGT AIQAAAANFH FATGGFTGTG GKYEPAGIVH RGEFVFTKEA TSRIGVGNLY RLMRGYAEGG YVGGAGSPAQ MRRTEGINFN QNNHVVIQND GSNGQAGPQL MKAVYDMARK GAQDEIQAQM RDGGVFSGGR R
|
| |