Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3126 |
Symbol | |
ID | 6969619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2900648 |
End bp | 2903293 |
Gene Length | 2646 bp |
Protein Length | 881 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643386952 |
Product | putative prophage tail length tape measure protein |
Protein accession | YP_002271420 |
Protein GI | 209399313 |
COG category | [S] Function unknown |
COG ID | [COG5281] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01541] phage tail tape measure protein, lambda family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.0141545 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGGTA ATTTTGCCGA TCTGACAGCC GTGCTGACAC TGGATTCTGC CCGTTTTTCT GAAGAGGCAG CGCGGGTAAA AAAAGAGCTG GGTGAAACCA GTGCGCTTGC TGATTTGATG TCCGGGAAAG TCAGTCAGTC TTTCAGAAAA CAGGCTGATG CTGCTGAGCA GAGTCTGAGC CGACAGGCGC TGGCTGCACA AAAAGCCGGG ATATCAGTCG GACAGTATAA GGCTGCCATG CGCACACTGC CCGCACAGTT CACGGATATT GTCACTCAGC TTGCCGGTGG TCAGAATCCC TTCCTTATCA TGCTGCAGCA GGGGGGGCAG ATCAGCGATT CATTCGGTGG ACCGCTCAGC CTGCTTACCC TGCTGAAGGA GGAACTTCTC GGGATCAGGG ATGCCTCTGA ATCATCAGAG GAGTCGCTGT CAGATACGGC AAATGCACTG GCTGAAAATG CCCGGAATGC CGGTGAGCTG GGACGATTTA TGTCGGTGGC CCGTGTGGCG GCAGGTGGCG GGGTTGCCGT ACTGGCCGCG CTTGCTGCCG CCGCCTGGCA GGCAGAGCAG GCTGACCGGG CCTTATTGCG TTCACTGACC CTGACCGGAG GGGCTGCTGC CACCACAACG GCAGAATTGT GGAAAATGGC CGGGGTGATC AGCGATGAAG CCGGTGGTGG TATCAGACAG GCGGCAGAAA ATCTGGCCCG TCTGGCAGAA AGCGGGAAAT ATACCGCCGG GCAGCTACGG ATCATGGGGG AAACCTCTCA GAGATGGCTG CAGACGGTGG GGGACGATGC CGGGAAGGTG GAAAAAGCCT TTGAAGGGAT TGCAGCAGAT CCGGTGAAGG CGCTGGCCTC CCTGAATCAG CAGTATAACT TCCTGAGCGT TTCCCAGTTA CGCCATATTG ATGAGCTTGA GCGCACGAAA GGTAAACAGG CTGCGGTGAC GGAGGCGATG TCCCTGTTTG CGGATGTCAT GAATGCACGT CTGGAGCAAC TTGATAAAGC GGCCACGCCG GTGGAAAAAA TCTGGGACGA TGTTAAAACC TGGACTTCTG ACGCATGGGC ATGGATAGGT GATCATACAC TGGGGGCACT CAGTCTGATC ACTGACGTGG TGGCAGGAAC CGTTGAACAG GTGAAGCTGC TGCTTGTGCA GGGGGATCTG GCGCTGGCTG AATTTATTCA GTCAGCCTGG GAAACGACAA AGAATGTGCC CGGCGTTGGT GCGTTGTTTG GTGAACTGGC AGAAGAGAAC CGCGTATTTA TTGAGAAAAC AAAACGTGAT GAACTGGCGC TGAGAAAATC CATTGCGGAA CGGGATGCGC GTATACGCCA GGGGGAAATG GGGTACATCA ACCGTTCGCG TGCAACAGGC GTCAGCAAAG GTCCGGGGCA GCAGGAAGCC GTCAGCCGTC TGGCTGAAGA GCTGACAGGT AAAAAGCATA CATCACCGAA AACGCGCTCT GCCGGGGAGA GGGAAGAGGA GCAGGCAAGA GAGGCTCTGC TTGCCCTTGA AGCTGAGCTC AGGACGCTGG AAAAACACAG CGGTGCGAAT GAGAAAATCA GCCGGCAGCG CCGTGATTTA TGGAAGGCGG AAAGTCAGTA TGCGGTCCTG AAAGAGGCTG CCACGAAACG ACAGTTATCT GAGCAGGAAA AATCCCTGCT GGCGCATAAA GACGAGACGC TGGAGTACAA ACGCCAGCTG GCTGAGCTGG GCGACAAGGT TGAATACCAG AAACGCCTGA ATGAGCTGGC ACAGCAGGCG GTGCGGTTTG AAGAGCAGCA GAGCGCGAAG CAGGCCGCCA TCAGCGCAAA AGCCCGCGGT CTCACTGACC GTCAGGCGCA GCGGGAGTCT GAAGCGCAGC GTCTTCGGGA CGTGTACGGT GATAATCCGG CTGCGCTGGC GAAGGCCACA TCGGCACTGA AGAACACCTG GTCTGCGGAG GAGCAGCTTC GTGGAAGCTG GATGGCCGGG CTGAAGTCCG GCTGGGGCGA GTGGGCGGAA AGTGCGACGG ACAGTTTTTC GCAGGTTAAA AGTGCTGCCA CGCAGACCTT TGACGGTATT GCACAGAATA TGGCGGCGAT GCTGACCGGT GCAGAGGCAG ACTGGCGGGG ATTCACCCGT TCGGTGCTGT CCATGATGAC AGAAATCCTG CTTAAACAGG CCATGGTGGG CATTGTCGGG CGTATCGGCA GCGCCATTGG CGGTGCTTTC GGTGGTGGTG CGTCTGCCTC CACGGGGACG GCCATTCAGG CTGCGGCGGC GAACTTCCAT TTCGCGACCG GGGGATTTAC GGGGACGGGG GGTAAATATG AACCTGCGGG AATTGTTCAT CGCGGGGAGT TTGTCTTCAC GAAGGAGGCA ACCAGCCGGA TTGGCGTCGG CAACCTGTAC CGCCTGATGC GGGGCTATGC GGAAGGGGGT TATGTGGGCG GTGCCGGAAG TCCGGCGCAG ATGCGGCGGA CGGAAGGCAT TAATTTTAAT CAGAACAATC ACGTGGTGAT TCAGAACGAC GGCACCAACG GACAGGCGGG GCCGCAGCTG ATGAAGGCGG TGTATGACAT GGCCCGCAAG GGGGCGCAGG ATGAGCTCCG GCTGCAGTTG CGTGATGGCG GTATGTTATC GGGGAGCAGG CGATGA
|
Protein sequence | MAGNFADLTA VLTLDSARFS EEAARVKKEL GETSALADLM SGKVSQSFRK QADAAEQSLS RQALAAQKAG ISVGQYKAAM RTLPAQFTDI VTQLAGGQNP FLIMLQQGGQ ISDSFGGPLS LLTLLKEELL GIRDASESSE ESLSDTANAL AENARNAGEL GRFMSVARVA AGGGVAVLAA LAAAAWQAEQ ADRALLRSLT LTGGAAATTT AELWKMAGVI SDEAGGGIRQ AAENLARLAE SGKYTAGQLR IMGETSQRWL QTVGDDAGKV EKAFEGIAAD PVKALASLNQ QYNFLSVSQL RHIDELERTK GKQAAVTEAM SLFADVMNAR LEQLDKAATP VEKIWDDVKT WTSDAWAWIG DHTLGALSLI TDVVAGTVEQ VKLLLVQGDL ALAEFIQSAW ETTKNVPGVG ALFGELAEEN RVFIEKTKRD ELALRKSIAE RDARIRQGEM GYINRSRATG VSKGPGQQEA VSRLAEELTG KKHTSPKTRS AGEREEEQAR EALLALEAEL RTLEKHSGAN EKISRQRRDL WKAESQYAVL KEAATKRQLS EQEKSLLAHK DETLEYKRQL AELGDKVEYQ KRLNELAQQA VRFEEQQSAK QAAISAKARG LTDRQAQRES EAQRLRDVYG DNPAALAKAT SALKNTWSAE EQLRGSWMAG LKSGWGEWAE SATDSFSQVK SAATQTFDGI AQNMAAMLTG AEADWRGFTR SVLSMMTEIL LKQAMVGIVG RIGSAIGGAF GGGASASTGT AIQAAAANFH FATGGFTGTG GKYEPAGIVH RGEFVFTKEA TSRIGVGNLY RLMRGYAEGG YVGGAGSPAQ MRRTEGINFN QNNHVVIQND GTNGQAGPQL MKAVYDMARK GAQDELRLQL RDGGMLSGSR R
|
| |