Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3026 |
Symbol | |
ID | 6968919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2809529 |
End bp | 2811976 |
Gene Length | 2448 bp |
Protein Length | 815 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643386860 |
Product | phage tail tape measure protein, TP901 family |
Protein accession | YP_002271328 |
Protein GI | 209397347 |
COG category | [S] Function unknown |
COG ID | [COG5283] Phage-related tail protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.0149182 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAACA ATGTAAAATT ACAGGTATTG CTCAGGGCTG TTGACCAGGC ATCCCGCCCG TTTAAATCCA TCCGCACAGC GAGTAAGTCG CTGTCGGGGG ATATCCGGGA AACACAAAAA TCACTGCGCG AGCTGAACGG GCAGGCATCC CGTATTGAGG GATTCCGCAA GACCAGCGCA CAGCTCGCCG TGACTGGTCA TGCACTTGAA AAGGCACGGC AGGAGGCCGA AGCCCTTGCC ACACAGTTTA AAAACACCGA ACGTCCGACC CGTGCTCAGG CGAAAGTGCT GGAATCCGCA AAGCGTGCGG CGGAGGACTT ACAGGCGAAA TATAACCGCC TGACGGATTC CGTTAAACGC CAGCAGCGGG AACTGGCCGC TGTGGGAATT AATACCCGCA ATCTTGCACA TGATGAGCTG GGGCTGAAAA ACCGTATCAG TGAAACCACC GCACAGCTTA ACCGGCAGCG TGACGCGCTG GCGCGTGTCA GTGCGCAACA GGCAAAACTT AACGCAGTAA AACAGCGCTA TCAGGCAGGA AAGGAGCTGG CCGGAAATAT GGCCTCAGTG GGCGCTGCCG GTGTGGGGAT TGCGGCGGCG GGAACGATGG CCGGTGTTAA GCTGCTGATG CCCGGTTATG AATTTGCGCA GAAAAACTCA GAATTACAGG CCGTGCTCGG TGTGGCAAAA GACTCCGCCG AAATGGCCGC GCTACGCAAG CAGGCGCGCC AGCTCGGCGA CAATACCGCC GCCTCGGCGG ATGATGCGGC CGGTGCACAG ATTATCATCG CGAAAGCGGG TGGGGATGTT GATGCCATTC AGGCGGCAAC GCCGGTCACG CTGAATATGG CGCTGGCGAA CCGTCGCACG ATGGAAGAAA ACGCCGCCCT GCTGATGGGG ATGAAATCTG CCTTTCAGCT TTCAAACGAT AAGGTTGCTC ATATCGGGGA TGTTCTCTCC ATGACGATGA ACAAAACCGC CGCCGATTTT GACGGTATGA GCGATGCGCT GACCTATGCC GCACCTGTGG CAAAAAATGC CGGTGTCAGC ATTGAAGAAA CCGCCGCAAT GGTCGGGGCG CTGCATGATG CAAAAATCAC AGGCTCAATG GCGGGGACGG GAAGCCGTGC CGTGTTAAGT CGCCTGCAGG CACCGACGGG AAAAGCATGG GATGCACTCA AAGAGCTTGG AGTGAAAACC TCAGACAGCA AGGGGAATAC CCGACCAATA TTTACCATTC TGAAAGAAAT GCAGGCCAGT TTTGAGAAAA ACCGGCTCGG TACTGCCCAG CAGGCTGAAT ACATGAAAAC CATTTTCGGG GAGGAGGCCA GTTCAGCCGC CGCCGTGCTG ATGACTGCCG CCTCAACCGG AAAGCTGGAC AAACTGACCG CTGCGTTTAA AGCCTCAGAC GGGAAGACCG CAGAGCTGGT AAATATCATG CAGGACAACC TCGGCGGTGA CTTTAAGGAG TTTCAGTCCG CTTATGAGGC GGTGGGGACT GACCTGTTTG ACCAACAGGA AGGCGCACTG CGTAATCTCA CGCAGACGGC CACAAAGTAT GTGTTAAAAC TCGACGGCTG GATCCAGAAA AACAAATCAC TGGCGTCAAC CATCGGCCTT ATTGTCGGTG GCGCACTGGC GCTTATTGGC ATCATCGGTG CAATTGGTCT TGTAGCCTGG CCGGTTATCA CCGGCATTAA TGCCATCATC GCGGCAGCAG GCGCAATGGG GGCAATCTTC ACGACGGTTG GCAGTGCTGT TATGACGGCC ATCGGGGCGA TTAGCTGGCC GGTTGTGGCC GTGGTGGCTG CAATTGTCGC CGGGGCGTTG CTTATCCGTA AATACTGGGA GCCTGTCAGC GCATTCTTTG GCGGTGTGGT TGAAGGGCTG AAAGCTGCAT TTGCGCCGGT GGGGGAACTG TTCACGCCAC TTAAGCCGCT GTTTGACTGG CTGGGTGAAA AGTTACAGGC CGCGTGGCAG TGGTTTAAAA AACTGATTGC CCCGGTCAAA GCCACCCAGG ACACCCTGAA CCGTTGCCGT GACACGGGCG TCATGTTCGG GCAGGCACTG GCTGACGCGT TGATGCTGCC GCTTAATGCG TTCAACAAAC TGCGCAGTGG TATTGACTGG GTACTGGAAA AACTCGGTGT TATCAACAAA GAGTCAGACA CACTTGACCA GACCGCCGCC AGAACTCATG CCGCCACGTA TGGCACCGGT GGTTATATTC CGGCGACCAG CTCTTATGCA GGCTATCAGG CTTATCAGCC GGTCACGGCA CCGGCTGGCC GCTCTTATGT GGACCAGAGT AAAAACGAAT ATCACATCAG CCTGACGGGT GGTACTGCGC CGGGGACACA GCTTGACCGC CAGTTACAGG ATGCGCTCGA AAAATACGAG CGGGATAAAC GTGCGCGCGC CCGTGCCAGC ATGATGCATG ACGGTTAA
|
Protein sequence | MSNNVKLQVL LRAVDQASRP FKSIRTASKS LSGDIRETQK SLRELNGQAS RIEGFRKTSA QLAVTGHALE KARQEAEALA TQFKNTERPT RAQAKVLESA KRAAEDLQAK YNRLTDSVKR QQRELAAVGI NTRNLAHDEL GLKNRISETT AQLNRQRDAL ARVSAQQAKL NAVKQRYQAG KELAGNMASV GAAGVGIAAA GTMAGVKLLM PGYEFAQKNS ELQAVLGVAK DSAEMAALRK QARQLGDNTA ASADDAAGAQ IIIAKAGGDV DAIQAATPVT LNMALANRRT MEENAALLMG MKSAFQLSND KVAHIGDVLS MTMNKTAADF DGMSDALTYA APVAKNAGVS IEETAAMVGA LHDAKITGSM AGTGSRAVLS RLQAPTGKAW DALKELGVKT SDSKGNTRPI FTILKEMQAS FEKNRLGTAQ QAEYMKTIFG EEASSAAAVL MTAASTGKLD KLTAAFKASD GKTAELVNIM QDNLGGDFKE FQSAYEAVGT DLFDQQEGAL RNLTQTATKY VLKLDGWIQK NKSLASTIGL IVGGALALIG IIGAIGLVAW PVITGINAII AAAGAMGAIF TTVGSAVMTA IGAISWPVVA VVAAIVAGAL LIRKYWEPVS AFFGGVVEGL KAAFAPVGEL FTPLKPLFDW LGEKLQAAWQ WFKKLIAPVK ATQDTLNRCR DTGVMFGQAL ADALMLPLNA FNKLRSGIDW VLEKLGVINK ESDTLDQTAA RTHAATYGTG GYIPATSSYA GYQAYQPVTA PAGRSYVDQS KNEYHISLTG GTAPGTQLDR QLQDALEKYE RDKRARARAS MMHDG
|
| |