Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2168 |
Symbol | |
ID | 6970387 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2080258 |
End bp | 2081826 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643386063 |
Product | host specificity protein |
Protein accession | YP_002270552 |
Protein GI | 209395902 |
COG category | [S] Function unknown |
COG ID | [COG4733] Phage-related protein, tail component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.524267 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.854577 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAAAG GTGGCGGTAA GGCACACACG CCTCGTGAGG CGAAGGATAA TCTCAAATCC ACGCAGATGA TGAGTGTGAT TGATGCGATT GGTGAGGGAC CGATAGAAGG TCCGGTGAAG GGACTGCAGA GTATTCTGGT GAACAAAACC CCACTGACGG ACACGGACGG CAATCCCGTG ATACACGGTG TGACCGCGGT CTGGCGTGCC GGGGAGCAGG AGCAGACACC ACCGGAAGGC TTTGAGTCCT CCGGAGCTGA AACCGGACTG GGCGTGGAAG TGACGAAGGC AAAACCGGTG ACGCGCACCA TTACGTCCGC GAACATTGAC CGCCTGCGGG TCACCTTCGG GGTGCAGTCA CTGTTGGAGA CCACCTCAAA GGGCGACCGT AATCACTCTT CTGTCCGACT GCTGATTCAG TTGCAGCGTA ACGGTAACTG GGTGACGGAA AAGGATGTCA CCATTAACGG CAAGACCACC TCGCAGTTTC TGGCGTCGGT GATTCTGGAT AATCTGCCTG AGCGGCCCTT TAACATCCGG ATGGTCCGGG AGACAGCGGA CAGCACCTCG GACCAGCTGC AGAATAAGAC GCTCTGGTCG TCATACACCG AAATCATCGA TGTGAAACAG TGCTACCCGA ACACGGCGAT TGTGGGGCTG CAGGTGGATG CGGAGCAGTT TGGCGGTCAG CAGATGACGG TGAACTACCA TATCCGCGGT CGCATCATCC AGGTACCGTC AAACTATGAC CCGGAAAAAC GCACGTACAG CGGCATCTGG GACGGCAGCC TGAAACCGGC ATACAGCAAC AACCCTGCCT GGTGCCTGTG GGACATGCTG ACCCACCCGC GCTACGGAAT GGGAAAACGC CTGGGGGCGG CGGATGTGGA CAAGTGGGCG CTGTATGCCA TTGCGCAGTA CTGCGACCAG ACGGTCCCGG ATGGTTTCGG GGGCACAGAG CCGCGGATGA CTTTCAATGC GTACCTGTCA CAACAGCGTA AGGCGTGGGA CGTTCTCAGT GATTTCTGCT CGGCGATGCG CTGTATGCCG GTATGGAACG GGCAGACGCT GACGTTTGTG CAGGACCGTC CGTCAGATGT GGTGTGGCCC TACACCAGCA GTGATGTGGT GGTGGATGAT AACGGCGTGG GGTTTCGCTA CAGCTTCAGC GCCCTGAAGG ACCGCCACAC GGCGGTGGAG GTGAATTACA CCGACCCGCA GAACGGCTGG CAGACCTCCA CGGAACTGGT GGAAGACCCG GAAGCCATAC TGCGCTACGG ACGCAACCTG CTGAAGATGG ATGCGTTCGG CTGCACCAGT CGCGGTCAGG CCCACCGTGC CGGGCTGTGG GTGATAAAGA CCGGACTGCT GGAAACGCAG ACGGTGGATT TCACGCTCGG GTCACAGGGG CTGCGTCACA CACCCGGTGA CATTATTGAA ATCTGTGATA ACGACTATGC CGGGACCATG ACCGGCGGAC GTGTCCTGTC CATCGATGCC GCCAGCCGCA CGCGCAGTAC TGCGACCAGA CGGTCCCGGA TGGTTTCGGG GGCACAGAGC CGCGGATGA
|
Protein sequence | MGKGGGKAHT PREAKDNLKS TQMMSVIDAI GEGPIEGPVK GLQSILVNKT PLTDTDGNPV IHGVTAVWRA GEQEQTPPEG FESSGAETGL GVEVTKAKPV TRTITSANID RLRVTFGVQS LLETTSKGDR NHSSVRLLIQ LQRNGNWVTE KDVTINGKTT SQFLASVILD NLPERPFNIR MVRETADSTS DQLQNKTLWS SYTEIIDVKQ CYPNTAIVGL QVDAEQFGGQ QMTVNYHIRG RIIQVPSNYD PEKRTYSGIW DGSLKPAYSN NPAWCLWDML THPRYGMGKR LGAADVDKWA LYAIAQYCDQ TVPDGFGGTE PRMTFNAYLS QQRKAWDVLS DFCSAMRCMP VWNGQTLTFV QDRPSDVVWP YTSSDVVVDD NGVGFRYSFS ALKDRHTAVE VNYTDPQNGW QTSTELVEDP EAILRYGRNL LKMDAFGCTS RGQAHRAGLW VIKTGLLETQ TVDFTLGSQG LRHTPGDIIE ICDNDYAGTM TGGRVLSIDA ASRTRSTATR RSRMVSGAQS RG
|
| |