Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3057 |
Symbol | |
ID | 6968913 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2832149 |
End bp | 2833372 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643386889 |
Product | hypothetical protein |
Protein accession | YP_002271357 |
Protein GI | 209400285 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000245353 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.000000000123111 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCATGG TTGAATGGAT AAGCCACGAG AAAAAAGATA GCCTGATACT ATTTGTACAT GGATTAAATG GTGGTTTGGA AACATGGAAC TTTAATAAAG AAACTTCATT TCCAAAACTA TTAGCAGAAG ATGAAGACAT TGGCAATGTA TTTGATATTG CATGTTTCAA TTATTTCACC AAATTTACAC AAACGTATGC TAAAACGTCA GGATTATGGG CTCGTATTTT CTCTAAGAAA AATAAGTTAG AGAGAAATTT ACCAACTGAT GAAATAGCTG AGTTATTGTA TACTGAGATT AGGGTAACGC TAAGTGATTA CTCTCGAATT ATAATAATCG CCCATAGTAT GGGGGGGGTA ATATCAAAAA ATCTAATTCT AAAAAAAGTT GAGCATGAGG AACAATCTAA TATAATTGGA TTCATTTCGT TAGCAGTCCC TCACTTTGGT GCTAAATTGG CGAATATAAC TTCAATGGTA TCTTCAAATG CTCAGTTAGT AGATCTTGGA TTGTTAAGTG AAGCGACCGA TACGTTGAAT AGGCGCTGGA TAAATTGTAG CAAAAAACTT CCTATCACTA GATATGTTTA TGGTTCTCAT GACACTATAG TTGATAAAAA AAGCGCATTA CCGATGGATA GTGAACGAAA TAACTCAATT GCAGTTAATG AAGGGCATAG CTCAATTTGC AAGCCCGAAA ACAGCTCGTC TACAGTGTTT GTTGCGGTAA AGCAATTTAT TCAGCAGATA AATTTAGAAG CACCCAAGAA GATATGTGTT GAGCGTTTTT CTGATGAGAA ACAATATGAT AATGAGTATT TCGTCTTAAA AATGATTGTC GCAGATATAC ATCAGGATAT CGCTATGCAT GCAAAGGAAT ATTATTATAA TGCTGAGCTT GCGAGAAATA TTTTTACGAG TGATTACGAT AGGAAACTAC TTGGTCATTT GTATTCTAAA ATAAGGGAGA TTTACCAGGA AGAGTATGAG CAATATATCG CGAATTCAAT TTCCCCAGAT AAATTTATTG CTGCTGTTCA TAGACGAATA GCCCAAGAGG ATAAGTCGTC GTTAGATTCT CTTATAAAAA GTTTAGAAAC GATACATAAG AAAGGAATGT TGCATCAACT GGCTAACAAA AATGACAGGG ATATTGTCTG GTCATCTGAA ACCAGCGTAG AAACTTTGGA GCAATTACGG CGAGGTAATC ATGAAGAAAA ATAA
|
Protein sequence | MSMVEWISHE KKDSLILFVH GLNGGLETWN FNKETSFPKL LAEDEDIGNV FDIACFNYFT KFTQTYAKTS GLWARIFSKK NKLERNLPTD EIAELLYTEI RVTLSDYSRI IIIAHSMGGV ISKNLILKKV EHEEQSNIIG FISLAVPHFG AKLANITSMV SSNAQLVDLG LLSEATDTLN RRWINCSKKL PITRYVYGSH DTIVDKKSAL PMDSERNNSI AVNEGHSSIC KPENSSSTVF VAVKQFIQQI NLEAPKKICV ERFSDEKQYD NEYFVLKMIV ADIHQDIAMH AKEYYYNAEL ARNIFTSDYD RKLLGHLYSK IREIYQEEYE QYIANSISPD KFIAAVHRRI AQEDKSSLDS LIKSLETIHK KGMLHQLANK NDRDIVWSSE TSVETLEQLR RGNHEEK
|
| |