Gene Information |
Locus tag | ECH74115_5302 |
Symbol | |
ID | 6967942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4943266 |
End bp | 4944696 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643388965 |
Product | hypothetical protein |
Protein accession | YP_002273374 |
Protein GI | 209396984 |
COG category | [S] Function unknown |
COG ID | [COG5339] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
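The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS length includes the terminal stop codon; the record does not state either convention, but both agree with the listed values. The function names are hypothetical and used for illustration only.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming the stop codon is part of the CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4943266, 4944696)
print(length)                     # 1431
print(protein_length_aa(length))  # 476
```

Both results match the Gene Length (1431 bp) and Protein Length (476 aa) fields.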
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0147516 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.759699 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATACGTA AGTCAGCTAC AGGTGTTATT GTTGCGTTAG CCGTAATCTG GGGTGGTGGC ACATGGTACA CAGGTACGCA AATTCAGCCT GGTGTCGAAA AATTTATTAA AGATTTTAAC GATGCTAAAA AGAAAGGTGA ACATGCCTAC GATATGACGT TAAGTTATCA AAATTTTGAC AAAGGCTTTT TTAATTCCCG TTTTCAAATG CAAATGACAT TCGATAACGG TGCACCCGAT CTCAATATCA AGCCAGGCCA GAAAGTTGTA TTTGATGTGG ATGTTGAGCA CGGTCCGTTG CCCATCACAA TGTTAATGCA TGGTAATGTC ATCCCAGCAC TGGCAGCGGC AAAAGTGAAC TTAGTGAATA ATGAACTGAC ACAACCGCTA TTTATCGCCG CGAAAAATAA ATCGCCCGTG GAAGCGACAT TGCGATTCGC GTTTGGTGGC TCATTCTCTA CGACATTAGA TGTTGCCCCT GCAGAGTATG GAAAGTTTTC TTTTGGTGAG GGCCAGTTTA CTTTTAATGG TGATAGTAGT TCATTGTCTA ACCTGGATAT TGAAGGCAAA GTCGAAGATA TTGTTCTGCA ATTATCACCA ATGAACAAAG TAACGGCAAA AAGTTTTACC ATTGATTCTC TGGCGCGATT AGAAGAAAAG AAATTTCCGG TTGGTGAAAG CGAGTCGAAA TTTAATCAGA TTAACATTAT CAATCACGGG GAAGACGTTG CCCAAATCGA TGCTTTCGTT GCAAAAACCA GGCTGGATCG CGTTAAAGAC AAAGATTATA TCAATGTCAA TCTGACCTAC GAACTTGATA AGTTAACAAA AGGGAATCAG CAACTCGGTA GTGGTGAGTG GTCATTGATT GCTGAATCTA TTGATCCCTC AGCGGTGCGC CAATTTATCA TCCAGTATAA CATTGCGATG CAGAAGCAGC TTGCTGCACA CCCTGAATTA GCAAACGATG AAGTTGCTCT GCAAGAAGTG AATGCTGCAT TGTTCAAAGA GTATTTACCG TTATTACAAC AAAGTGAGCC GACCATTAAA CAACCGGTAA GATGGAAGAA CGCACTCGGC GAACTAAATG CCAATCTGGA TATCAGTATT GCCGACCCAG CCAAATCTTC ATCATCCACA AACAAAGATA TCAAATCGCT CAATTTTGAT GTGAAGTTAC CGCTTAATGT CGTCACAGAA ACCGCAAAAC AGCTTAATTT ATCTGAAGGA ATGGATGCGG AAAAAGCGCA AAAGCAGGCT GATAAACAAA TCAGCGGGAT GATGACATTA GGTCAGATGT TTCAGTTAAT CACGATTGAC AACAATACCG CCTCGCTGCA GCTGCGTTAT ACACCGGGTA AAGTTGTTTT TAACGGACAG GAGATGAGCG AAGAAGAATT TATGTCTCGT GCCGGACGTT TTGTTCATTA A
|
Protein sequence | MIRKSATGVI VALAVIWGGG TWYTGTQIQP GVEKFIKDFN DAKKKGEHAY DMTLSYQNFD KGFFNSRFQM QMTFDNGAPD LNIKPGQKVV FDVDVEHGPL PITMLMHGNV IPALAAAKVN LVNNELTQPL FIAAKNKSPV EATLRFAFGG SFSTTLDVAP AEYGKFSFGE GQFTFNGDSS SLSNLDIEGK VEDIVLQLSP MNKVTAKSFT IDSLARLEEK KFPVGESESK FNQINIINHG EDVAQIDAFV AKTRLDRVKD KDYINVNLTY ELDKLTKGNQ QLGSGEWSLI AESIDPSAVR QFIIQYNIAM QKQLAAHPEL ANDEVALQEV NAALFKEYLP LLQQSEPTIK QPVRWKNALG ELNANLDISI ADPAKSSSST NKDIKSLNFD VKLPLNVVTE TAKQLNLSEG MDAEKAQKQA DKQISGMMTL GQMFQLITID NNTASLQLRY TPGKVVFNGQ EMSEEEFMSR AGRFVH
|
| |
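The sequences above are printed in space-separated blocks of ten characters, so they need to be normalized before any computation. The following is a minimal sketch of how the gene sequence could be cleaned up, its GC fraction computed, and its translation compared with the protein sequence. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs in its set of permitted start codons, which this sketch does not model (the gene here starts with ATG, so that does not matter for this record).

```python
# Standard codon-to-amino-acid assignments, bases ordered T, C, A, G.
# NCBI translation table 11 uses the same assignments; it only allows
# additional start codons, which this sketch ignores.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def normalize(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = normalize(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
first_blocks = "ATGATACGTA AGTCAGCTAC AGGTGTTATT"
print(translate(first_blocks))  # MIRKSATGVI
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.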