Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2381 |
Symbol | |
ID | 6970786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2253137 |
End bp | 2254741 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 643386254 |
Product | hypothetical protein |
Protein accession | YP_002270736 |
Protein GI | 209400503 |
COG category | [S] Function unknown |
COG ID | [COG4529] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000071971 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.00180659 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAAAA TTGCAATTGT GGGTGCCGGG CCTACGGGGA TCTACACCTT ATTCTCGCTT CTACAGCAAC AAACTCTACT TTCTATTTCT ATCTTCGAGC AGGCTGACGA GGCCGGTGTC GGGATGCCAT ACAGTGATGA GGAAAACTCA AAAATGATGC TGGCAAATAT TGCCAGTATT GAAATACCGC CGATTTATTG TACGTATCTC GAATGGCTAC AAAAGCAAGA AGCCAGTCAT CTCCAGCGTT ATGGCGTTAA AAAAGAAACC TTGCACGATC GTCAGTTTTT ACCGCGAATT CTGCTGGGCG AATATTTCCG CGATCAATTT TTACGATTAG TAGACCAGGC ACGAAAGCAA AAATTTGCAG TGGCTGTTTA TGAATCATGC CAGGTTACCG ATCTGCAAAT TACAAATGCT GGCGTCATGC TCGCTACAAA TCAGGATTTA CCCAGCGAGA CGTTTGATTT AGCGGTGATC GCCACGGGTC ACGTCTGGCC TGATGAAGAA GAAGCAACCC GAACGTATTT TCCAAGCCCG TGGTCAGGCT TGATGGAAGC AAAGGTCGAT GCGTGTAACG TGGGTATTAT GGGAACATCC TTGAGCGGAC TGGATGCGGC AATGGCAGTG GCTATTCAGC ATGGTTCGTT CATTGAAGAT GATAAACAAC ACGTCGTTTT TCACCGCGAT AACGCAAGTG AAAAGCTAAA TATTACGTTA ATGTCGCGCA CGGGTATTTT ACCCGAAGCC GATTTCTATT GCCCTATTCC CTACGAGCCC TTACACATCG TCACTGATCA GGCATTAAAT GCTGAGATTC AAAAAGGCGA ATATGGCCTT TTGGATCGGG TATTTAGATT GATAGTAGAG GAAATCAAGT TTGCTGATCC AGACTGGAGT CAACGCATAG CCTTAGAGAG CCTGAATGTC GATTCCTTTG CTCAAGCCTG GTTTGCCGAG CGCAAACAAC GCGACCAATT TGACTGGGCA GAAAAAAATC TCCAGGAAGT CGAACGCAAT AAACGAGAAA AACATACTGT TCCCTGGCGT TATGTCATTC TGCGCCTGCA TGAAGCCGTA CAGGAAATTG TTCCACATCT GAATGAACAC GACCATAAAC GGTTCAGTAA AGGCCTTGCC CGGGTTTTCA TCGATAATTA TGCGGCAATC CCTTCAGAGT CTATTCGTCG CCTACTTGCC TTACGTGAAG CGGGAATCAT TCATATTCTC GCTCTCGGTG AAGACTACAA AATGGAAATT AACGAGTCGC GCACCGTCCT GAAAACGGAA GACAACAGCT ACTCGTTTGA CGTTTTTATT GATGCCCGCG GGCAGCGTCC GCTTAAAGTG AAAGATATTC CTTTCCCTGG ACTACGCGAA CAATTACAGA AAACAGGGGA TGAAATCCCT GATGTTGGTG AAGATTATAC GTTACAGCAA CCCGAAGATA TTCGTGGGCG CGTAGCGTTC GGCGCGTTGC CCTGGTTGAT GCACGACCAG CCTTTCGTTC AGGGACTTAC GGCATGTGCA GAAATTGGTG AGGCGATGGC TCGGGCGGTC GTAAAGCCTG CATCCCGTGC TCGTCGGCGT CTTTCGTTTG ATTAA
|
Protein sequence | MKKIAIVGAG PTGIYTLFSL LQQQTLLSIS IFEQADEAGV GMPYSDEENS KMMLANIASI EIPPIYCTYL EWLQKQEASH LQRYGVKKET LHDRQFLPRI LLGEYFRDQF LRLVDQARKQ KFAVAVYESC QVTDLQITNA GVMLATNQDL PSETFDLAVI ATGHVWPDEE EATRTYFPSP WSGLMEAKVD ACNVGIMGTS LSGLDAAMAV AIQHGSFIED DKQHVVFHRD NASEKLNITL MSRTGILPEA DFYCPIPYEP LHIVTDQALN AEIQKGEYGL LDRVFRLIVE EIKFADPDWS QRIALESLNV DSFAQAWFAE RKQRDQFDWA EKNLQEVERN KREKHTVPWR YVILRLHEAV QEIVPHLNEH DHKRFSKGLA RVFIDNYAAI PSESIRRLLA LREAGIIHIL ALGEDYKMEI NESRTVLKTE DNSYSFDVFI DARGQRPLKV KDIPFPGLRE QLQKTGDEIP DVGEDYTLQQ PEDIRGRVAF GALPWLMHDQ PFVQGLTACA EIGEAMARAV VKPASRARRR LSFD
|
| |