Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2979 |
Symbol | wcaK |
ID | 6970131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2757339 |
End bp | 2758619 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643386819 |
Product | putative pyruvyl transferase |
Protein accession | YP_002271287 |
Protein GI | 209398906 |
COG category | [S] Function unknown |
COG ID | [COG2327] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.158831 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.000214392 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAATTAC TTATTCTGGG CAACCACACT TGCGGCAATC GTGGCGACAG CGCCATCTTG CGCGGCTTAC TTGATGCCAT CAACATCCTC AATCCACATA CCGAAGTGGA CGTGATGAGC CGCTATCCGG TCAGTTCTTC CTGGCTGCTC AACCGCCCGG TAATGGGTGA TCCGCTGTTC CTTCAAATGA AACAACACAA CAGCGCGGCG GGCGTTGTCG GGCGCGTTAA AAAAGTCCTC CGTCGCCGCT ATCAGCACCA GGTATTGCTC TCACGCGTCA CCGACACTGG CAAGCTGCGC AATATCGCCA TCGCCCAGGG ATTCACCGAC TTCGTGCGCC TACTGTCAGG TTACGACGCC ATTATTCAGG TCGGCGGATC GTTTTTTGTC GATCTCTACG GCGTGCCGCA GTTTGAACAT GCACTTTGCA CATTTATGGC GAAAAAGCCG CTGTTTATGA TTGGTCACAG CGTCGGTCCC TTCCAGGATG AGCAATTTAA CCAACTGGCG AACTACGTTT TTGGTCACTG CGACGCGCTG ATCTTGCGCG AATCGGTCAG CCTCGATCTA ATGAAACGCA GCAATATCAC CACTGCAAAA GTTGAACATG GCGTCGATAC CGCGTGGCTG GTCGATCACC ACACAGAAGA CTTCACCGCC AGCTATGCCG TCCAACACTG GCTGGACGTT GCCGCACAAC AGAAAACGGT GGCAATTACC CTGCGCGAAC TGGCACCGTT CGACAAACGC CTCGGCACCA CTCAGCAAGT GTATGAAAAA GCCTTTGCCG GGGTGGTCAA TCGCATTCTC GACGAAGGCT ATCAGGTCAT TGCGCTCTCC ACCTGTACGG GTATCGACAG CTATAACAAA GACGACCGCA TGGTGGCGCT CAACCTGCGC CAGCACATCA GCGATCCTGC CCGTTATCAC GTAGTGATGG ATGAACTTAA CGATCTGGAA ATGGGCAAAA TTCTCGGTGC CTGTGAACTC ACCGTCGGTA CGCGCCTGCA CTCCGCCATT ATCTCAATGA ACTTTGCTAC TCCGGCGATT GCCATCAACT ATGAACATAA ATCCGCCGGG GTTATGCAGC AGCTGGGACT ACCGGAGATG GCAATTGATA TCCGTCATTT ATTAGACGGC AGCCTGCAAG CGATGGTTGC GGATACCTTA GGCCAGCTTC CGGCGCTGAA CGCACGACTT AGCGAAGCCG TCAGTCGTGA GCGTCAGACG GGAATGCAGA TGGTGCAGTC CGTACTGGAA CGCATCGGGG AGGTGAAATG A
|
Protein sequence | MKLLILGNHT CGNRGDSAIL RGLLDAINIL NPHTEVDVMS RYPVSSSWLL NRPVMGDPLF LQMKQHNSAA GVVGRVKKVL RRRYQHQVLL SRVTDTGKLR NIAIAQGFTD FVRLLSGYDA IIQVGGSFFV DLYGVPQFEH ALCTFMAKKP LFMIGHSVGP FQDEQFNQLA NYVFGHCDAL ILRESVSLDL MKRSNITTAK VEHGVDTAWL VDHHTEDFTA SYAVQHWLDV AAQQKTVAIT LRELAPFDKR LGTTQQVYEK AFAGVVNRIL DEGYQVIALS TCTGIDSYNK DDRMVALNLR QHISDPARYH VVMDELNDLE MGKILGACEL TVGTRLHSAI ISMNFATPAI AINYEHKSAG VMQQLGLPEM AIDIRHLLDG SLQAMVADTL GQLPALNARL SEAVSRERQT GMQMVQSVLE RIGEVK
|
| |