Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1017 |
Symbol | wcaK |
ID | 6145929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1036956 |
End bp | 1038236 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615904 |
Product | putative pyruvyl transferase |
Protein accession | YP_001743096 |
Protein GI | 170683425 |
COG category | [S] Function unknown |
COG ID | [COG2327] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.826012 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTAC TTATTCTGGG CAACCACACT TGCGGCAATC GTGGCGACAG CGCCATCTTG CGCGGCTTAC TTGATGCCAT CAACATCCTC AATCCACATG CCGAAGTGGA CGTGATGAGC CGCTATCCGG TCAGTTCTTC CTGGCTGCTC AACCGCCCGG TAATGGGCGA TCCGCTGTTC CTGCAAATGA AACAACACAA CAGCGCGGCG GGCGTTGTCG GGCGCGTTAA AAAAGTCCTC CGTCGCCGCT ATCAGCACCA GGTATTGCTC TCACGCGTCA CCGACACTGG CAAGCTGCGT AATATCGCCA TCGCCCAGGG ATTCACCGAC TTCGTGCGCC TGCTGTCAGG TTACGACGCC ATTATTCAGG TCGGCGGATC GTTTTTTGTC GATCTCTATG GCGTTCCGCA GTTTGAACAT GCGCTTTGCA CTTTCATGGC GAAAAAGCCG CTGTTTATGA TTGGTCACAG CGTCGGCCCG TTCCAGGATG AGCAATTTAA CCAACTGGCG AACTACGTTT TTGGTCACTG TGACGCGCTG ATCCTGCGCG AATCGGTCAG CCTTGATTTG ATGAAACGCA GCAATATCAC CACCGCAAAA GTGGAACATG GCGTCGATAC CGCGTGGCTG GTCGATCACC ACACAGAAGA CTTCACCGCC AGCTATGCCG TTCAACATTG GCTGGACGTT GCCGCACAAC AGAAAACGGT GGCAATTACC CTGCGCGAAC TGGCACCGTT CGACAAACGT CTCGGCACCA CTCAACAAGC GTATGAAAAA GCCTTTGCCG GGGTGGTCAA TCGCATTCTC GACGAAGGCT ATCAGGTTAT TGCGCTCTCC ACCTGTACGG GCATCGACAG CTATAACAAA GACGACCGCA TGGTGGCGCT CAACCTGCGC CAGCACATCA GCGATCCTGC CCGTTACCAC GTGGTGATGG ATGAACTCAA CGATCTGGAA ATGGGCAAAA TTCTCGGTGC CTGTGAACTC ACCGTCGGTA CGCGCCTGCA CTCAGCCATT ATCTCAATGA ACTTTGCTAC TCCGGCGATT GCCATCAACT ATGAACATAA ATCCGCCGGG ATTATGCAAC AACTGGGGCT ACCAGAGATG GCAATTGATA TCCGTCATTT ACTGGACGGC AGCCTGCAAG CGATGGTTGC GGATACCTTA GGCCAGCTTC CGGCGCTGAA CGCACGACTT AACGAAGCTG TCAGTCGTGA GCGTCATACA GGAATGCAGA TGGTGCAATC TGTGCTTGAA CGCATCGGGG AGGTGAAATG A
|
Protein sequence | MKLLILGNHT CGNRGDSAIL RGLLDAINIL NPHAEVDVMS RYPVSSSWLL NRPVMGDPLF LQMKQHNSAA GVVGRVKKVL RRRYQHQVLL SRVTDTGKLR NIAIAQGFTD FVRLLSGYDA IIQVGGSFFV DLYGVPQFEH ALCTFMAKKP LFMIGHSVGP FQDEQFNQLA NYVFGHCDAL ILRESVSLDL MKRSNITTAK VEHGVDTAWL VDHHTEDFTA SYAVQHWLDV AAQQKTVAIT LRELAPFDKR LGTTQQAYEK AFAGVVNRIL DEGYQVIALS TCTGIDSYNK DDRMVALNLR QHISDPARYH VVMDELNDLE MGKILGACEL TVGTRLHSAI ISMNFATPAI AINYEHKSAG IMQQLGLPEM AIDIRHLLDG SLQAMVADTL GQLPALNARL NEAVSRERHT GMQMVQSVLE RIGEVK
|
| |