Gene Information |
Locus tag | EcolC_1598 |
Symbol | wcaM |
ID | 6065621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 1777533 |
End bp | 1778927 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641601014 |
Product | putative colanic acid biosynthesis protein |
Protein accession | YP_001724584 |
Protein GI | 170019630 |
COG category | |
COG ID | |
TIGRFAM ID | |
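The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


# Values taken from the record for EcolC_1598 (wcaM)
length = gene_length_bp(1777533, 1778927)
print(length)                     # 1395
print(protein_length_aa(length))  # 464
```

Under these assumptions the computed values agree with the Gene Length (1395 bp) and Protein Length (464 aa) fields.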
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCATTTA AAAAACTCTC CCGACGCACC TTCCTGACGG CAAGCTCCGC GCTTGCCTTC CTTCATACCC CTTTCGCTCG TGCACTTCCC GCCCGACAAA GCGTTAACAT TAACGACTAC AACCCACACG ACTGGATCGC CTCATTTAAA CAAGCCTTCA GCGAAGGGCA AACGGTCGTC GTGCCTGCCG GATTGGTTTG TGACAATATC AACACCGGCA TCTTTATCCC TCCCGGTAAA ACGTTACACA TCCTTGGAAG CCTGCGCGGC AACGGCAGAG GGCGATTTGT CTTACAGGAC GGCAGCCAGG TGACAGGGGA GGAGGGCGGC AGTATGCATA ACATCACCCT GGATGTGCGC GGTTCTGACT GCTCCATCAA AGGGCTGGTG ATGAGCGGCT TTGGCCCGGT AACGCAGATT TATATCGGCG GCAAAAACAA ACGGGTCATG CGCAACCTGA CCATCGATAA CCTCACTGTC AGCCACGCTA ATTACGCCAT CTTACGCCAG GGATTTCATA ATCAGATTAT CGGTGCCAAC ATCACCAACT GTAAGTTCAG CGACTTACAG GGCGACGCCA TCGAATGGAA CGTGGCGATT AACGACAGTG ATATTTTGAT ATCTGACCAT GTCATCGAGC GCATCAACTG TACCAACGGC AAAATCAACT GGGGAATCGG CATAGGCCTT GCAGGAAGCA CTTACGATAA CAACTACCCG GAAGACCAGG CAGTGAAAAA CTTTGTCGTG GCGAATATCA CGGGATCGGA TTGTCGGCAG TTGATCCATG TTGAAAATGG CAAACATTTT GTTATTAGTA ATATCAAAGC CCGCAATATC ACGCCGGATT TCAGTAAGAA AGCGGGCATT GATAACGCCA CGGTCGCTAT TTACGGTTGT GACAATTTCG TGATTGATAA TATTGAAATG ATTAATAGCG CCGGGATGTT AATCGGCTAT GGGGTAATTA AAGGCAAATA TCTCTCGATA CCGCAAAATT TCCGAGTGAA TAATATTCAA CTGGATAACA CCCATCTTGC TTATAAATTG CGCGGCATCC AAATCTCCGC CGGGAATGCT GTCTCCTTTG TGGCGCTGAC TAACATTGAG ATGAAGCGTG CGTCGCTGGA GTTACACAAC AAACCGCAAC ATCTTTTTAT GCGTAATATC AAGGTGATGC AGGAATCCTC AGTTGGACCA GCATTGAGCA TGAACTTCGA CATGCGCAAA GACGTTCGCG GCGTCTTTAT GGCGAAAAAA GAAACACTCC TGTCTCTTGC AAATGTTCAT GCGGTGAATG AAAGAGGGCA AAGCTCCGTC GATATCGACA GGATAAATCA CCATATTGTT AATGTGGAAA AGATTAACTT TAGATTGCCG GAACGGAGGG AGTAG
Protein sequence | MPFKKLSRRT FLTASSALAF LHTPFARALP ARQSVNINDY NPHDWIASFK QAFSEGQTVV VPAGLVCDNI NTGIFIPPGK TLHILGSLRG NGRGRFVLQD GSQVTGEEGG SMHNITLDVR GSDCSIKGLV MSGFGPVTQI YIGGKNKRVM RNLTIDNLTV SHANYAILRQ GFHNQIIGAN ITNCKFSDLQ GDAIEWNVAI NDSDILISDH VIERINCTNG KINWGIGIGL AGSTYDNNYP EDQAVKNFVV ANITGSDCRQ LIHVENGKHF VISNIKARNI TPDFSKKAGI DNATVAIYGC DNFVIDNIEM INSAGMLIGY GVIKGKYLSI PQNFRVNNIQ LDNTHLAYKL RGIQISAGNA VSFVALTNIE MKRASLELHN KPQHLFMRNI KVMQESSVGP ALSMNFDMRK DVRGVFMAKK ETLLSLANVH AVNERGQSSV DIDRINHHIV NVEKINFRLP ERRE
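As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates a space-grouped DNA string codon by codon. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), and it assumes translation stops at the first stop codon. The helper name `translate` is hypothetical, not part of the record or of any library.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base);
# these assignments are shared by the standard code and table 11.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring spaces, up to the first stop codon."""
    seq = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record
print(translate("ATGCCATTTA AAAAACTCTC CCGACGCACC"))  # MPFKKLSRRT
```

The output for the first 30 bases matches the first ten residues of the listed protein sequence; passing the full gene sequence to the same function would be one way to check the remainder.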