Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2977 |
Symbol | wcaM |
ID | 6969348 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2754717 |
End bp | 2756111 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 643386817 |
Product | putative colanic acid biosynthesis protein |
Protein accession | YP_002271285 |
Protein GI | 209397395 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000190544 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0000308601 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCATTTA AAAAACTCTC CCGACGCACC TTCCTGACGG CAAGCTCGGC GCTTGCCTTC CTTCATACCC CTTTCGCTCG CGCATTTCCC GCCCGACAAA GCGTTAATAT CAACGACTAC AACCCTCACG ACTGGATCGC CTCATTTAAA CAAGCCTTCA GCGAAGGGCA AACGGTTGTC GTGCCTGCTG GATTCGTTTG TGACAATATC AACACCGGCA TCTTCATTCC TCCTGGCAAA ACGTTACACA TCCTTGGAAG CCTGCGCGGC AACGGCAGAG GGCGATTTGT CTTACAGGAC GGCAGCCAGG TGACAGGGGA GGAGGGCGGC AGTATGCATA ACATCACCCT GGATGTGCGT GGTTCTGACT GCACCATCAA AGGGCTGGCG ATGAGCGGCT TTGGTCCGGT AACGCAGATT TATATCGGCG GCAAAAACAA ACGGGTCATG CGCAACCTGA CCATCGATAA CCTGACCATT AGCCACGCTA ATTACGCCAT CTTACGCCAG GGATTTCATA ACCAGATTAT CGGTGCCAAC ATCACCAATT GTAAGTTCAG CGACTTACAG GGCGATGCCA TTGAATGGAA CGTGGCGATT AACGACCGCG ATATTTTGAT CTCCGACCAT GTCATCGAGC GCATCAACTG TACTAACGGC AAAATCAACT GGGGCATTGG CATAGGCCTT GCGGGAAGCA CTTACGATAA TAATTACCCG GAAGATCAGG CAGTGAAAAA CTTTGTCGTG GCGAATATCA CGGGATCGGA TTGTCGGCAG TTGATACATG TTGAAAATGG TAAACATTTT GTTATTCGTA ATATCAAAGC CCGCAATATC ACGCAGGATT TCAGTAAGAA AGCAGGTATT GATAACGCCA CAGTCGCTAT TTACGGTTGT GACAATTTCG TGATTGATAA TATTGAAATG ATTAATAGCG CCGGGATGTT AATCGGCTAT GGGGTAATTA AAGGCAATTA TCTCTCGATT CCGCAAAATT TCCGAGTGAA TAATATTCAA CTGGATAACA CCCATCTTGC TTATAAATTG CGCGGCCTCC AAATCTCTGC CGGGAATGCC GTCTCCTTTG TGGCACTAAC TAACATTGAG ATGAAGCGTG CGTCGCTGGA GTTACACAAC AAACCGCAAC ATCTTTTTCT GCGTAATATC AAAGTGATGC AGGAATCCTC TGTTGGACCC GCATTGATTA TGAACTTCGA CATGCGCAAA GACGTTCGAG GCGTCTTTAT GGCGAAAGAA GAAACACTGC TGTCTCTTGC AAATGTTCAT GCGGTGAATG AGAAAGGACA AAGCTCCGTC GATATCGACA GGATTAATCA CCATATTGTT AATGTGGAAA AGATTAACTT TAGATTGCCG GAACGGAGAG AGTAG
|
Protein sequence | MPFKKLSRRT FLTASSALAF LHTPFARAFP ARQSVNINDY NPHDWIASFK QAFSEGQTVV VPAGFVCDNI NTGIFIPPGK TLHILGSLRG NGRGRFVLQD GSQVTGEEGG SMHNITLDVR GSDCTIKGLA MSGFGPVTQI YIGGKNKRVM RNLTIDNLTI SHANYAILRQ GFHNQIIGAN ITNCKFSDLQ GDAIEWNVAI NDRDILISDH VIERINCTNG KINWGIGIGL AGSTYDNNYP EDQAVKNFVV ANITGSDCRQ LIHVENGKHF VIRNIKARNI TQDFSKKAGI DNATVAIYGC DNFVIDNIEM INSAGMLIGY GVIKGNYLSI PQNFRVNNIQ LDNTHLAYKL RGLQISAGNA VSFVALTNIE MKRASLELHN KPQHLFLRNI KVMQESSVGP ALIMNFDMRK DVRGVFMAKE ETLLSLANVH AVNEKGQSSV DIDRINHHIV NVEKINFRLP ERRE
|
| |