Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1189 |
Symbol | wcaM |
ID | 6270605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1094366 |
End bp | 1095760 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641725321 |
Product | putative colanic acid biosynthesis protein |
Protein accession | YP_001879835 |
Protein GI | 187733294 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATTTA AAAAACTCTC CCGACGTACC TTCCTGACGG CAAGCTCGGC GCTTGCCTTC CTCCATACCC CTTTCGCCCG CGCACTTCCC GCCCGACAAA GCGTTAACAT TAACGACTAC AACCTACACG ACTGGATCGC CTCATTTAAA CAAGCCTTCG GCGAAGGGCA AACGGTCGTC GTGCCTGCCG GATTCGTTTG TGACAATATC AACGCCGGCA TCTTCATTCC TCCTGGCAAA ACGTTACACA TCCTCGGAAG CCTGCGAGGC AACGGCAGAG GGCGATTTGT CTTACAGGAC GGCAGCCAGG TGACAGGGGA GGAGGGCGGC AGTATGCATA ACATCACCCT GGATGTGCGT GGCTCTGACT GCACCATCAA AGGGCTGGCG ATGAGCGGCT TTGGCCCGGT AACGCAGATT TATATCGGCG GCAAAAACAA ACGGGTCATG CGCAACCTGA CCATCGATAA CCTCACTGTC AGCCACGCTA ATTACGCCAT CTTACGCCAG GGATTTCATA ACCAGATTAT CGGTGCCAAC ATCACCAATT GTAAGTTCAG CGACTTACAG GGCGATGCCA TTGAATGGAA CGTGGCAATT AACGACAGTG ATATTTTGAT CTCCGACCAC ATCATCGAGC GCATCAACTG TACTAACGGA AAAATCAACT GGGGCATTGG CATAGGTCTT GCGGGAAGCA CTTATGATAA TAATTACCCG GAAGACCAGT CAGTGAAAAA CTTTGTCGTG GCGAATATCA CGGGATCGGA TTGTCGGCAG TTGATCCATG TTGAAAATGG TAAACATTTT GTTATTCGTA ATATCAAAGC CCGCAATATC ACGCCGGATT TCAGTAAGAA AGCAGGCATT GATAACGCGA CAGTTGCTAT TTACGGTTGT GACAATTTCG TGATTGATAA TATTGAAATG ATTAATAGTG CCGGGATGTT AATCGGCTAT GGGGTAATTA AAGGCAAATA TCTCTCGATA CCGCAAAATT TCCAAGTGAA TAATATTCAA CTGGATAACA CCCATCTTGC TTATAAATTG CGCGGCATCC AAATCTCCGC CGGGAATGCT GTCTCCTTTG TGGCGCTGAC TAACATTGAG ATGAAGCGTG CGTCGCTGGA GTTACACAAC AAACCGCAAC ATCTTTTTAT GCGTAATATC AAGGTGATGC AGGAATCCTC AGTTGGACCA GCATTGAGCA TGAACTTCGA CATGCGCAAA GACGTTCGCG GCGTCTTTAT GGCGAAAAAA GAAACACTGC TGTCTCTTGC AAATGTTCAT GCGGTGAATG AAAAAGGGCA AAGCTCCGTC GATATCGACA GAGTTAATCA CCATATTGTT AATGTGGAAA AGATTAACTT TAGATTGCCG GAACGGAGAG AGTAG
|
Protein sequence | MPFKKLSRRT FLTASSALAF LHTPFARALP ARQSVNINDY NLHDWIASFK QAFGEGQTVV VPAGFVCDNI NAGIFIPPGK TLHILGSLRG NGRGRFVLQD GSQVTGEEGG SMHNITLDVR GSDCTIKGLA MSGFGPVTQI YIGGKNKRVM RNLTIDNLTV SHANYAILRQ GFHNQIIGAN ITNCKFSDLQ GDAIEWNVAI NDSDILISDH IIERINCTNG KINWGIGIGL AGSTYDNNYP EDQSVKNFVV ANITGSDCRQ LIHVENGKHF VIRNIKARNI TPDFSKKAGI DNATVAIYGC DNFVIDNIEM INSAGMLIGY GVIKGKYLSI PQNFQVNNIQ LDNTHLAYKL RGIQISAGNA VSFVALTNIE MKRASLELHN KPQHLFMRNI KVMQESSVGP ALSMNFDMRK DVRGVFMAKK ETLLSLANVH AVNEKGQSSV DIDRVNHHIV NVEKINFRLP ERRE
|
| |