Gene SbBS512_E1189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1189 
SymbolwcaM 
ID6270605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1094366 
End bp1095760 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content47% 
IMG OID641725321 
Productputative colanic acid biosynthesis protein 
Protein accessionYP_001879835 
Protein GI187733294 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATTTA AAAAACTCTC CCGACGTACC TTCCTGACGG CAAGCTCGGC GCTTGCCTTC 
CTCCATACCC CTTTCGCCCG CGCACTTCCC GCCCGACAAA GCGTTAACAT TAACGACTAC
AACCTACACG ACTGGATCGC CTCATTTAAA CAAGCCTTCG GCGAAGGGCA AACGGTCGTC
GTGCCTGCCG GATTCGTTTG TGACAATATC AACGCCGGCA TCTTCATTCC TCCTGGCAAA
ACGTTACACA TCCTCGGAAG CCTGCGAGGC AACGGCAGAG GGCGATTTGT CTTACAGGAC
GGCAGCCAGG TGACAGGGGA GGAGGGCGGC AGTATGCATA ACATCACCCT GGATGTGCGT
GGCTCTGACT GCACCATCAA AGGGCTGGCG ATGAGCGGCT TTGGCCCGGT AACGCAGATT
TATATCGGCG GCAAAAACAA ACGGGTCATG CGCAACCTGA CCATCGATAA CCTCACTGTC
AGCCACGCTA ATTACGCCAT CTTACGCCAG GGATTTCATA ACCAGATTAT CGGTGCCAAC
ATCACCAATT GTAAGTTCAG CGACTTACAG GGCGATGCCA TTGAATGGAA CGTGGCAATT
AACGACAGTG ATATTTTGAT CTCCGACCAC ATCATCGAGC GCATCAACTG TACTAACGGA
AAAATCAACT GGGGCATTGG CATAGGTCTT GCGGGAAGCA CTTATGATAA TAATTACCCG
GAAGACCAGT CAGTGAAAAA CTTTGTCGTG GCGAATATCA CGGGATCGGA TTGTCGGCAG
TTGATCCATG TTGAAAATGG TAAACATTTT GTTATTCGTA ATATCAAAGC CCGCAATATC
ACGCCGGATT TCAGTAAGAA AGCAGGCATT GATAACGCGA CAGTTGCTAT TTACGGTTGT
GACAATTTCG TGATTGATAA TATTGAAATG ATTAATAGTG CCGGGATGTT AATCGGCTAT
GGGGTAATTA AAGGCAAATA TCTCTCGATA CCGCAAAATT TCCAAGTGAA TAATATTCAA
CTGGATAACA CCCATCTTGC TTATAAATTG CGCGGCATCC AAATCTCCGC CGGGAATGCT
GTCTCCTTTG TGGCGCTGAC TAACATTGAG ATGAAGCGTG CGTCGCTGGA GTTACACAAC
AAACCGCAAC ATCTTTTTAT GCGTAATATC AAGGTGATGC AGGAATCCTC AGTTGGACCA
GCATTGAGCA TGAACTTCGA CATGCGCAAA GACGTTCGCG GCGTCTTTAT GGCGAAAAAA
GAAACACTGC TGTCTCTTGC AAATGTTCAT GCGGTGAATG AAAAAGGGCA AAGCTCCGTC
GATATCGACA GAGTTAATCA CCATATTGTT AATGTGGAAA AGATTAACTT TAGATTGCCG
GAACGGAGAG AGTAG
 
Protein sequence
MPFKKLSRRT FLTASSALAF LHTPFARALP ARQSVNINDY NLHDWIASFK QAFGEGQTVV 
VPAGFVCDNI NAGIFIPPGK TLHILGSLRG NGRGRFVLQD GSQVTGEEGG SMHNITLDVR
GSDCTIKGLA MSGFGPVTQI YIGGKNKRVM RNLTIDNLTV SHANYAILRQ GFHNQIIGAN
ITNCKFSDLQ GDAIEWNVAI NDSDILISDH IIERINCTNG KINWGIGIGL AGSTYDNNYP
EDQSVKNFVV ANITGSDCRQ LIHVENGKHF VIRNIKARNI TPDFSKKAGI DNATVAIYGC
DNFVIDNIEM INSAGMLIGY GVIKGKYLSI PQNFQVNNIQ LDNTHLAYKL RGIQISAGNA
VSFVALTNIE MKRASLELHN KPQHLFMRNI KVMQESSVGP ALSMNFDMRK DVRGVFMAKK
ETLLSLANVH AVNEKGQSSV DIDRVNHHIV NVEKINFRLP ERRE