Gene SbBS512_E1188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1188 
SymbolwcaL 
ID6271131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1093135 
End bp1094355 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content55% 
IMG OID641725320 
Productcolanic acid biosynthesis glycosyl transferase WcaL 
Protein accessionYP_001879834 
Protein GI187730685 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTCG GCTTCTTTTT ACTGAAATTT CCGCTGTCGT CGGAAACCTT CGTCCTCAAT 
CAAATTACCG CGTTTATTGA TATGGGCTTT GAGGTGGAGA TTGTCGCGCT GCAAAAAGGC
GACACCCAGA ATACCCACGC GGCATGGACG AAATACAACC TTGCCGCCAG AACCCGCTGG
TTACAGGACG AACCACAAGG CAAAGTTGCG AAACTGCGCC ACCGCGCCAG CCAGACCTTA
CGCGGCATTC ATCGTAAAAA TACCTGGCAG GCGCTTAACC TCAAACGCTA TGGTGCCGAG
TCGCGGAACC TGATTTTGTC TGCCATTTGC GGCCAGGTCG CAACACCATT TTATGCCGAT
GTCTTTATCG CTCATTTTGG CCCTGCGGGG GTAACCGCAG CAAAACTACG CGAACTGGGT
GTGATTCGCG GCAAAATTGC CACCATCTTC CACGGTATTG ATATCTCCAG TCGGGAAGTG
CTCAACCACT ACACTCCCGA ATATCAACAA CTGTTTCGCC GTGGCGACCT GATGTTACCG
ATAAGCGATC TGTGGGCCGG AAGGCTGCAA AAAATGGGCT GCCCGAGGGA AAAAATCGCC
GTATCGCGCA TGGGCGTGGA CATGACGCGT TTTAGCCCGC GTCCGGTGAA AGCGCCCGCA
ACGCCGCTGG AAATCATCTC CGTCGCACGC TTAACCGAAA AAAAAGGCCT GCATGTGGCG
ATCGAAGCCT GCCGTCAGTT GAAAGAGCAG GGCATGACAT TTCGCTATCG CATCCTCGGC
ATTGGCCCGT GGGAACGACG CCTGCGTACC CTCATCGAAC AATATCAACT GGAAGATGTG
GTAGAGATGC CGGGCTTTAA ACCGAGCCAC GAAGTGAAAG CGATGCTCGA CGACGCGGAT
GTCTTCCTGT TGCCATCGGT AACGGGCGCG GATGGCGATA TGGAAGGCAT TCCGGTAGCG
CTGATGGAAG CGATGGCGGT CGGCATTCCG GTGGTTTCTA CTCTGCATAG CGGAATACCG
GAACTGGTGG AGGCTGACAA ATCCGGCTGG CTGGTGCCTG AGAACGATGC TCGCGCACTG
GCGCAACGCT TGGCGGCATT TAGCCAACTG GACACCGACG AACTGGCTCC GGTCGTCAAA
CGCGCGCGCG AAAAAGTCGA ACACGATTTT AACCAGCAGG TGATTAATCG AGAACTCGCC
AGCTTGTTGC AGGCTTTATA G
 
Protein sequence
MKVGFFLLKF PLSSETFVLN QITAFIDMGF EVEIVALQKG DTQNTHAAWT KYNLAARTRW 
LQDEPQGKVA KLRHRASQTL RGIHRKNTWQ ALNLKRYGAE SRNLILSAIC GQVATPFYAD
VFIAHFGPAG VTAAKLRELG VIRGKIATIF HGIDISSREV LNHYTPEYQQ LFRRGDLMLP
ISDLWAGRLQ KMGCPREKIA VSRMGVDMTR FSPRPVKAPA TPLEIISVAR LTEKKGLHVA
IEACRQLKEQ GMTFRYRILG IGPWERRLRT LIEQYQLEDV VEMPGFKPSH EVKAMLDDAD
VFLLPSVTGA DGDMEGIPVA LMEAMAVGIP VVSTLHSGIP ELVEADKSGW LVPENDARAL
AQRLAAFSQL DTDELAPVVK RAREKVEHDF NQQVINRELA SLLQAL