Gene SbBS512_E4049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4049 
Symbol 
ID6269412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3781816 
End bp3783069 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content33% 
IMG OID641727889 
ProductO-antigen polymerase 
Protein accessionYP_001882321 
Protein GI187734123 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0576542 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGTTTT GTTGGAATGA AATTAATTCT GGTATAAAGT CTTTAATTCT CATATTATGT 
ATTTTTTCTT TAATGACTTT GTCTTTATGG GATGATGTTG CAACAAAGTT TCTTCATGCA
GCTGGAATTA TATCTGCATT GTATTTTCTT GCGACACCAA AAAAAACAAT AACTAATAAT
CCTACTTTGT TAATTTTCAT CTCATTATGT CTTTTGGGTA TCGTAAATAT CATCTGGTAT
TCACATTATA AAGTTTCAGG CTCTGTTTAT ACCAATGCAT ATCGTGGCCC AATGGAAACA
GGAAAAATTG CCTTGTGTAG CGCTTTTATT TTCTTAGTTC TTTTTGCTAA AAATGAGATG
AGAACAAAAA TAAAATTTGG GAAACTAATT CTGTTCGCAT CCCTGGCAAC GCAGTTACTT
TTTTTTGCGC ATGCCATGTG GCAACATTTC TATTTAAACG TCGACCGTGT TGCATTATCA
GCTTCCCACG CTACAACAGC AGGCTACATC ATCCTTTTTC CTTCTTTACT GGCATCAATT
CTCATTTTAA AATCCGACTT TAGACATAAA ACAACATTAT ATACAATTAA CTTCATGCTT
AGCTTATGTG CTGTCATAGT AACTGAGACG CGTGCAGCCA TATTAGTGTT TCCATTCTTT
GCGTTAATAT TAATCGTAAT GGATAGTTAT ATTAATAAGC GAATTAATTA TAAGTTATAT
TGTTTTATTG CGATTGCATT ATTAGCAGGT GTATTTTCTT TTAAAGATAC ATTGCTTATG
AGAATGAATG ACTTAAATAA CGATTTAGTT AATTATTCGC ATGATAACAC CAGAACTTCA
GTCGGTGCCC GTCTGGCAAT GTATGAAGTT GGCTTAAAAA CATATTCTCC AATAGGACAA
TCACTGGAAA AACGTGCAGA AAAAATACAT GAGCTAGAAG AAAAAGAGCC TAGATTGAGT
GGCGCTTTAC CCTTTGTAGA TTCTCATTTG CATAACGATC TCATAGATAC GTTATCAACG
CGTGGTATTC CTGGAGTTGT ATTAACAATT TTAGCATTTT CAGCAATACT CATATATGCC
TTAAGAACTG CTAAAGAACC TTATATTTTA ATCTTGCTTT TTTCACTACT GGTAGTAGGC
CTAAGTGATG TAATACTCTT TTCTAAACCG GTTCCGACTG CTGTGTTTGT CACCATAATA
TTGCTTTGTG CTTATTTTAA AGCACAATCA GACCAATATT TATTAGATAA GTAA
 
Protein sequence
MSFCWNEINS GIKSLILILC IFSLMTLSLW DDVATKFLHA AGIISALYFL ATPKKTITNN 
PTLLIFISLC LLGIVNIIWY SHYKVSGSVY TNAYRGPMET GKIALCSAFI FLVLFAKNEM
RTKIKFGKLI LFASLATQLL FFAHAMWQHF YLNVDRVALS ASHATTAGYI ILFPSLLASI
LILKSDFRHK TTLYTINFML SLCAVIVTET RAAILVFPFF ALILIVMDSY INKRINYKLY
CFIAIALLAG VFSFKDTLLM RMNDLNNDLV NYSHDNTRTS VGARLAMYEV GLKTYSPIGQ
SLEKRAEKIH ELEEKEPRLS GALPFVDSHL HNDLIDTLST RGIPGVVLTI LAFSAILIYA
LRTAKEPYIL ILLFSLLVVG LSDVILFSKP VPTAVFVTII LLCAYFKAQS DQYLLDK