Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4049 |
Symbol | |
ID | 6269412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3781816 |
End bp | 3783069 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 641727889 |
Product | O-antigen polymerase |
Protein accession | YP_001882321 |
Protein GI | 187734123 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.0576542 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTTTT GTTGGAATGA AATTAATTCT GGTATAAAGT CTTTAATTCT CATATTATGT ATTTTTTCTT TAATGACTTT GTCTTTATGG GATGATGTTG CAACAAAGTT TCTTCATGCA GCTGGAATTA TATCTGCATT GTATTTTCTT GCGACACCAA AAAAAACAAT AACTAATAAT CCTACTTTGT TAATTTTCAT CTCATTATGT CTTTTGGGTA TCGTAAATAT CATCTGGTAT TCACATTATA AAGTTTCAGG CTCTGTTTAT ACCAATGCAT ATCGTGGCCC AATGGAAACA GGAAAAATTG CCTTGTGTAG CGCTTTTATT TTCTTAGTTC TTTTTGCTAA AAATGAGATG AGAACAAAAA TAAAATTTGG GAAACTAATT CTGTTCGCAT CCCTGGCAAC GCAGTTACTT TTTTTTGCGC ATGCCATGTG GCAACATTTC TATTTAAACG TCGACCGTGT TGCATTATCA GCTTCCCACG CTACAACAGC AGGCTACATC ATCCTTTTTC CTTCTTTACT GGCATCAATT CTCATTTTAA AATCCGACTT TAGACATAAA ACAACATTAT ATACAATTAA CTTCATGCTT AGCTTATGTG CTGTCATAGT AACTGAGACG CGTGCAGCCA TATTAGTGTT TCCATTCTTT GCGTTAATAT TAATCGTAAT GGATAGTTAT ATTAATAAGC GAATTAATTA TAAGTTATAT TGTTTTATTG CGATTGCATT ATTAGCAGGT GTATTTTCTT TTAAAGATAC ATTGCTTATG AGAATGAATG ACTTAAATAA CGATTTAGTT AATTATTCGC ATGATAACAC CAGAACTTCA GTCGGTGCCC GTCTGGCAAT GTATGAAGTT GGCTTAAAAA CATATTCTCC AATAGGACAA TCACTGGAAA AACGTGCAGA AAAAATACAT GAGCTAGAAG AAAAAGAGCC TAGATTGAGT GGCGCTTTAC CCTTTGTAGA TTCTCATTTG CATAACGATC TCATAGATAC GTTATCAACG CGTGGTATTC CTGGAGTTGT ATTAACAATT TTAGCATTTT CAGCAATACT CATATATGCC TTAAGAACTG CTAAAGAACC TTATATTTTA ATCTTGCTTT TTTCACTACT GGTAGTAGGC CTAAGTGATG TAATACTCTT TTCTAAACCG GTTCCGACTG CTGTGTTTGT CACCATAATA TTGCTTTGTG CTTATTTTAA AGCACAATCA GACCAATATT TATTAGATAA GTAA
|
Protein sequence | MSFCWNEINS GIKSLILILC IFSLMTLSLW DDVATKFLHA AGIISALYFL ATPKKTITNN PTLLIFISLC LLGIVNIIWY SHYKVSGSVY TNAYRGPMET GKIALCSAFI FLVLFAKNEM RTKIKFGKLI LFASLATQLL FFAHAMWQHF YLNVDRVALS ASHATTAGYI ILFPSLLASI LILKSDFRHK TTLYTINFML SLCAVIVTET RAAILVFPFF ALILIVMDSY INKRINYKLY CFIAIALLAG VFSFKDTLLM RMNDLNNDLV NYSHDNTRTS VGARLAMYEV GLKTYSPIGQ SLEKRAEKIH ELEEKEPRLS GALPFVDSHL HNDLIDTLST RGIPGVVLTI LAFSAILIYA LRTAKEPYIL ILLFSLLVVG LSDVILFSKP VPTAVFVTII LLCAYFKAQS DQYLLDK
|
| |