Gene Information |
Locus tag | SbBS512_E1170 |
Symbol | |
ID | 6271715 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1072846 |
End bp | 1073985 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641725303 |
Product | polysaccharide biosynthesis/export protein |
Protein accession | YP_001879817 |
Protein GI | 187731528 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1596] Periplasmic protein involved in polysaccharide export |
TIGRFAM ID | |
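
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1072846
end_bp = 1073985
gene_length_bp = 1140
protein_length_aa = 379

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is assumed to be the stop.
codons = span // 3
expected_protein_aa = codons - 1

print(span == gene_length_bp)                    # True
print(span % 3 == 0)                             # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the 1140 bp span corresponds to 380 codons, which is 379 amino acids plus one stop codon, in agreement with the listed protein length.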
|
|
Plasmid Coverage information |
Num covering plasmid clones | 56 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAAT CCAAAATGAA ATTGATGCCA TTATTGGTGT CAGTAACCTT GATAAGCGGT TGCACAGTAC TTCCGGGCAG CAATATGTCG ACGATGGGCA AAGACGTCAT CAAACAGCAG GACGCTGATT TCGATCTCGA CAAAATGGTG AATGTTTATC CGCTGACCCC GCGCCTGATT GACCAATTAC GCCCACGCCC GAATGTAGCG CGCCCCAATA TGACGCTGGA AAGTGAGATC GCGAATTACC AGTATCGCGT CGGGCCGGGC GACGTTCTGA ATGTCACCGT CTGGGATCAC CCGGAACTCA CCACGCCAGC CGGTCAGTAC CGCAGCTCCA GCGACACCGG CAACTGGGTA CAGCCTGACG GCACTATGTT TTACCCGTAT ATCGGCAAGG TCCACGTAGT CGGGAAAACG CTCGCTGAAA TCCGCAGTGA TATTACCGGG CGCTTAGCGA CGTACATCGC TGACCCGCAG GTGGATGTTA ATATCGCCGC CTTCCGCTCG CAAAAGGCCT ATATCTCAGG TCAGGTGAAT AAATCCGGTC AACAGGCGAT CACCAACGTA CCACTGACCA TTCTCGACGC CATCAACGCC GCAGGTGGCC TGACCGACAC CGCTGACTGG CGCAACGTGG TGCTAACACA CAATGGTCGT GAAGAACGCA TTTCTTTGCA GGCGCTGATG CAAAACGGCG ACCTCAATCA GAACCGACTG CTTTACCCCG GCGATATTCT CTACGTGCCG CGTAATGATG ATCTGAAAGT GTTTGTGATG GGTGAAGTGA AGAAACAGAG CACCCTGAAA ATGGACTTTA GCGGCATGAC CCTGACTGAA GCCCTGGGCA ATGCCGAAGG TATCGACATG ACCACCTCCA ACGCCAGCGG CATCTTTGTC ATTCGTCCGC TGAAAGGCGA GGGCGGGCGT AACGGCAAGA TTGCCAATAT CTACCAGCTG GATATGTCCG ATGCGACGTC GCTGGTGATG GCGACAGAAT TCCGCCTGCA ACCTTATGAC GTGGTGTATG TCACCACCGC CCCGGTTTCC CGCTGGAACC GTCTGATCAA TCAGTTGCTG CCAACTATTA GCGGTGTCCG TTACATGACG GATACAGCCA GCGACATTCA TAACTGGTAA
|
Protein sequence | MMKSKMKLMP LLVSVTLISG CTVLPGSNMS TMGKDVIKQQ DADFDLDKMV NVYPLTPRLI DQLRPRPNVA RPNMTLESEI ANYQYRVGPG DVLNVTVWDH PELTTPAGQY RSSSDTGNWV QPDGTMFYPY IGKVHVVGKT LAEIRSDITG RLATYIADPQ VDVNIAAFRS QKAYISGQVN KSGQQAITNV PLTILDAINA AGGLTDTADW RNVVLTHNGR EERISLQALM QNGDLNQNRL LYPGDILYVP RNDDLKVFVM GEVKKQSTLK MDFSGMTLTE ALGNAEGIDM TTSNASGIFV IRPLKGEGGR NGKIANIYQL DMSDATSLVM ATEFRLQPYD VVYVTTAPVS RWNRLINQLL PTISGVRYMT DTASDIHNW
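
The relationship between the gene sequence and the protein sequence can be sketched in a few lines. This example is an illustration only, not the database's own pipeline: it assumes the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs in which codons may act as start codons, which does not matter here because this gene begins with ATG). The function name `translate` and the two short fragments, taken from the start and end of the gene sequence above, are chosen for illustration.

```python
from itertools import product

# Standard codon assignments in TCAG order; assumed here to apply to table 11 as well.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence (spaces as in the record are tolerated).
head = translate("ATGATGAAAT CCAAAATGAA ATTGATGCCA")
# Last 18 bases of the gene sequence, which are in frame and end with the stop codon TAA.
tail = translate("GACATTCATAACTGGTAA")

print(head)  # MMKSKMKLMP
print(tail)  # DIHNW
```

Both fragments reproduce the corresponding ends of the listed protein sequence. Passing the full gene sequence to the same function is expected to yield the full 379 aa protein if the stated assumption holds.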