Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2382 |
Symbol | |
ID | 6269130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 2172317 |
End bp | 2173318 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641726384 |
Product | alkanesulfonate transporter substrate-binding subunit |
Protein accession | YP_001880866 |
Protein GI | 187732766 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCAGGT TCCTGACCTT CTGTCTCTGC GAGGTAATGC CCATGCGTAA CATCATTAAA CTGGCGCTGG TGGGATTGCT TAGCGTCTCT ACGATTGCGG TTGCTGCAGA ATCCTCGCCT GAAGCGTTAT GTATAGGCTA TCAGAAAGGC AGTATTGGTA TGGTACTGGC AAAAAGCCAC CAGTTACTGG AAAAACGCTA TCCGCAATCA AAAATCTCCT GGGTGGAGTT CCCCGCGGGT CCGCAAATGT TGGAAGCCTT AAACGTTGGC AGTATTGATC TCGGCAGTAC CGGGGATATT CCGCCAATCT TTGCCCAGGC TGCCGGGGCT GATTTGGTAT ATGTGAGCGT CGAGCCACCG AAGCCCAAAG CCGAAGTGAT TCTGGTGGCA GAAAACAGCC CGATCAAAAC CGTAGCCGAT CTTAAAGGTC ACAAAGTTGC CTTTCAGAAA GGTTCCAGTT CACACAACCT TTTACTGCGT GCACTGCGTC AGGCCGGACT TAAGTTTACC GATATCCAAC CCACTTACCT AACGCCCGCT GATGCCCGCG CCGCGTTCCA GCAAGGTAAC GTTGACGCCT GGGCTATCTG GGATCCCTAC TACTCTGCTG CATTATTACA GGGCGGCGTG CGGGTGCTGA AAGACGGCAC CGATCTCAAT CAAACTGGAT CGTTTTATCT GGCAGCTCGT CCCTATGCAG AAAAAAACGG CGCTTTTATT CAGGGCGTAC TGGCAACCTT TAGTGAGGCC GATGCGTTAA CCCGCAGCCA GCGCGAGCAA AGCATCGCTT TACTGGCAAA AACGATGGGC TTACCGGCAC CGGTGATTGC CTCTTACTTA GATCATCGCC CTCCTACCAC CATCAAACCG GTTAACGCCG AGGTTGCCGC CTTACAGCAG CAAACGGCAG ATCTGTTTTA TGAAAATCGT CTGGTGCCGA AAAAAGTCGA TATTCGCCAG CGCATCTGGC AGCCCACTCA ACTGGAAGGA AAACAATTAT GA
|
Protein sequence | MFRFLTFCLC EVMPMRNIIK LALVGLLSVS TIAVAAESSP EALCIGYQKG SIGMVLAKSH QLLEKRYPQS KISWVEFPAG PQMLEALNVG SIDLGSTGDI PPIFAQAAGA DLVYVSVEPP KPKAEVILVA ENSPIKTVAD LKGHKVAFQK GSSSHNLLLR ALRQAGLKFT DIQPTYLTPA DARAAFQQGN VDAWAIWDPY YSAALLQGGV RVLKDGTDLN QTGSFYLAAR PYAEKNGAFI QGVLATFSEA DALTRSQREQ SIALLAKTMG LPAPVIASYL DHRPPTTIKP VNAEVAALQQ QTADLFYENR LVPKKVDIRQ RIWQPTQLEG KQL
|
| |