Gene Information |
Locus tag | SbBS512_E0816 |
Symbol | mglB |
ID | 6272884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 767747 |
End bp | 768745 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641724990 |
Product | galactose ABC transporter, periplasmic galactose-binding protein |
Protein accession | YP_001879517 |
Protein GI | 187732107 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01168] Gram-positive signal peptide, YSIRK family |
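The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the terminal stop codon. Both assumptions reproduce the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(767747, 768745)
print(length_bp, protein_length(length_bp))  # prints: 999 332
```

Both results agree with the Gene Length (999 bp) and Protein Length (332 aa) fields.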
| |
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.685859 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAGA AGGTGTTAAC CCTGTCTGCT GTGATGGCCA GCATGTTATT CGGTGCCGCT GCACACGCTG CTGATACTCG CATTGGTGTA ACAATCTATA AGTACGACGA TAACTTTATG TCTGTAGTGC GCAAGGCTAT TGAGCAAGAT GCGAAAGCCG CGCCAGATGT TCAGCTGCTG ATGAATGATT CTCAGAATGA CCAGTCCAAG CAGAACGATC AGATCGACGT ATTGCTGGCG AAAGGGGTGA AGGCACTGGC AATCAACCTG GTTGACCCGG CAGCTGCGGG TACGGTGATT GAGAAAGCGC GTGGGCAAAA CGTGCCGGTG GTTTTCTTCA ACAAAGAACC GTCTCGTAAG GCGCTGGATA GCTACGACAA AGCCTACTAC GTTGGCACTG ACTCCAAAGA GTCCGGCATT ATTCAGGGCG ATTTGATTGC TAAACACTGG GCGGCGAATC AGGGTTGGGA TCTGAACAAA GACGGTCAGA TTCAGTCCGT ACTGCTGAAA GGTGAACCGG GCCATCCGGA TGCAGAAGCA CGTACCACTT ACGTGATTAA AGAATTGAAC GATAAAGGCA TCAAAACTGA ACAGTTACAG TTAGATACCG CAATGTGGGA TACCGCTCAG GCGAAAGATA AGATGGACGC CTGGCTGTCT GGCCCGAACG CCAACAAAAT CGAAGTGGTT ATCGCCAACA ACGATGCGAT GGCAATGGGC GCGGTAGAAG CACTGAAAGC ACACAACAAG TCCAGCATTC CGGTGTTTGG CGTCGATGCT CTGCCAGAAG CGCTGGCGCT GGTGAAATCC GGTGCACTGG CGGGCACCGT ACTGAACGAT GCTAACAACC AGGCGAAAGC GACCTTTGAT CTGGCGAAAA ACCTGGCCGA TGGTAAAGGT GCGGCTGATG GCACCAACTG GAAAATCGAC AACAAAGTGG TCCGCGTACC TTATGTTGGC GTAGATAAAG ACAACCTGGC TGAATTCAGC AAGAAATAA
|
Protein sequence | MNKKVLTLSA VMASMLFGAA AHAADTRIGV TIYKYDDNFM SVVRKAIEQD AKAAPDVQLL MNDSQNDQSK QNDQIDVLLA KGVKALAINL VDPAAAGTVI EKARGQNVPV VFFNKEPSRK ALDSYDKAYY VGTDSKESGI IQGDLIAKHW AANQGWDLNK DGQIQSVLLK GEPGHPDAEA RTTYVIKELN DKGIKTEQLQ LDTAMWDTAQ AKDKMDAWLS GPNANKIEVV IANNDAMAMG AVEALKAHNK SSIPVFGVDA LPEALALVKS GALAGTVLND ANNQAKATFD LAKNLADGKG AADGTNWKID NKVVRVPYVG VDKDNLAEFS KK
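The relationship between the two sequences can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE`, `clean`, `translate`, and `gc_percent` are hypothetical, and the space-separated 10-base blocks are assumed to be display formatting only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base up to the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGAATAAGA AGGTGTTAAC CCTGTCTGCT"))  # prints: MNKKVLTLSA
```

Passing the full Gene sequence field to `translate` should yield the Protein sequence field once spaces are removed, and `gc_percent` on the same input can be compared with the GC content field.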