Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E3870 |
Symbol | ugpB |
ID | 6272716 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 3595071 |
End bp | 3596387 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641727726 |
Product | glycerol-3-phosphate transporter periplasmic binding protein |
Protein accession | YP_001882161 |
Protein GI | 187731002 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 51 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCGT TACATTATAC AGCTTCAGCA CTGGCGCTCG GACTGGCGTT AATGGGGAAT GCACAGGCAG TGACGACCAT TCCGTTCTGG CATTCTATGG AAGGGGAACT GGGTAAAGAG GTGGATTCTC TGGCCCAACG TTTTAACGCC GAAAATCCGG ATTACAAAAT TGTACCGACC TATAAAGGCA ACTACGAACA GAATTTAAGC GCGGGGATTG CCGCATTTCG TACCGGCAAC GCTCCGGCTA TTTTGCAGGT TTATGAAGTT GGCACCGCCA CCATGATGGC ATCGAAAGCC ATTAAACCGG TATATGACGT GTTTAAAGAG GCGGGGATTC AATTCGATGA GTCGCAGTTT GTGCCGACGG TTTCCGGTTA CTACTCCGAC AGCAAAACGG GCCACTTACT CTCCCAGCCA TTCAACAGCT CGACTCCCGT TCTCTATTAC AACAAAGACG CCTTCAAGAA AGCGGGGTTA GACCCGGAAC AGCCGCCGAA AACCTGGCAG GATCTGGCGA ACTATGCCGC GAAACTGAAA GCCTCCGGCA TGAAGTGCGG CTACGCCAGC GGCTGGCAGG GCTGGATCCA ACTGGAAAAC TTTAGCGCTT GGAACGGTCT GCCGTTTGCC AGCAAAAACA ACGGCTTTGA CGGCACAGAT GCGGTGCTGG AGTTCAACAA GCCGGAGCAG GTGAAACACA TCGCCATGCT CGAGGAGATG AACAAGAAGG GCGACTTCAG CTACGTCGGT CGTAAGGATG AATCCACCGA GAAGTTCTAT AACGGTGATT GCGCGATGAC CACCGCCTCT TCCGGTTCTC TTGCCAACAT TCGCGAGTAC GCCAAATTTA ACTACGGCGT AGGCATGATG CCTTACGACG GCGATGCGAA AGATGCGCCA CAAAACGCCA TTATCGGCGG AGCCAGCCTG TGGGTGATGC AGGGTAAAGA TAAAGAAACG TATACCGGTG TGGCGAAGTT CCTCGATTTC CTCGCGAAGC CAGAAAACGC TGCCGAGTGG CATCAGAAAA CCGGTTATCT GCCAATCACC AAAGCAGCGT ATGACCTGAC CCGTGAGCAG GGCTTTTATG AGAAAAACCC AGGGGCGGAT ACCGCGACGC GTCAGATGCT GAATAAGCCG CCGTTGCCGT TCACCAAAGG GCTGCGTCTG GGCAACATGC CGCAGATCCG CGTGATTGTG GATGAAGAGC TGGAGAGCGT GTGGACCGGT AAGAAGACAC CACAGCAGGC ACTGGATACC GCCGTTGAGC GTGGGAACCA GTTACTGCGC CGCTTTGAGA AATCGACGAA GTCTTAA
|
Protein sequence | MKPLHYTASA LALGLALMGN AQAVTTIPFW HSMEGELGKE VDSLAQRFNA ENPDYKIVPT YKGNYEQNLS AGIAAFRTGN APAILQVYEV GTATMMASKA IKPVYDVFKE AGIQFDESQF VPTVSGYYSD SKTGHLLSQP FNSSTPVLYY NKDAFKKAGL DPEQPPKTWQ DLANYAAKLK ASGMKCGYAS GWQGWIQLEN FSAWNGLPFA SKNNGFDGTD AVLEFNKPEQ VKHIAMLEEM NKKGDFSYVG RKDESTEKFY NGDCAMTTAS SGSLANIREY AKFNYGVGMM PYDGDAKDAP QNAIIGGASL WVMQGKDKET YTGVAKFLDF LAKPENAAEW HQKTGYLPIT KAAYDLTREQ GFYEKNPGAD TATRQMLNKP PLPFTKGLRL GNMPQIRVIV DEELESVWTG KKTPQQALDT AVERGNQLLR RFEKSTKS
|
| |