Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4237 |
Symbol | |
ID | 6268421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 3960963 |
End bp | 3962711 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641728056 |
Product | PTS system, alpha-glucoside-specific IIBC component |
Protein accession | YP_001882477 |
Protein GI | 187730493 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific [COG1264] Phosphotransferase system IIB components |
TIGRFAM ID | [TIGR00826] PTS system, glucose-like IIB component [TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component [TIGR02005] PTS system, alpha-glucoside-specific IIBC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTCGATG CTGTGTACCA GTGCTCACAG ATGTCTACTT TTTCGCGAAA ACGTAGATCT CTACCGCCCA ACGAAAAGCA TGAAAGCGAT CACGAATCCC ATTGGGTCGT CATGTTCTGC CGATCGCATC TTCCTATCCT CGCTCCAGGC CTGCCGCATA ACCAATCAGG CTTCCTACTT ACAGAATTGA GAAAAGAGGA TGTGGAAATG CTCAGTCAAA TTCAACGCTT TGGCGGCGCG ATGTTCACGC CAGTGCTGCT GTTTCCCTTC GCCGGGATTG TGGTGGGTCT TGCCATCTTG CTGCAAAACC CGATGTTTGT CGGGGAATCA CTGACCGATC CGAACAGTTT ATTCGCGCAA ATCGTACACA TTATTGAAGA GGGCGGTTGG ACGGTATTCC GTAATATGCC GCTGATTTTT GCTGTCGGTT TACCCATTGG CCTTGCTAAG CAAGCGCAGG GGCGTGCTTG TCTGGCGGTG ATGGTGAGTT TCCTGACCTG GAACTATTTC ATCAACGCGA TGGGAATGAC CTGGGGAAGC TACTTCGGCG TCGATTTCAC TCAGGACGCG GTGGCAGGTA GCGGTCTGAC AATGATGGCC GGGATTAAAA CCCTCGATAC CAGCATTATC GGCGCAATTA TCATTTCCGG CATTGTGACG GCGCTGCATA ACCGTCTGTT CGATAAAAAA CTGCCGGTTT TTCTCGGCAT TTTCCAGGGG ACGTCTTATG TGGTGATTAT CGCCTTCCTG GTGATGATCC CCTGTGCCTG GCTGACGTTG CTCGGCTGGC CAAAAGTACA AATGGGGATT GAATCTCTGC AAGCGTTCCT GCGTTCGGCG GGTGCACTTG GGGTGTGGGT TTACACCTTC CTCGAACGTA TTCTGATCCC AACCGGTTTA CACCACTTCA TCTACGGACA GTTTATCTTT GGTCCGGCAG CTGTTGAAGG CGGCATTCAG ATGTACTGGG CGCAGCATCT GCAAGAGTTC AGTCTGAGCG CCGAGCCGCT GAAATCGTTG TTCCCGGAAG GCGGTTTTGC CCTGCACGGT AACTCAAAAA TCTTTGGTGC CGTGGGCATT TCTTTAGCGA TGTACTTCAC TGCCGCACCG GAAAATCGGG TAAAAGTGGC GGGCTTGCTG ATTCCCGCAA CCTTAACCGC CATGCTGGCG GCCTCAATGT CGACCGTGAT GTATCTCTTT GGTGTGGTGG GCAACATGGG CGGAGGTCTG ATTGACCAGG TTTTACCGCA AAACTGGATC CCGATGTTCA GCAACCACGC GGATATGATG CTGACCCAAA TCGCCATTGG GTTGTGCTTT ACCCTGCTGT ACTTCGTGGT TTTCCGCACA CTGATTCTGC AATTCAACAT GTGCACGCCG GGACGTGAAG ATGCGGAAGT GAAACTCTAC TCAAAAGCCG AATACAAAGC CTCGCGAGGC CAAACCACCG CTGCAGAGCC AAAAAAAGAG CTGGATCAGG CTGCCGGTAT CCTGCAAGCC CTGGGCGGGG TCGGCAATAT CTCCAGCATT AACAATTGCG CGACGCGTTT ACGTATTGCA CTGCATGACA TGTCACAAAC GCTGGATGAC GAAGTCTTTA AAAAGCTGGG AGCGCACGGC GTCTTCCGTA GTGGCGATGC CATTCAGGTG ATCATTGGTC TGCATGTATC CCAGCTGCGT GAACAGCTCG ATAGCTTAAT TAATTCTCAT CAATCAGCAG AAAATGTTGC CATTACGGAG GCAGTATAA
|
Protein sequence | MFDAVYQCSQ MSTFSRKRRS LPPNEKHESD HESHWVVMFC RSHLPILAPG LPHNQSGFLL TELRKEDVEM LSQIQRFGGA MFTPVLLFPF AGIVVGLAIL LQNPMFVGES LTDPNSLFAQ IVHIIEEGGW TVFRNMPLIF AVGLPIGLAK QAQGRACLAV MVSFLTWNYF INAMGMTWGS YFGVDFTQDA VAGSGLTMMA GIKTLDTSII GAIIISGIVT ALHNRLFDKK LPVFLGIFQG TSYVVIIAFL VMIPCAWLTL LGWPKVQMGI ESLQAFLRSA GALGVWVYTF LERILIPTGL HHFIYGQFIF GPAAVEGGIQ MYWAQHLQEF SLSAEPLKSL FPEGGFALHG NSKIFGAVGI SLAMYFTAAP ENRVKVAGLL IPATLTAMLA ASMSTVMYLF GVVGNMGGGL IDQVLPQNWI PMFSNHADMM LTQIAIGLCF TLLYFVVFRT LILQFNMCTP GREDAEVKLY SKAEYKASRG QTTAAEPKKE LDQAAGILQA LGGVGNISSI NNCATRLRIA LHDMSQTLDD EVFKKLGAHG VFRSGDAIQV IIGLHVSQLR EQLDSLINSH QSAENVAITE AV
|
| |