Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4517 |
Symbol | |
ID | 6272866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 4226443 |
End bp | 4227597 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641728305 |
Product | PTS system, mannose/fructose/sorbose family, IID component |
Protein accession | YP_001882703 |
Protein GI | 187732311 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3716] Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IID |
TIGRFAM ID | [TIGR00828] PTS system, mannose/fructose/sorbose family, IID component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACAGA GAAAAATTAC ACGCAGCGAT CTGGTGAGCA TGTTTCTGCG CTCCAACCTG CAACAGGCGT CCTTTAACTT TGAACGTATT CACGGGCTGG GTTTTTGCTA CGACATGATC CCCGCCATCA AACGACTTTA CCCATTAAAA GAGGATCAGG TTGCGGCGCT CAGGCGACAC CTGGTGTTCT TCAATACCAC GCCAGCCGTA TGTGGCCCGG TCATCGGCGT CACTGCCGCC ATGGAAGAGG CGCGGGCCAA CGGCGCGGAA ATTGATGACG GTACCATTAA CGGCATCAAA GTCGGTCTGA TGGGACTATT GGCAGGAGTT GGCGATCCAC TGGTCTGGGG AACGCTGCGC CCGATTACCA CCGCGCTCGG CGCATCTCTG GCACTTTCGG GCAACATTCT CGGCCCGCTG CTGTTCTTCT TTATTTTCAA TGCGGTGCGT CTGGCGATGA AGTGGTATGG CCTACAGCTC GGCTTTCGCA AAGGGGTGAA TATCGTCAGC GATATGGGCG GGAATGTGCT GCAAAAACTC ACCGAAGGCG CGTCGATTCT CGGGCTGTTT GTGATGGGCG TGCTGGTCAC CAAATGGACG TCAATCAACG TACCGTTGGT GGTTTCACAA ACGCATGCCG CCGATGGCTC CACCGTCACC ATGACCGTGC AGAACATTCT CGACCAACTT TGCCCTGGTT TGCTGGCGCT CGGTCTGACG CTGCTTATGG TTCGTCTGCT CAACAAAAAA ATTAACCCGG TATGGCTGAT TTTCGCCCTG TTTGGCTTAG GGATTATCGG TAATGCGCTG GGCTTCCTGT CCAGATTCTT CGCCCCGGCA CGACTGCCGG GGCCATCGCT CAACATGAGG TGGTTTATGA AAACAACAGC TCTGCGTCTT TATGGTAAAC GTGATTTACG CCTGGAAACC TTTGACCTTC CTGAAATGCA GGAGGATGAA ATCCTCGCGA CGGTGGTCAC TGACAGCCTG TGCCTCTCTT CCTGGAAAGA GGCCAATCTG GGTGAAAACC ATAAAAAAGT ACCCGACGAT GTGGCGCCCC CCCCCCCATC ATCATCGGCC ACGAGTTTTG CGGCGATATT CTGGCCGTGG GTAAAAAGTG GCAGCACAAA TTCCAGCCGG GTCAGCGTTA TGTGA
|
Protein sequence | MEQRKITRSD LVSMFLRSNL QQASFNFERI HGLGFCYDMI PAIKRLYPLK EDQVAALRRH LVFFNTTPAV CGPVIGVTAA MEEARANGAE IDDGTINGIK VGLMGLLAGV GDPLVWGTLR PITTALGASL ALSGNILGPL LFFFIFNAVR LAMKWYGLQL GFRKGVNIVS DMGGNVLQKL TEGASILGLF VMGVLVTKWT SINVPLVVSQ THAADGSTVT MTVQNILDQL CPGLLALGLT LLMVRLLNKK INPVWLIFAL FGLGIIGNAL GFLSRFFAPA RLPGPSLNMR WFMKTTALRL YGKRDLRLET FDLPEMQEDE ILATVVTDSL CLSSWKEANL GENHKKVPDD VAPPPPSSSA TSFAAIFWPW VKSGSTNSSR VSVM
|
| |