Gene Information |
Locus tag | SbBS512_E2227 |
Symbol | |
ID | 6272475 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2026581 |
End bp | 2027603 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641726248 |
Product | hypothetical protein |
Protein accession | YP_001880733 |
Protein GI | 187730726 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
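
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(2026581, 2027603)
print(length)                     # 1023
print(protein_length_aa(length))  # 340
```

Both results match the Gene Length (1023 bp) and Protein Length (340 aa) fields. The strand (here "-") does not affect the length calculation.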
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00000358206 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG TGTTATTGAT TATCTTGTTA TTGCTGGTGG TACTGGGTAT CGCCGCTGGT GTGGGCGTCT GGAAGGTTCG CCATCTTGCC GACAGCAAAT TGCTTATCAA AGAAGAGACG ATATTTACCC TGAAGCCAGG GACCGGACGT CTGGCGCTTG GTGAACAGCT TTATGCCGAT AAGATCATCA ATCGCCCACG GGTTTTTCAA TGGTTGCTGC GTATCGAACC GGATCTTTCT CACTTTAAAG CCGGGACTTA CCGCTTTACA CCGCAGATGA CCGTGCGCGA GATGCTGAAA TTGCTGGAAA GCGGTAAAGA AGCACAGTTC CCGCTGCGAC TGGTAGAAGG GATGCGTCTG AGCGACTACC TCAAGCAATT GCGTGAGGCT CCGTATATCA AGCACACGCT GAGTGACGAT AAGTACGCCA CCGTAGCGCA GGCACTTGAA CTGGAAAACC CGGAGTGGAT TGAAGGTTGG TTCTGGCCAG ACACCTGGAT GTATACCGCC AATACCACCG ATGTCGCGTT ACTCAAGCGA GCGCACAAGA AAATGGTGAA AGCGGTCGAT AGCGCCTGGG AAGGGCGTGC GGACGGTCTG CCTTATAAAG ATAAAAACCA GCTGGTGACG ATGGCATCAA TTATCGAAAA AGAAACCGCC GTTGCCAGTG AACGCGATCA GGTTGCCTCG GTATTTATCA ACCGTTTACG CATTGGTATG CGCCTGCAGA CCGACCCAAC CGTGATTTAC GGGATGGGAG AGCGTTATAA TGGCAAACTT TCTCGTGCAG ACCTGGAAAC GCCGACAGCG TATAACACCT ATACCATTAC CGGTTTGCCG CCAGGTGCGA TAGCTACGCC GGGGGCGGAT TCGCTGAAGG CTGCTGCGCA TCCGGCAAAA ACGCCGTATC TCTATTTTGT GGCCGATGGT AAAGGTGGTC ACACGTTTAA TACCAATCTT GCCAGCCATA ACAAGTCTGT GCAGGATTAT CTGAAAGTGC TTAAGGAAAA AAATGCGCAG TAA
|
Protein sequence | MKKVLLIILL LLVVLGIAAG VGVWKVRHLA DSKLLIKEET IFTLKPGTGR LALGEQLYAD KIINRPRVFQ WLLRIEPDLS HFKAGTYRFT PQMTVREMLK LLESGKEAQF PLRLVEGMRL SDYLKQLREA PYIKHTLSDD KYATVAQALE LENPEWIEGW FWPDTWMYTA NTTDVALLKR AHKKMVKAVD SAWEGRADGL PYKDKNQLVT MASIIEKETA VASERDQVAS VFINRLRIGM RLQTDPTVIY GMGERYNGKL SRADLETPTA YNTYTITGLP PGAIATPGAD SLKAAAHPAK TPYLYFVADG KGGHTFNTNL ASHNKSVQDY LKVLKEKNAQ
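
The sequences above are printed in space-separated blocks of 10 characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of that cleanup, with a GC-fraction helper and a translation check on the first seven codons of the gene sequence. The helper names and the partial codon map are hypothetical; the map covers only the codons in this short prefix and is not a full implementation of translation table 11.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Partial codon map for illustration only: just the codons in the prefix below.
PARTIAL_CODONS = {
    "ATG": "M", "AAA": "K", "GTG": "V",
    "TTA": "L", "TTG": "L", "ATT": "I",
}


def translate(seq: str, codons: dict[str, str]) -> str:
    """Translate complete codons in frame 1; raises KeyError on unknown codons."""
    usable = len(seq) - len(seq) % 3
    return "".join(codons[seq[i:i + 3]] for i in range(0, usable, 3))


# First 21 bases of the gene sequence as printed in the record.
prefix = clean_sequence("ATGAAAAAAG TGTTATTGAT T")
print(len(prefix))                        # 21
print(translate(prefix, PARTIAL_CODONS))  # MKKVLLI
```

The translated prefix agrees with the first seven residues of the protein sequence in the record. Applying `gc_fraction` to the full cleaned gene sequence gives the value to compare against the GC content field.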