Gene Information |
Locus tag | SbBS512_E3938 |
Symbol | bcsB |
ID | 6273285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3671096 |
End bp | 3673426 |
Gene Length | 2331 bp |
Protein Length | 776 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641727788 |
Product | cellulose synthase regulator protein |
Protein accession | YP_001882221 |
Protein GI | 187733631 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.047367 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAA AACTATTCTG GATTTGTGCA GTGGCTATGG GGATGAGTGC GTTCCCCTCT TTCATGACGC AGGCGACGCC AGCAACGCAA CCACTGATCA ATGCTGAGCC AGCTGTAGCC GCCCAGACGG AACAAAATCC GCAGGTGGGG CAAGTGATGC CGGGCGTGCA GGGCGCTGAT GCGCCAGTCG TGGCGCAGAA CGGTCCTTCG CGTGATGTGA AGCTGACCTT TGCGCAAATT GCGCCGCCGC CGGGCAGCAT GGTGCTACGT GGCATTAACC CGAACGGCAG CATTGAGTTT GGTATGCGCA GCGATGAAGT GGTGACGAAG GCGATGCTCA ACCTCGAATA CATGCCATCG CCATCGTTAC TGCCTGTCCA GTCGCAGTTA AAGGTTTATC TCAATGATGA GCTGATGGGC GTACTGCCAG TGACCAAAGA ACAGTTGGGT AAAAAAACGC TGGCGCAAAT GCCCATTAAC CCACTGTTTA TTACCGACTT CAACCGTGTG CGGCTGGAGT TTGTCGGCCA TTATCAGGAC GTGTGCGAAA ACCCGGCCAG CACCACGCTT TGGCTGGATG TTGGGCGGAG CAGTGGACTG GATCTGACCT ATCAGACCCT GAATGTGAAG AATGACCTGT CACACTTCCC GGTGCCATTC TTTGACCCGC GCGATAACCG TACTAACACC TTGCCGATGG TCTTTGCGGG TGCGCCGGAT GTTGGGCTGC AACAAGCATC TGCCATTGTC GCCTCGTGGT TTGGTTCGCG TTCTGGCTGG CGTGGGCAGA ACTTCCCGGT GCTCTATAAC CAACTGCCGG ATCGTAATGC CATTGTCTTT GCAACCAACG ACAAACGGCC GGACTTCCTG CGCGATCATC CGGCGGTAAA AGCCCCGGTG ATTGAGATGA TCAACCATCC GCAGAATCCT TACGTCAAAC TGCTGGTGGT GTTTGGTCGT GACGACAAAG ACCTGTTGCA GGCAGCGAAA GGTATCGCTC AGGGTAACAT TCTGTTCCGT GGTGAAAGCG TGGTAGTGAA TGAAGTGAAA CCGCTGCTAC CGCGTAAGCC GTACGATGCG CCGAACTGGG TACGTACCGA TCGTCCGGTC ACATTTGGCG AACTGAAAAC CTATGAAGAA CAGTTACAAT CCAGCGGTCT TGAGCCAGCA GCGATTAACG TTTCGCTAAA CCTGCCGCCG GATCTCTACC TGATGCGCAG TACCGGCATT GATATGGATA TTAATTACCG CTACACCATG CCGCCGGTGA AAGACAGTTC GCGGATGGAT ATCAGCCTGA ATAACCAGTT CCTGCAATCC TTCAACCTGA GCAGCAAACA GGAGGCGAAC CGCCTGCTGC TGCGGATTCC GGTATTACAA GGTTTGCTGG ATGGCAAAAC AGATGTCTCT ATTCCGGCGC TGAAACTGGG CGCGACCAAC CAGCTGCGCT TCGACTTTGA GTATATGAAC CCGATGCCGG GCGGTTCGGT GGATAACTGT ATTACCTTCC AGCCGGTGCA GAATCATGTG GTGATTGGTG ACGACTCCAC CATCGACTTC TCGAAGTATT ACCACTTCAT CCCGATGCCG GATCTACGCG CCTTTGCTAA CGCGGGCTTC CCATTCAGCC GGATGGCGGA TCTGTCGCAA ACCATCACCG TGATGCCGAA AGCGCCTAAC GAAGCACAGA TGGAAACGTT GCTGAATACT GTTGGTTTTA TCGGCGCACA GACGGGCTTC CCGGCGATTA ATCTGACGGT GACCGATGAT GGCAGCACCA TTCAGGGCAA AGATGCCGAC 
ATCATGATCA TCGGTGGTAT CCCGGACAAA CTGAAAGACG ATAAGCAGAT CGACCTATTG GTGCAGGCGA CCGAAAGCTG GGTGAAAACA CCGATGCGCC AGACCCCGTT CCCCGGCATT GTGCCGGACG AGAGCGATCG CGCGGCAGAA ACCCGGTCAA CGCTGACCTC TTCCGGTGCG ATGGCGGCGG TGATTGGCTT CCAGTCGCCG TATAACGACC AGCGCAGCGT GATTGCGCTG TTGGCAGATA GCCCACGCGG TTATGAAATG CTTAACGATG CGGTGAACGA TAGCGGCAAA CGCGCCACCA TGTTCGGTTC GGTCGCGGTG ATCCGCGAGT CCGGTATCAA CAGCCTACGT GTTGGCGACG TTTATTACGT AGGTCATCTG CCGTGGTTCG AGCGCGTGTG GTATGCGCTG GCAAACCATC CGATTCTGCT GGCGGCTATC AGTGTGATAT TGCTGGCATG GATACTGTGG CGTCTGCTGC GAATTATTAG TCGTCGTCGT CTTAACCCGG ATAACGAGTA A
|
Protein sequence | MKRKLFWICA VAMGMSAFPS FMTQATPATQ PLINAEPAVA AQTEQNPQVG QVMPGVQGAD APVVAQNGPS RDVKLTFAQI APPPGSMVLR GINPNGSIEF GMRSDEVVTK AMLNLEYMPS PSLLPVQSQL KVYLNDELMG VLPVTKEQLG KKTLAQMPIN PLFITDFNRV RLEFVGHYQD VCENPASTTL WLDVGRSSGL DLTYQTLNVK NDLSHFPVPF FDPRDNRTNT LPMVFAGAPD VGLQQASAIV ASWFGSRSGW RGQNFPVLYN QLPDRNAIVF ATNDKRPDFL RDHPAVKAPV IEMINHPQNP YVKLLVVFGR DDKDLLQAAK GIAQGNILFR GESVVVNEVK PLLPRKPYDA PNWVRTDRPV TFGELKTYEE QLQSSGLEPA AINVSLNLPP DLYLMRSTGI DMDINYRYTM PPVKDSSRMD ISLNNQFLQS FNLSSKQEAN RLLLRIPVLQ GLLDGKTDVS IPALKLGATN QLRFDFEYMN PMPGGSVDNC ITFQPVQNHV VIGDDSTIDF SKYYHFIPMP DLRAFANAGF PFSRMADLSQ TITVMPKAPN EAQMETLLNT VGFIGAQTGF PAINLTVTDD GSTIQGKDAD IMIIGGIPDK LKDDKQIDLL VQATESWVKT PMRQTPFPGI VPDESDRAAE TRSTLTSSGA MAAVIGFQSP YNDQRSVIAL LADSPRGYEM LNDAVNDSGK RATMFGSVAV IRESGINSLR VGDVYYVGHL PWFERVWYAL ANHPILLAAI SVILLAWILW RLLRIISRRR LNPDNE
|
| |
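The coordinate and length fields in the record above can be cross-checked against each other. The following is a minimal sketch, not part of the source record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts one terminal stop codon (the gene sequence ends in TAA).

```python
# Values copied from the record above.
START_BP = 3671096
END_BP = 3673426
GENE_LENGTH_BP = 2331
PROTEIN_LENGTH_AA = 776


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def expected_cds_length(protein_length_aa: int) -> int:
    """Coding length in bp: three bases per residue plus one stop codon."""
    return 3 * protein_length_aa + 3


def record_is_consistent() -> bool:
    """True if the coordinates, gene length, and protein length all agree."""
    return (
        span_length(START_BP, END_BP)
        == GENE_LENGTH_BP
        == expected_cds_length(PROTEIN_LENGTH_AA)
    )


if __name__ == "__main__":
    print("span length (bp):", span_length(START_BP, END_BP))
    print("expected CDS length (bp):", expected_cds_length(PROTEIN_LENGTH_AA))
    print("record consistent:", record_is_consistent())
```

Under these assumptions the three fields agree: the span covers 2331 bp, which matches 776 codons plus one stop codon.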