Gene SbBS512_E3938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E3938 
SymbolbcsB 
ID6273285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3671096 
End bp3673426 
Gene Length2331 bp 
Protein Length776 aa 
Translation table11 
GC content55% 
IMG OID641727788 
Productcellulose synthase regulator protein 
Protein accessionYP_001882221 
Protein GI187733631 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.047367 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAA AACTATTCTG GATTTGTGCA GTGGCTATGG GGATGAGTGC GTTCCCCTCT 
TTCATGACGC AGGCGACGCC AGCAACGCAA CCACTGATCA ATGCTGAGCC AGCTGTAGCC
GCCCAGACGG AACAAAATCC GCAGGTGGGG CAAGTGATGC CGGGCGTGCA GGGCGCTGAT
GCGCCAGTCG TGGCGCAGAA CGGTCCTTCG CGTGATGTGA AGCTGACCTT TGCGCAAATT
GCGCCGCCGC CGGGCAGCAT GGTGCTACGT GGCATTAACC CGAACGGCAG CATTGAGTTT
GGTATGCGCA GCGATGAAGT GGTGACGAAG GCGATGCTCA ACCTCGAATA CATGCCATCG
CCATCGTTAC TGCCTGTCCA GTCGCAGTTA AAGGTTTATC TCAATGATGA GCTGATGGGC
GTACTGCCAG TGACCAAAGA ACAGTTGGGT AAAAAAACGC TGGCGCAAAT GCCCATTAAC
CCACTGTTTA TTACCGACTT CAACCGTGTG CGGCTGGAGT TTGTCGGCCA TTATCAGGAC
GTGTGCGAAA ACCCGGCCAG CACCACGCTT TGGCTGGATG TTGGGCGGAG CAGTGGACTG
GATCTGACCT ATCAGACCCT GAATGTGAAG AATGACCTGT CACACTTCCC GGTGCCATTC
TTTGACCCGC GCGATAACCG TACTAACACC TTGCCGATGG TCTTTGCGGG TGCGCCGGAT
GTTGGGCTGC AACAAGCATC TGCCATTGTC GCCTCGTGGT TTGGTTCGCG TTCTGGCTGG
CGTGGGCAGA ACTTCCCGGT GCTCTATAAC CAACTGCCGG ATCGTAATGC CATTGTCTTT
GCAACCAACG ACAAACGGCC GGACTTCCTG CGCGATCATC CGGCGGTAAA AGCCCCGGTG
ATTGAGATGA TCAACCATCC GCAGAATCCT TACGTCAAAC TGCTGGTGGT GTTTGGTCGT
GACGACAAAG ACCTGTTGCA GGCAGCGAAA GGTATCGCTC AGGGTAACAT TCTGTTCCGT
GGTGAAAGCG TGGTAGTGAA TGAAGTGAAA CCGCTGCTAC CGCGTAAGCC GTACGATGCG
CCGAACTGGG TACGTACCGA TCGTCCGGTC ACATTTGGCG AACTGAAAAC CTATGAAGAA
CAGTTACAAT CCAGCGGTCT TGAGCCAGCA GCGATTAACG TTTCGCTAAA CCTGCCGCCG
GATCTCTACC TGATGCGCAG TACCGGCATT GATATGGATA TTAATTACCG CTACACCATG
CCGCCGGTGA AAGACAGTTC GCGGATGGAT ATCAGCCTGA ATAACCAGTT CCTGCAATCC
TTCAACCTGA GCAGCAAACA GGAGGCGAAC CGCCTGCTGC TGCGGATTCC GGTATTACAA
GGTTTGCTGG ATGGCAAAAC AGATGTCTCT ATTCCGGCGC TGAAACTGGG CGCGACCAAC
CAGCTGCGCT TCGACTTTGA GTATATGAAC CCGATGCCGG GCGGTTCGGT GGATAACTGT
ATTACCTTCC AGCCGGTGCA GAATCATGTG GTGATTGGTG ACGACTCCAC CATCGACTTC
TCGAAGTATT ACCACTTCAT CCCGATGCCG GATCTACGCG CCTTTGCTAA CGCGGGCTTC
CCATTCAGCC GGATGGCGGA TCTGTCGCAA ACCATCACCG TGATGCCGAA AGCGCCTAAC
GAAGCACAGA TGGAAACGTT GCTGAATACT GTTGGTTTTA TCGGCGCACA GACGGGCTTC
CCGGCGATTA ATCTGACGGT GACCGATGAT GGCAGCACCA TTCAGGGCAA AGATGCCGAC
ATCATGATCA TCGGTGGTAT CCCGGACAAA CTGAAAGACG ATAAGCAGAT CGACCTATTG
GTGCAGGCGA CCGAAAGCTG GGTGAAAACA CCGATGCGCC AGACCCCGTT CCCCGGCATT
GTGCCGGACG AGAGCGATCG CGCGGCAGAA ACCCGGTCAA CGCTGACCTC TTCCGGTGCG
ATGGCGGCGG TGATTGGCTT CCAGTCGCCG TATAACGACC AGCGCAGCGT GATTGCGCTG
TTGGCAGATA GCCCACGCGG TTATGAAATG CTTAACGATG CGGTGAACGA TAGCGGCAAA
CGCGCCACCA TGTTCGGTTC GGTCGCGGTG ATCCGCGAGT CCGGTATCAA CAGCCTACGT
GTTGGCGACG TTTATTACGT AGGTCATCTG CCGTGGTTCG AGCGCGTGTG GTATGCGCTG
GCAAACCATC CGATTCTGCT GGCGGCTATC AGTGTGATAT TGCTGGCATG GATACTGTGG
CGTCTGCTGC GAATTATTAG TCGTCGTCGT CTTAACCCGG ATAACGAGTA A
 
Protein sequence
MKRKLFWICA VAMGMSAFPS FMTQATPATQ PLINAEPAVA AQTEQNPQVG QVMPGVQGAD 
APVVAQNGPS RDVKLTFAQI APPPGSMVLR GINPNGSIEF GMRSDEVVTK AMLNLEYMPS
PSLLPVQSQL KVYLNDELMG VLPVTKEQLG KKTLAQMPIN PLFITDFNRV RLEFVGHYQD
VCENPASTTL WLDVGRSSGL DLTYQTLNVK NDLSHFPVPF FDPRDNRTNT LPMVFAGAPD
VGLQQASAIV ASWFGSRSGW RGQNFPVLYN QLPDRNAIVF ATNDKRPDFL RDHPAVKAPV
IEMINHPQNP YVKLLVVFGR DDKDLLQAAK GIAQGNILFR GESVVVNEVK PLLPRKPYDA
PNWVRTDRPV TFGELKTYEE QLQSSGLEPA AINVSLNLPP DLYLMRSTGI DMDINYRYTM
PPVKDSSRMD ISLNNQFLQS FNLSSKQEAN RLLLRIPVLQ GLLDGKTDVS IPALKLGATN
QLRFDFEYMN PMPGGSVDNC ITFQPVQNHV VIGDDSTIDF SKYYHFIPMP DLRAFANAGF
PFSRMADLSQ TITVMPKAPN EAQMETLLNT VGFIGAQTGF PAINLTVTDD GSTIQGKDAD
IMIIGGIPDK LKDDKQIDLL VQATESWVKT PMRQTPFPGI VPDESDRAAE TRSTLTSSGA
MAAVIGFQSP YNDQRSVIAL LADSPRGYEM LNDAVNDSGK RATMFGSVAV IRESGINSLR
VGDVYYVGHL PWFERVWYAL ANHPILLAAI SVILLAWILW RLLRIISRRR LNPDNE