Gene BURPS1710b_A0633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A0633 
SymbolbcsB 
ID3694257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp831470 
End bp833866 
Gene Length2397 bp 
Protein Length798 aa 
Translation table11 
GC content72% 
IMG OID637730887 
Productcellulose synthase regulator protein 
Protein accessionYP_335792 
Protein GI162210116 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.341501 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCCGA TTGCAACGTT TGCGCGCGCA TTTGCAATGG CTTGCGCGCT CCTTTGTCCG 
ATCGCGTCGT GGGCGGCGCT GCAGTCGGCA CCTCGGCCGG CGTCTCAGCC CGCGCTTCAG
CCCGCGTCCC GGCCGACGCC TGCGCCGGCG CTTCAGCCGC CGCGCGCGCG GCCTCGCGCT
CCGCGCTCCG CCTCGCTCGC CGCCGGCGCG TCCGCGCCGC TCCCGGCGTC CGCGCCGCTC
GTCATGCCGC TCGCCGCGCC GAAGCCGTCC GGCCTCGCCG GCGCGGCGAT CCACGTGCCG
TTCGCGACGC TCGGCGCATA CGAGCCGCTG CGCCTGCGCG GCAGCGACAC CGCGCGCACC
GTCAACGTCG GCGTGCGGCT CGACCGGATG GTCACGGCCG CGCGGCTGCG GCTCACGTAT
ACGTATTCGC CGTCGCTCGT GTTTCCGGTG TCGCATCTGA AAGTGTCGAT CAACGGCGAG
GCGGTCGCGA CGCTGCCGTT CGATAGCGAG CACGCGGGGC GCGCGGTGAC GCAGGAGATC
CCGCTCGATG CGCGCTATTT CACCGATTTC AACCAGATCG AGCTGCGCCT CATCGCGCAC
TACACGCTCG ATCATTGCGA AGACCCGGAG CACTCGGCGC TTTGGGCCGA CGTGAGCCCG
ACGAGCGAGC TGATCTTCGA CGAGGCCTCG GTGCGGCTGC CGAACGATCT CGCGCTGCTG
CCCGCGCCGT TCTTCGATCG CCGCGACAAC AGCCGGCTGC GGCTGCCGTT CGTGCTGCCC
GCGACGCCCG ATGACGCGAC GCTGCGCAGC GCGGGCGTGC TCGCGTCGTG GTTCGGCGCG
CTCGCCGATT ACCGGCAGGC GCGCTTTCCG GTGTCGTCGG CGCTGCCCGC GAACGATCAC
GCGGTCGTCG TCGGCACGGC GGCTCGATTG CCGGCATCGC TCGCGCTGCC GCCGATCGAC
GGGCCGATGC TCGTCGTCAC CGACAATCCC GCCGCGCCCG ACAAGAAGCT GCTGGTCGTC
ACGGGCCGCA GCGCGGCCGA CGTCGACGCC GCGACGAACG CGCTCGTGCT CGGCAACGCC
GCGCTGTCCG GCCCGTGGGC GCGCGTGTCG CGCATCGACA TCGGCGCGCC GCGCAAGCCG
TACGACGCGC CGCGCTGGGT GCCGGTGGAC CGGCCGGTGG CGTTGCGCGA GCTCGTCGAC
AGTCCGGCCG ACCTGCAAGT GCGCGGCAGC GCGCCCGATC CGATCCGCCT GAACCTGCGC
GTGCCGGCCG ATCTGCATTC GTGGGGCGGC TCGGGCGTGC CGCTCGCGCT GCACTATCGC
TACACCGCGC CCACCGTGCG CAGCGATTCG ATGCTCGCCG TCGAGATCAA CGATCAGCTC
GTGCAGTCGT ACCGGCTCTC GCCGCGCAGC CAGGACGCGC GCGGGCGCAT GCAGTTGCCG
CTGCTGTCCG GCGCGGACAG CCGCGCGACG AACGATGTCG ACATTCCGGC GTTCCGCGTC
GGCAGCGCGA ACCAGCTGCA ACTGCGCTTC ACGCTCGATT CGGAGAAGAC CGGGCTTTGC
ACGGCAGTCG CGAGCGAGCC GCAGCGCGCG GCGATCGATC CCGATTCGAC GATCGATTTC
TCGCGCTTCA TTCATTACGC GATGCTGCCG AACCTCGCGT ATTTCGCGAA CAGCGGCTTT
CCGTTCACGC GCTACGCGGA TCTTTCGCAG ACCGCCGTCG TGCTGCCGCC GCGGCCGTCG
CCCGCCGAGC AGGAAGCGTA TCTGACGATG CTCGGCCACA TGGGGCAGTG GACGGGTTTT
CCGGCGCTGC GCGTGCGGCT CGCGCGGGCG GCCGACGCGC CGGCGATCGC GGATAAGGAT
CTGCTCGTGA TCGACGGCGC GCCGCCTTAT GCGCAGCTCG CGAACTGGCG CGACGCGCTG
CCCGTCGCGA TCGGCGAGGC AACGGCCGGC GGCGGCTTCT CGCGCGCGGC GTTCTCGGTG
AAGGAGCGCT GGCACGACGA CGCGCGCTCG CCGGCCGGCG GCGCGCGCTT CGAGCAGAGC
GGCACGCTCG CCGCGCTGTT CGGCTTCGAG CGGCCCGGCG GCGACGGGCG CAGCGTCGTC
GCGCTCACGG CCACCGACGC GCGGCATCTC GGCGATCTGC TCGACGTGTT CGAAAAGCCC
GGCCTCGTCG CGCAACTGCA GGGCGACGTC GCGCTCGTGC GGGCGGGCGC GGTCGAGAGC
CTGCGCGTCG GCGAGCCCTA TCTCGTCGGC TACGTGCCGT GGTACGCGCG CGTGTGGACG
GCGGTCGCGA GGCATCCGAT GCTGCTCGGG CTGCTCGGGG CGGCGGCCGG GCTGCTGCTC
GCGCTCGGCG CGTTCGGCGC GTTGCAGCGG ATCGCCGCGC GGCGGCGAGG GCTCTGA
 
Protein sequence
MKPIATFARA FAMACALLCP IASWAALQSA PRPASQPALQ PASRPTPAPA LQPPRARPRA 
PRSASLAAGA SAPLPASAPL VMPLAAPKPS GLAGAAIHVP FATLGAYEPL RLRGSDTART
VNVGVRLDRM VTAARLRLTY TYSPSLVFPV SHLKVSINGE AVATLPFDSE HAGRAVTQEI
PLDARYFTDF NQIELRLIAH YTLDHCEDPE HSALWADVSP TSELIFDEAS VRLPNDLALL
PAPFFDRRDN SRLRLPFVLP ATPDDATLRS AGVLASWFGA LADYRQARFP VSSALPANDH
AVVVGTAARL PASLALPPID GPMLVVTDNP AAPDKKLLVV TGRSAADVDA ATNALVLGNA
ALSGPWARVS RIDIGAPRKP YDAPRWVPVD RPVALRELVD SPADLQVRGS APDPIRLNLR
VPADLHSWGG SGVPLALHYR YTAPTVRSDS MLAVEINDQL VQSYRLSPRS QDARGRMQLP
LLSGADSRAT NDVDIPAFRV GSANQLQLRF TLDSEKTGLC TAVASEPQRA AIDPDSTIDF
SRFIHYAMLP NLAYFANSGF PFTRYADLSQ TAVVLPPRPS PAEQEAYLTM LGHMGQWTGF
PALRVRLARA ADAPAIADKD LLVIDGAPPY AQLANWRDAL PVAIGEATAG GGFSRAAFSV
KERWHDDARS PAGGARFEQS GTLAALFGFE RPGGDGRSVV ALTATDARHL GDLLDVFEKP
GLVAQLQGDV ALVRAGAVES LRVGEPYLVG YVPWYARVWT AVARHPMLLG LLGAAAGLLL
ALGAFGALQR IAARRRGL