Gene SeD_A3994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3994 
SymbolbcsB 
ID6874145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3838306 
End bp3840606 
Gene Length2301 bp 
Protein Length766 aa 
Translation table11 
GC content57% 
IMG OID642786950 
Productcellulose synthase regulator protein 
Protein accessionYP_002217578 
Protein GI198244249 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.118962 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value0.69945 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGAA AATTGTCCTG GATGTGTGCG GCGGTAATAG GATTAAGCGC GTTTCCTGCT 
TTCATGACGG CGGCGGCGCC TGCTACGCCG CCATTGATAA ATGCTGAACC CACCGAGCCT
GCGCCGTCGC CCGCAACTGA GGCGCCCGTC GTGGCACAGA CCGCGCCTTC GCGCGAGGTC
AAGCTGACCT TTGCGCAAAT CGCGCCGCCG CCGGGTAGTA TGGCGCTGCG TGGCGTTAAC
CCTAACGGCG GCATTGAATT TGGTATGCGC AGCGATGAAG TGGCGTCGAA AGCGGTGCTG
AATCTGGAAT ATACGCCCTC GCCGTCGCTC CTGCCGGTTC AGTCGCAGCT CAAGGTTTAT
CTCAATGATG AACTGATGGG CGTACTGCCG GTGACAAAAG AGCAGTTGGG GAAAAAGACG
CTGGCGCAGG TACCTATCAA TCCGCTATTT ATCACCGATT TTAACCGGGT GCGGCTGGAG
TTTGTCGGCC ACTATCGCGA CGTGTGTGAA AATCCGGCCA GCAGTACTCT GTGGTTAGAC
ATCGGGCGAA ATAGCGCCCT GGATCTGACC TATAACATGC TGGCGGTGAA TAACGATCTG
TCCCACTTCC CGGTGCCGTT TTTCGAGCCG CGGGATAACC GTCCGGTGAC GTTGCCGATA
GTGTTTGCTG ACATGCCGGA TCTGGCGCAG CAGCAGGCGG CTTCTATTGT CGCGTCCTGG
TTTGGCTCGC GGGCGGGCTG GCGCGGTCAG CGCTTCCCGG TGTTGTATAA TCACCTGCCG
GATCGCAATG CGATCGTGTT CGCCACCAAC GATCGACGCC CCGATTTCCT GCGCGATCAT
CCTGCGGTTA ACGCGCCGGT TATCGAGATG ATGAGCCATC CGGATAATCC GTATGTGAAG
TTGCTGGTCG TGTTTGGCCG TGATGATAAA GACCTGTTGC AAGCGGCAAA AGGTATCGCG
CAAGGGAATA TTCTCTTCCG TGGTTCCAGC GTGGTGGTCA ACGATGTAAA ACCGCTGCTG
GCGCGCAAAC CGTATGATGC GCCGAACTGG GTGCGTACCG ATCGCCCGGT CACTTTTGGC
GAGCTGAAAA CCTATGAAGA GCAGCTCCAG TCGAGTGGGC TGGAGCCGGC GCCCATCAAT
GTTTCTTTGA ATCTGCCGCC GGACCTCTAT TTGCTGCGTA GCAACGGTAT TGATATGGAT
CTCAACTACC GTTATACCTC GCCGCCGACC AAAGACAGTT CACGACTGGA CATCAGTCTG
AATAACCAGT TCCTGCAAGC CTTTAGCCTT AACAGCACGC AGGAAACTAA TCGACTCCTG
TTGCGCTTGC CGGTACTTCA GGGACTGCTG GATGGTAAAA CAGATGTGTC TATTCCGGCG
CTCAAACTGG GGGCGATGAA CCAACTACGT TTTGACTTCC GCTACATGAA TCCGATGCCG
GGCGGGTCGG TGGACAACTG TATTACCTTC CAGCCGGTAC CGAATCATGT GGTGATAGGG
GATGACTCCA CTATCGATTT TTCGAAATAT TACCACTTTA TCGCGATGCC GGATTTACGC
GCGTTCGCCA ATGCGGGTTT CCCGTTCAGC CGGATGGCCG ACTTGTCTGA CACGCTGGCG
GTGATGCCGA AGACCCCAAC CGAAGCGCAA ATGGAAACGC TGCTGAATAC GGTCGGTGCC
ATTGGCGGGC AGACCGGTTT CCCGGCAATT AATCTGACCA TCACCGATGA TAGCGCTCAG
ATAGCCGACA AAGACGCCGA TCTGCTGATT ATTGGCGCTA TTCCGGGCAA GCTAAAAGAT
GATAAGCGTA TCGATCTGTT GGTGCAGGCG ACACAAAGCT GGGTAAAAAC CCCGATGCGG
CAGACCGCTT TCCCGTCGAT TATGCCGGAT GAGGCCGATC GCGCGGCGGA TGCGCAGTCC
ACCGTCACCG CCAGCGGCCC GATGGCGGCG GTGGTGGGCT TCCAGTCGCC GTTTAATGAT
CAGCGCAGCG TGATTGCTCT GCTGGCTGAT AGCCCGCGCG GTTACCAGCT ACTGAACGAC
GCTGTGAACG ACAGCGGTAA ACGCGCCGCG ATGTTTGGTT CCGTGGCGGT GATCCGCGAG
TCCGGCGTTC ACAGTCTGCG CGTTGGCGAT ATCTATTACG TCGGACATCT GCCGTGGTTT
GAGCGGCTGT GGTATGCGCT GGCGAATCAC CCGGTGCTGC TGGCGGTACT GGCGGCCCTC
AGTGTGGTAT TACTGGCGTG GGTATTGTGG CGTCTGCTAC GTATTCTCAG TCGCCGTCGT
CTCGACCCTG ACCATGAGTA A
 
Protein sequence
MKRKLSWMCA AVIGLSAFPA FMTAAAPATP PLINAEPTEP APSPATEAPV VAQTAPSREV 
KLTFAQIAPP PGSMALRGVN PNGGIEFGMR SDEVASKAVL NLEYTPSPSL LPVQSQLKVY
LNDELMGVLP VTKEQLGKKT LAQVPINPLF ITDFNRVRLE FVGHYRDVCE NPASSTLWLD
IGRNSALDLT YNMLAVNNDL SHFPVPFFEP RDNRPVTLPI VFADMPDLAQ QQAASIVASW
FGSRAGWRGQ RFPVLYNHLP DRNAIVFATN DRRPDFLRDH PAVNAPVIEM MSHPDNPYVK
LLVVFGRDDK DLLQAAKGIA QGNILFRGSS VVVNDVKPLL ARKPYDAPNW VRTDRPVTFG
ELKTYEEQLQ SSGLEPAPIN VSLNLPPDLY LLRSNGIDMD LNYRYTSPPT KDSSRLDISL
NNQFLQAFSL NSTQETNRLL LRLPVLQGLL DGKTDVSIPA LKLGAMNQLR FDFRYMNPMP
GGSVDNCITF QPVPNHVVIG DDSTIDFSKY YHFIAMPDLR AFANAGFPFS RMADLSDTLA
VMPKTPTEAQ METLLNTVGA IGGQTGFPAI NLTITDDSAQ IADKDADLLI IGAIPGKLKD
DKRIDLLVQA TQSWVKTPMR QTAFPSIMPD EADRAADAQS TVTASGPMAA VVGFQSPFND
QRSVIALLAD SPRGYQLLND AVNDSGKRAA MFGSVAVIRE SGVHSLRVGD IYYVGHLPWF
ERLWYALANH PVLLAVLAAL SVVLLAWVLW RLLRILSRRR LDPDHE