Gene EcSMS35_3841 details


Gene Information

Locus tag: EcSMS35_3841
Symbol: bcsB
ID: 6147347
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3914806
End bp: 3917115
Gene Length: 2310 bp
Protein Length: 769 aa
Translation table: 11
GC content: 55%
IMG OID: 641618667
Product: cellulose synthase regulator protein
Protein accession: YP_001745807
Protein GI: 170681414
COG category:
COG ID:
TIGRFAM ID:
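
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is a minimal Python illustration, assuming the start and end positions are 1-based and inclusive and that the last codon is a stop codon not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 3914806
END_BP = 3917115
GENE_LENGTH_BP = 2310
PROTEIN_LENGTH_AA = 769


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2310
print(expected_protein_length(GENE_LENGTH_BP))  # 769
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.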


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.811358
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTATGG GGATGAGTGC GTTCCCCTCT TTCATGACGC AGGCGACGCC AGCAACGCAA 
CCACTGATCA ATGCTGAGCC AGCTGTAGCC GCCCAGACGG AACAAAATCC GCAGGTGGGG
CAAGTGATGC CGGGCGTGCA GGGCGCTGAT GCCCCCATCG TGGCGCAGAA CGGTCCTTCG
CGTGATGTGA AGCTGACCTT TGCGCAAATT GCGCCGCCGC CGGGCAGCAT GGTGCTACGT
GGCATTAACC CGAACGGCAG CATTGAGTTT GGTATGCGCA GTGATGAAGT GGTGACGAAG
GCGATGCTCA ACCTCGAATA CACGCCATCG CCATCGTTAC TGCCTGTCCA GTCGCAGTTA
AAGGTTTATC TCAATGATGA ACTGATGGGC GTGCTGCCAG TGACCAAAGA ACAGTTGGGT
AAAAAAACGC TGGCGCAAAT GCCCATTAAC CCACTGTTTA TTACCGACTT CAACCGTGTG
CGGCTGGAGT TTGTCGGCCA TTATCAGGAC GTGTGCGAAA ACCCGGCCAG CACCACGCTT
TGGCTGGATG TTGGGCGGAG CAGTGGACTG GATTTGACCT ATCAGACCCT GAATGTGAAG
AATGACCTGT CACACTTCCC GGTGCCATTC TTTGACCCGC GTGATAACCG CACCAACACC
TTGCCGATGG TCTTTGCGGG TGCGCCGGAT GTTGGGCTGC AACAAGCATC TGCCATTGTC
GCCTCGTGGT TTGGTTCGCG TTCTGGCTGG CGTGGGCAGA ACTTCCCGGT GCTCTATAAC
CAACTGCCGG ATCGCAATGC CATTGTTTTT GCCACTAATG ACAAACGGCC GGACTTCCTG
CGCGATCATC CGGCGGTAAA AGCCCCGGTG ATTGAAATGA TCAACCATCC GCAGAATCCT
TACGTCAAAC TGCTGGTGGT CTTTGGTCGT GACGACAAAG ACCTGTTGCA GGCAGCGAAA
GGTATCGCTC AGGGTAACAT TCTGTTCCGT GGTGAAAGCG TGGTAGTGAA TGAAGTGAAA
CCGCTGCTAC CGCGTAAGCC GTACGATGCG CCGAACTGGG TGCGTACCGA TCGTCCGGTC
ACCTTTGGTG AACTGAAAAC CTATGAAGAA CAGTTACAAT CCAGCGGTCT TGAGCCAGCA
GCGATTAACG TTTCGCTAAA TCTGCCGCCG GATCTCTACC TGATGCGCAG TACCGGCATT
GATATGGATA TCAACTATCG CTACACCATG CCGCCGGTGA AAGACAGTTC GCGGATGGAT
ATCAGCCTGA ATAACCAGTT CCTGCAATCC TTCAACCTGA GCAGTAAACA GGAGGCGAAC
CGCCTGCTGC TGCGGATTCC GGTATTACAA GGTTTGCTGG ATGGCAAAAC GGATGTCTCT
ATTCCGGCGC TGAAACTGGG CGCGACCAAC CAGTTGCGGT TCGACTTTGA GTATATGAAC
CCGATGCCGG GCGGTTCGGT GGATAACTGT ATTACCTTCC AGCCGGTGCA GAATCATGTG
GTGATTGGTG ACGACTCCAC CATCGACTTC TCGAAGTATT ACCACTTCAT CCCGATGCCG
GATCTACGCG CCTTTGCTAA CGCGGGCTTC CCGTTCAGCC GGATGGCGGA TCTGTCACAA
ACCATCACCG TGATGCCGAA AGCGCCTAAC GAAGCACAGA TGGAAACCTT GCTGAATACC
GTTGGCTTTA TCGGTGCGCA GACGGGCTTC CCGGCGATTA ACCTGACGGT GACCGATGAT
GGCAGCACCA TTCAGGGCAA AGATGCCGAC ATCATGATCG TCGGTGGTAT CCCGGACAAA
CTGAAAGACG ACAAGCAGAT CGACCTGTTG GTGCAGGCGA CCGAAAGCTG GGTGAAAACG
CCGATGCGCC AGACCCCGTT CCCCGGCATT GTACCGGACG AGAGCGATCG CGCGGCAGAA
ACTCAGTCAA CGCTGACCTC TTCCGGTGCG ATGGCAGCGG TGATTGGCTT CCAGTCGCCG
TATAACGACC AGCGTAGCGT GATTGCGCTG CTGGCAGATA GCCCACGCGG TTATGAAATG
CTTAACGATG CGGTGAACGA TAGCGGCAAA CGCGCCACCA TGTTCGGTTC GGTCGCGGTG
ATCCGCGAGT CCGGTATCAA TAGCCTGCGT GTTGGCGACG TCTATTACGT TGGCCATCTG
CCGTGGTTCG AACGCTTATG GTATGCCCTG GCAAACCATC CGATTCTGCT GGCGGTGCTG
GCGGCAATCA GCGTCGTGCT GCTGGCATGG GTACTGTGGC GTTTGCTGCG TATTATCAGC
CGTCGTCGTC TTAACCCGGA TAACGAGTAA
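
The gene sequence is printed in space-separated blocks of ten bases. As a minimal sketch (the helper names are hypothetical), the following Python snippet strips that layout and computes the G+C fraction. Applied to the full gene sequence, the result can be compared with the GC content field, which is reported as 55%.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence only; paste all lines to check the whole gene.
first_line = "GTGGCTATGG GGATGAGTGC GTTCCCCTCT TTCATGACGC AGGCGACGCC AGCAACGCAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))  # 60 0.6
```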
 
Protein sequence
MAMGMSAFPS FMTQATPATQ PLINAEPAVA AQTEQNPQVG QVMPGVQGAD APIVAQNGPS 
RDVKLTFAQI APPPGSMVLR GINPNGSIEF GMRSDEVVTK AMLNLEYTPS PSLLPVQSQL
KVYLNDELMG VLPVTKEQLG KKTLAQMPIN PLFITDFNRV RLEFVGHYQD VCENPASTTL
WLDVGRSSGL DLTYQTLNVK NDLSHFPVPF FDPRDNRTNT LPMVFAGAPD VGLQQASAIV
ASWFGSRSGW RGQNFPVLYN QLPDRNAIVF ATNDKRPDFL RDHPAVKAPV IEMINHPQNP
YVKLLVVFGR DDKDLLQAAK GIAQGNILFR GESVVVNEVK PLLPRKPYDA PNWVRTDRPV
TFGELKTYEE QLQSSGLEPA AINVSLNLPP DLYLMRSTGI DMDINYRYTM PPVKDSSRMD
ISLNNQFLQS FNLSSKQEAN RLLLRIPVLQ GLLDGKTDVS IPALKLGATN QLRFDFEYMN
PMPGGSVDNC ITFQPVQNHV VIGDDSTIDF SKYYHFIPMP DLRAFANAGF PFSRMADLSQ
TITVMPKAPN EAQMETLLNT VGFIGAQTGF PAINLTVTDD GSTIQGKDAD IMIVGGIPDK
LKDDKQIDLL VQATESWVKT PMRQTPFPGI VPDESDRAAE TQSTLTSSGA MAAVIGFQSP
YNDQRSVIAL LADSPRGYEM LNDAVNDSGK RATMFGSVAV IRESGINSLR VGDVYYVGHL
PWFERLWYAL ANHPILLAVL AAISVVLLAW VLWRLLRIIS RRRLNPDNE
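
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is a minimal, unoptimized Python illustration with hypothetical helper names. It assumes that the codon assignments of table 11 match the standard genetic code, and that an initiator codon such as GTG is read as methionine at the first position.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(text: str, starts_with_initiator: bool = True) -> str:
    """Translate a block-formatted coding sequence into one-letter amino acids.

    Assumption: when starts_with_initiator is True, the first codon is
    treated as a start codon and written as M whatever it normally encodes.
    A trailing stop codon is dropped from the output.
    """
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[seq[pos:pos + 3]] for pos in range(0, len(seq), 3)]
    if protein and starts_with_initiator:
        protein[0] = "M"
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 30 bases of the gene sequence (begins with the GTG start codon).
print(translate_cds("GTGGCTATGG GGATGAGTGC GTTCCCCTCT"))  # MAMGMSAFPS
# Last 30 bases of the gene sequence (ends with the TAA stop codon).
print(translate_cds("CGTCGTCGTC TTAACCCGGA TAACGAGTAA",
                    starts_with_initiator=False))          # RRRLNPDNE
```

Both fragments match the first and last residues of the listed protein sequence.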