Gene Information |
Locus tag | EcSMS35_3841 |
Symbol | bcsB |
ID | 6147347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3914806 |
End bp | 3917115 |
Gene Length | 2310 bp |
Protein Length | 769 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641618667 |
Product | cellulose synthase regulator protein |
Protein accession | YP_001745807 |
Protein GI | 170681414 |
COG category | |
COG ID | |
TIGRFAM ID | |
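
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only and is not part of the source record. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS length includes a terminal stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information table of this record.
start_bp = 3914806
end_bp = 3917115
gene_length_bp = 2310
protein_length_aa = 769


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one, assuming the CDS ends with a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

The listed sequences agree with this reading: the gene sequence ends in the stop codon TAA, and its first codon GTG is an alternative start codon under translation table 11, which is why the protein sequence begins with M.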
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.811358 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTATGG GGATGAGTGC GTTCCCCTCT TTCATGACGC AGGCGACGCC AGCAACGCAA CCACTGATCA ATGCTGAGCC AGCTGTAGCC GCCCAGACGG AACAAAATCC GCAGGTGGGG CAAGTGATGC CGGGCGTGCA GGGCGCTGAT GCCCCCATCG TGGCGCAGAA CGGTCCTTCG CGTGATGTGA AGCTGACCTT TGCGCAAATT GCGCCGCCGC CGGGCAGCAT GGTGCTACGT GGCATTAACC CGAACGGCAG CATTGAGTTT GGTATGCGCA GTGATGAAGT GGTGACGAAG GCGATGCTCA ACCTCGAATA CACGCCATCG CCATCGTTAC TGCCTGTCCA GTCGCAGTTA AAGGTTTATC TCAATGATGA ACTGATGGGC GTGCTGCCAG TGACCAAAGA ACAGTTGGGT AAAAAAACGC TGGCGCAAAT GCCCATTAAC CCACTGTTTA TTACCGACTT CAACCGTGTG CGGCTGGAGT TTGTCGGCCA TTATCAGGAC GTGTGCGAAA ACCCGGCCAG CACCACGCTT TGGCTGGATG TTGGGCGGAG CAGTGGACTG GATTTGACCT ATCAGACCCT GAATGTGAAG AATGACCTGT CACACTTCCC GGTGCCATTC TTTGACCCGC GTGATAACCG CACCAACACC TTGCCGATGG TCTTTGCGGG TGCGCCGGAT GTTGGGCTGC AACAAGCATC TGCCATTGTC GCCTCGTGGT TTGGTTCGCG TTCTGGCTGG CGTGGGCAGA ACTTCCCGGT GCTCTATAAC CAACTGCCGG ATCGCAATGC CATTGTTTTT GCCACTAATG ACAAACGGCC GGACTTCCTG CGCGATCATC CGGCGGTAAA AGCCCCGGTG ATTGAAATGA TCAACCATCC GCAGAATCCT TACGTCAAAC TGCTGGTGGT CTTTGGTCGT GACGACAAAG ACCTGTTGCA GGCAGCGAAA GGTATCGCTC AGGGTAACAT TCTGTTCCGT GGTGAAAGCG TGGTAGTGAA TGAAGTGAAA CCGCTGCTAC CGCGTAAGCC GTACGATGCG CCGAACTGGG TGCGTACCGA TCGTCCGGTC ACCTTTGGTG AACTGAAAAC CTATGAAGAA CAGTTACAAT CCAGCGGTCT TGAGCCAGCA GCGATTAACG TTTCGCTAAA TCTGCCGCCG GATCTCTACC TGATGCGCAG TACCGGCATT GATATGGATA TCAACTATCG CTACACCATG CCGCCGGTGA AAGACAGTTC GCGGATGGAT ATCAGCCTGA ATAACCAGTT CCTGCAATCC TTCAACCTGA GCAGTAAACA GGAGGCGAAC CGCCTGCTGC TGCGGATTCC GGTATTACAA GGTTTGCTGG ATGGCAAAAC GGATGTCTCT ATTCCGGCGC TGAAACTGGG CGCGACCAAC CAGTTGCGGT TCGACTTTGA GTATATGAAC CCGATGCCGG GCGGTTCGGT GGATAACTGT ATTACCTTCC AGCCGGTGCA GAATCATGTG GTGATTGGTG ACGACTCCAC CATCGACTTC TCGAAGTATT ACCACTTCAT CCCGATGCCG GATCTACGCG CCTTTGCTAA CGCGGGCTTC CCGTTCAGCC GGATGGCGGA TCTGTCACAA ACCATCACCG TGATGCCGAA AGCGCCTAAC GAAGCACAGA TGGAAACCTT GCTGAATACC GTTGGCTTTA TCGGTGCGCA GACGGGCTTC CCGGCGATTA ACCTGACGGT GACCGATGAT GGCAGCACCA TTCAGGGCAA AGATGCCGAC ATCATGATCG TCGGTGGTAT CCCGGACAAA 
CTGAAAGACG ACAAGCAGAT CGACCTGTTG GTGCAGGCGA CCGAAAGCTG GGTGAAAACG CCGATGCGCC AGACCCCGTT CCCCGGCATT GTACCGGACG AGAGCGATCG CGCGGCAGAA ACTCAGTCAA CGCTGACCTC TTCCGGTGCG ATGGCAGCGG TGATTGGCTT CCAGTCGCCG TATAACGACC AGCGTAGCGT GATTGCGCTG CTGGCAGATA GCCCACGCGG TTATGAAATG CTTAACGATG CGGTGAACGA TAGCGGCAAA CGCGCCACCA TGTTCGGTTC GGTCGCGGTG ATCCGCGAGT CCGGTATCAA TAGCCTGCGT GTTGGCGACG TCTATTACGT TGGCCATCTG CCGTGGTTCG AACGCTTATG GTATGCCCTG GCAAACCATC CGATTCTGCT GGCGGTGCTG GCGGCAATCA GCGTCGTGCT GCTGGCATGG GTACTGTGGC GTTTGCTGCG TATTATCAGC CGTCGTCGTC TTAACCCGGA TAACGAGTAA
|
Protein sequence | MAMGMSAFPS FMTQATPATQ PLINAEPAVA AQTEQNPQVG QVMPGVQGAD APIVAQNGPS RDVKLTFAQI APPPGSMVLR GINPNGSIEF GMRSDEVVTK AMLNLEYTPS PSLLPVQSQL KVYLNDELMG VLPVTKEQLG KKTLAQMPIN PLFITDFNRV RLEFVGHYQD VCENPASTTL WLDVGRSSGL DLTYQTLNVK NDLSHFPVPF FDPRDNRTNT LPMVFAGAPD VGLQQASAIV ASWFGSRSGW RGQNFPVLYN QLPDRNAIVF ATNDKRPDFL RDHPAVKAPV IEMINHPQNP YVKLLVVFGR DDKDLLQAAK GIAQGNILFR GESVVVNEVK PLLPRKPYDA PNWVRTDRPV TFGELKTYEE QLQSSGLEPA AINVSLNLPP DLYLMRSTGI DMDINYRYTM PPVKDSSRMD ISLNNQFLQS FNLSSKQEAN RLLLRIPVLQ GLLDGKTDVS IPALKLGATN QLRFDFEYMN PMPGGSVDNC ITFQPVQNHV VIGDDSTIDF SKYYHFIPMP DLRAFANAGF PFSRMADLSQ TITVMPKAPN EAQMETLLNT VGFIGAQTGF PAINLTVTDD GSTIQGKDAD IMIVGGIPDK LKDDKQIDLL VQATESWVKT PMRQTPFPGI VPDESDRAAE TQSTLTSSGA MAAVIGFQSP YNDQRSVIAL LADSPRGYEM LNDAVNDSGK RATMFGSVAV IRESGINSLR VGDVYYVGHL PWFERLWYAL ANHPILLAVL AAISVVLLAW VLWRLLRIIS RRRLNPDNE