Gene Information |
Locus tag | SNSL254_A3891 |
Symbol | bcsA |
ID | 6483519 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 3775395 |
End bp | 3778019 |
Gene Length | 2625 bp |
Protein Length | 874 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642739155 |
Product | cellulose synthase catalytic subunit |
Protein accession | YP_002042866 |
Protein GI | 194445847 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | [TIGR03030] cellulose synthase catalytic subunit (UDP-forming) |
|
|
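The coordinate and length fields above are consistent with one another, which the following minimal sketch checks. It assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the record does not state either convention. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 3775395
end_bp = 3778019

# Assumption: coordinates are 1-based and inclusive,
# so both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon, not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the results agree with the reported Gene Length (2625 bp) and Protein Length (874 aa).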
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.894407 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCCC TTTCCCGGTG GCTGCTTATC CCGCCGGTTA GCGCGCGTTT GAGCGAGCGC TATCAGGGTT ACCGCCGCCA CGGCGCGTCG CCGTTTAGCG CAGCGCTCGG CTGCCTGTGG ACGATTCTGG CGTGGATAGT GTTTCCGCTT GAGCATCCGC GCTGGCAGCG TATTCGCGAC GGGCATAAAG CGCTTTATCC GCATATTAAC GCCGCCCGCC CGCGCCCGCT GGACCCGGCC CGTTACCTCA TTCAGACCCT CTGGCTGGTG ATGATCTCGT CGACTAAAGA ACGTCATGAA CCGCGCTGGC GGTCATTTGC GCGCCTGAAG GACGTTCGTG GACGTTACCA TCAATGGATG GATACCTTAC CGGAACGGGT GCGCCAAAAG ACAACGCACC TGGAAAAGGA AAAAGAGCTG GGACATCTTA GCAACGGTGC CCGGCGTTTT ATTCTGGGCG TTATCGTGAC CTTTTCACTG ATACTGGCGC TGATCTGTAT TACGCAGCCA TTTAACCCGC TATCGCAATT TATCTTTCTG CTGTTGCTGT GGGGCGTTGC GCTGTTGGTA CGACGTATGC CGGGACGCTT TTCCGCGCTG ATGCTCATCG TGCTGTCGTT AACGGTCTCC TGTCGTTATA TCTGGTGGCG CTATACCTCG ACGCTAAATT GGGACGACCC GGTCAGTCTG GTGTGCGGGC TGATTCTGCT GTTTGCGGAA ACCTACGCCT GGATTGTGCT GGTGCTGGGG TACTTCCAGG TGGTGTGGCC GTTAAATCGT CAACCGGTGC CGTTGCCGAA AGAAATGTCG CAGTGGCCGA CGGTGGATAT TTTTGTACCG ACCTATAACG AAGACCTTAA CGTGGTCAAA AATACCATTT ACGCTTCGCT TGGCATTGAC TGGCCGAAGG ACAAGCTGAA TATCTGGATC CTTGATGACG GCGGGCGTGA ATCATTTCGT CATTTTGCCC GACATGTTGG CGTGCATTAC ATTGCCCGCG CCACGCATGA ACACGCCAAA GCCGGCAACA TCAACAATGC GCTAAAACAC GCGAAAGGCG AGTTTGTGGC GATCTTCGAT TGCGACCATG TGCCGACACG CTCGTTTCTG CAAATGACAA TGGGCTGGTT CCTGAAAGAA AAACAGCTGG CGATGATGCA GACGCCGCAT CATTTCTTCT CCCCGGACCC GTTTGAGCGC AACCTGGGAC GTTTTCGTAA AACGCCTAAC GAAGGCACGC TGTTTTACGG ACTGGTGCAG GACGGTAACG ATATGTGGGA CGCCACTTTC TTTTGCGGAT CGTGCGCGGT GATTCGCCGT AAGCCGCTGG ATGAGATTGG CGGTATCGCC GTTGAGACGG TGACGGAAGA TGCGCATACT TCGCTCCGGC TGCACCGGCG AGGTTATACC TCAGCGTATA TGCGCATTCC GCAGGCGGCG GGGCTGGCGA CGGAAAGCCT GTCGGCGCAT ATCGGGCAGC GTATTCGTTG GGCGCGGGGC ATGGTGCAAA TTTTCCGCCT CGATAACCCT CTGTTTGGTA AAGGCTTAAA ACTGGCGCAG CGGCTGTGCT ACCTCAACGC GATGTTCCAT TTCTTGTCCG GCATTCCGCG CCTGATCTTT CTGACCGCGC CGCTGGCTTT TCTGCTGCTG CACGCCTATA TCATTTATGC GCCTGCGTTG ATGATTGCGC TATTTGTGAT ACCGCACATG GTCCACGCCA GCCTGACTAA TTCGAAGATT CAGGGCAAGT ATCGTCACTC TTTCTGGAGT GAAATCTATG AAACGGTACT GGCATGGTAT 
ATCGCACCAC CGACTCTGGT CGCGTTGATC AATCCGCACA AAGGGAAATT TAACGTCACG GCGAAAGGCG GACTGGTGGA AGAGAAGTAC GTCGACTGGG TAATCTCGCG TCCGTATATC TTCCTTGTCT TGCTTAACCT GCTCGGCGTG GCGGCGGGCG TATGGCGATA CTATTACGGG CCGGAAAATG AAACGCTGAC CGTCATCGTT AGCCTGGTGT GGGTCTTCTA CAACCTGGTC ATTCTCGGCG GCGCGGTTGC GGTTTCGGTA GAGAGTAAAC AGGTCAGGCG CGCGCATCGG GTCGAGATTG CCATGCCGGG GGCCATCGCC CGCGAAGATG GACATTTGTT CTCCTGTACC GTACATGACT TCTCCGACGG CGGGTTAGGC ATCAAGATCA ACGGTCAGGC GCAGGTGCTG GAAGGGCAGA AAGTGAATCT GTTGCTTAAA CGCGGGCAGC AGGAATATGT CTTTCCAACG CAGGTGGTGC GCGTAACGGG CAATGAGGTC GGGCTGCAAC TGATGCCGCT CACCACCAAA CAACATATTG ATTTTGTGCA GTGTACCTTC GCGCGCGCCG ATACGTGGGC GCTTTGGCAA GATAGCTTCC CGGAAGATAA ACCGCTGGAA AGCTTGCTGG ATATTCTGAA GCTGGGCTTC CGCGGATATC GCCACCTCGC GGAGTTCGCG CCGCCTTCAG TAAAAGTAAT TTTCCGATCG TTGACGGCGT TAATTGCCTG GATTGTATCG TTTATTCCGC GTCGCCCGGA GCGGCAAGCG GCGATACAGC CGTCGGATCG GGTTATGGCT CAGGCTCAAC AATGA
|
Protein sequence | MSALSRWLLI PPVSARLSER YQGYRRHGAS PFSAALGCLW TILAWIVFPL EHPRWQRIRD GHKALYPHIN AARPRPLDPA RYLIQTLWLV MISSTKERHE PRWRSFARLK DVRGRYHQWM DTLPERVRQK TTHLEKEKEL GHLSNGARRF ILGVIVTFSL ILALICITQP FNPLSQFIFL LLLWGVALLV RRMPGRFSAL MLIVLSLTVS CRYIWWRYTS TLNWDDPVSL VCGLILLFAE TYAWIVLVLG YFQVVWPLNR QPVPLPKEMS QWPTVDIFVP TYNEDLNVVK NTIYASLGID WPKDKLNIWI LDDGGRESFR HFARHVGVHY IARATHEHAK AGNINNALKH AKGEFVAIFD CDHVPTRSFL QMTMGWFLKE KQLAMMQTPH HFFSPDPFER NLGRFRKTPN EGTLFYGLVQ DGNDMWDATF FCGSCAVIRR KPLDEIGGIA VETVTEDAHT SLRLHRRGYT SAYMRIPQAA GLATESLSAH IGQRIRWARG MVQIFRLDNP LFGKGLKLAQ RLCYLNAMFH FLSGIPRLIF LTAPLAFLLL HAYIIYAPAL MIALFVIPHM VHASLTNSKI QGKYRHSFWS EIYETVLAWY IAPPTLVALI NPHKGKFNVT AKGGLVEEKY VDWVISRPYI FLVLLNLLGV AAGVWRYYYG PENETLTVIV SLVWVFYNLV ILGGAVAVSV ESKQVRRAHR VEIAMPGAIA REDGHLFSCT VHDFSDGGLG IKINGQAQVL EGQKVNLLLK RGQQEYVFPT QVVRVTGNEV GLQLMPLTTK QHIDFVQCTF ARADTWALWQ DSFPEDKPLE SLLDILKLGF RGYRHLAEFA PPSVKVIFRS LTALIAWIVS FIPRRPERQA AIQPSDRVMA QAQQ
|
| |
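The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is not the pipeline that produced this record. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code, differing only in permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
prefix = "ATGAGCGCCC TTTCCCGGTG GCTGCTTATC"
print(translate(prefix))
```

The translated prefix can be compared with the first ten residues of the protein sequence listed above (MSALSRWLLI). Running the same function on the full gene sequence would be the natural next check.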