Gene Information |
Locus tag | SbBS512_E4000 |
Symbol | bcsE |
ID | 6269876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3729398 |
End bp | 3730534 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641727846 |
Product | cellulose biosynthesis protein BcsE |
Protein accession | YP_001882278 |
Protein GI | 187732428 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03369] cellulose biosynthesis protein BcsE |
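The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon, which is consistent with the values in this record but is not stated by it. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3729398, 3730534)
print(length)                     # 1137
print(protein_length_aa(length))  # 378
```

Both results agree with the Gene Length (1137 bp) and Protein Length (378 aa) fields of this record.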
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.878993 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGTAATTA ATCCCGGAAA TAATAACGAT AAACAATTTT CATTGTTGCT TGAGGAATAC CGTTCACTTT TTGGTCTTGC CAGTTTGCGT TTTCAGGGTG ACCAACATTT GCTGGATATT GCCTTCTGGT GCAACGAAAA AGGGGTCAGC GCCCGTCAGC AGCTTAGCGT TCAGCAACAA AATGGTATCT GGACATTAGT TCAAAGCGAA GAGGCGGAGA TCCAACCACG CAGCGACGAA AAACGCATTC TGAGTAATGT TGCTGTACTG GAAGGTGCGC CGCCGCTATC GGAACACTGG CAACTGTTCA ACAATAACGA AGTCCTGTTC AATGAAGCCC GTACCGCTCA GGCGGCGACG GTGGTCTTTT CTTTACAGCA AAATGCGCAA ATCGAGCCAC TGGCCCGCAG CATTCATACC CTGCGTCGCC AGCGCGGTAG TGCGATGAAA ATCCTCGTGC GGGAAAATAC CGCTAGCCTG CGCGCCACCG ATGAACGTTT GTTATTGGCC TGCGGTGCAA ATATGGTTAT TCCGTGGAAT GCGCCACTCT CCCGTTGTCT GACGATGATC GAAAGCGTGC AAGGGCAGAA GTTTAGTCGC TATGTGCCGG AAGATATCAC TACCTTGCTG TCAATGACCC AGCCGCTCAA ACTGCGTGGT TTCCAGAAGT GGGATGTGTT CTGTAATGCC GTCAACAACA TGATGAATAA CCCTCTATTA CCTGCCCACG GTAAAGGCGT TCTGGTTGCC CTACGTCCGG TACCGGGTAT CCGCGTTGAA CAAGCCCTGA CGCTGTGTCG CCCTAACCGT ACCGGCGATA TCATGACCAT TGGCGGTAAT CGGCTGGTGC TGTTTCTCTC ATTCTGTCGG ATTAACGATC TGGATACCGC GTTGAATCAT ATTTTCCCAT TGCCTACTGG CGACATTTTC TCAAACCGTA TGGTCTGGTT TGAAGATGAT CAAATCAGTG CCGAGCTGGT GCAGATGCGC CTGCTTGCCC CAGAACAATG GGGCATGCCG CTGCCTTTAA CGCAAAGTTC TAAACCGGTC ATCAATGCCG AGCACGATGG TCGCCACTGG CGACGAATAC CAGAACCCAT GCGACTGTTA GATGATGCTG TGGAGCGCTC ATCATGA
|
Protein sequence | MVINPGNNND KQFSLLLEEY RSLFGLASLR FQGDQHLLDI AFWCNEKGVS ARQQLSVQQQ NGIWTLVQSE EAEIQPRSDE KRILSNVAVL EGAPPLSEHW QLFNNNEVLF NEARTAQAAT VVFSLQQNAQ IEPLARSIHT LRRQRGSAMK ILVRENTASL RATDERLLLA CGANMVIPWN APLSRCLTMI ESVQGQKFSR YVPEDITTLL SMTQPLKLRG FQKWDVFCNA VNNMMNNPLL PAHGKGVLVA LRPVPGIRVE QALTLCRPNR TGDIMTIGGN RLVLFLSFCR INDLDTALNH IFPLPTGDIF SNRMVWFEDD QISAELVQMR LLAPEQWGMP LPLTQSSKPV INAEHDGRHW RRIPEPMRLL DDAVERSS
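The gene sequence starts with TTG, yet the protein sequence starts with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the permitted initiation codons and is read as methionine at the first position. The following is a minimal sketch of that relationship, not the pipeline used to build this record. The helper names `clean`, `gc_content` and `translate_cds` are hypothetical; the codon table is built from the standard NCBI base ordering, and the sketch assumes the input contains only A, C, G and T.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G), paired with the table 11 amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and normalise case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an initiation codon at position 0 as M."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append("M" if index == 0 and codon in START_CODONS else aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
head = "TTGGTAATTA ATCCCGGAAA TAATAACGAT"
print(translate_cds(head))  # MVINPGNNND
```

The output matches the first ten residues of the protein sequence listed in this record. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.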
|
| |