Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_03337 |
Symbol | bcsE |
ID | 8112574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 3556857 |
End bp | 3558428 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 644849512 |
Product | hypothetical protein |
Protein accession | YP_003001085 |
Protein GI | 251786781 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03369] cellulose biosynthesis protein BcsE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.362213 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGACA TTGTGGACCC TGTATTCTCT ATCGGTATCT CATCATTATG GGATGAGCTG CGACATATGC CAGCAGGCGG CGTCTGGTGG TTTAACGTCG ATCGCCATGA AGATGCTATC AGTCTGGCGA ATCAAACAAT TGCATCCCAG GCTGAAACCG CACACGTCGC GGTCATTAGC ATGGACAGCG ATCCGGCGAA AATCTTTCAA TTAGATGATT CTCAAGGGCC GGAAAAAATA AAATTATTTT CAATGCTAAA TCATGAAAAA GGTCTATACT ATTTGACCCG TGATTTGCAG TGTTCTATTG ATCCCCATAA TTACCTTTTT ATTCTTGTTT GCGCAAATAA CGCATGGCAA AACATTCCTG CCGAGCGGCT TCGCTCATGG TTGGATAAAA TGAATAAATG GAGCAGGTTA AACCATTGTT CGCTTTTGGT AATTAATCCC GGAAATAATA ACGATAAACA ATTTTCATTG TTGCTTGAGG AATACCGTTC ACTTTTTGGT CTTGCCAGTT TGCGTTTTCA GGGTGACCAA CATTTGCTGG ATATTGCCTT CTGGTGCAAC GAAAAAGGGG TCAGCGCCCG TCAGCAGCTT AGCGTTCAGC AACAAAATGG TATCTGGACA TTAGTTCAAA GCGAAGAGGC GGAGATCCAA CCACGCAGCG ACGAAAAACG CATTCTGAGT AATGTTGCTG TACTGGAAGG TGCGCCGCCG CTATCGGAAC ACTGGCAACT GTTCAACAAT AACGAAGTCC TGTTCAATGA AGCCCGTACC GCTCAGGCGG CGACGGTGGT CTTTTCTTTA CAGCAAAATG CGCAAATCGA GCCACTGGCC CGCAGCATTC ATACCCTGCG TCGCCAGCGC GGTAGTGCGA TGAAAATCCT CGTGCGGGAA AATACCGCTA GCCTGCGCGC CACCGATGAA CGTTTGTTAT TGGCCTGCGG TGCAAATATG GTTATTCCGT GGAATGCGCC ACTCTCCCGT TGTCTGACGA TGATCGAAAG CGTGCAAGGG CAGAAGTTTA GTCGCTATGT GCCGGAAGAT ATCACTACCT TGCTGTCAAT GACCCAGCCG CTCAAACTGC GTGGTTTCCA GAAGTGGGAT GTGTTCTGTA ATGCCGTCAA CAACATGATG AATAACCCTC TATTACCTGC CCACGGTAAA GGCGTTCTGG TTGCCCTACG TCCGGTACCG GGTATCCGCG TTGAACAAGC CCTGACGCTG TGTCGCCCTA ACCGTACCGG CGATATCATG ACCATTGGCG GTAATCGGCT GGTGCTGTTT CTCTCATTCT GTCGGATTAA CGATCTGGAT ACCGCGTTGA ATCATATTTT CCCATTGCCT ACTGGCGACA TTTTCTCAAA CCGTATGGTC TGGTTTGAAG ATGATCAAAT CAGTGCCGAG CTGGTGCAGA TGCGCTTGCT TGCCCCAGAA CAATGGGGCA TGCCGCTGCC TTTAACGCAA AGTTCTAAAC CGGTCATCAA TGCCGAGCAC GATGGTCGCC ACTGGCGACG AATACCAGAA CCCATGCGAC TGTTAGATGA TGCTGTGGAG CGCTCATCAT GA
|
Protein sequence | MRDIVDPVFS IGISSLWDEL RHMPAGGVWW FNVDRHEDAI SLANQTIASQ AETAHVAVIS MDSDPAKIFQ LDDSQGPEKI KLFSMLNHEK GLYYLTRDLQ CSIDPHNYLF ILVCANNAWQ NIPAERLRSW LDKMNKWSRL NHCSLLVINP GNNNDKQFSL LLEEYRSLFG LASLRFQGDQ HLLDIAFWCN EKGVSARQQL SVQQQNGIWT LVQSEEAEIQ PRSDEKRILS NVAVLEGAPP LSEHWQLFNN NEVLFNEART AQAATVVFSL QQNAQIEPLA RSIHTLRRQR GSAMKILVRE NTASLRATDE RLLLACGANM VIPWNAPLSR CLTMIESVQG QKFSRYVPED ITTLLSMTQP LKLRGFQKWD VFCNAVNNMM NNPLLPAHGK GVLVALRPVP GIRVEQALTL CRPNRTGDIM TIGGNRLVLF LSFCRINDLD TALNHIFPLP TGDIFSNRMV WFEDDQISAE LVQMRLLAPE QWGMPLPLTQ SSKPVINAEH DGRHWRRIPE PMRLLDDAVE RSS
|
| |