Gene B21_03337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03337 
SymbolbcsE 
ID8112574 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3556857 
End bp3558428 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content49% 
IMG OID644849512 
Producthypothetical protein 
Protein accessionYP_003001085 
Protein GI251786781 
COG category 
COG ID 
TIGRFAM ID[TIGR03369] cellulose biosynthesis protein BcsE 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.362213 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGGACA TTGTGGACCC TGTATTCTCT ATCGGTATCT CATCATTATG GGATGAGCTG 
CGACATATGC CAGCAGGCGG CGTCTGGTGG TTTAACGTCG ATCGCCATGA AGATGCTATC
AGTCTGGCGA ATCAAACAAT TGCATCCCAG GCTGAAACCG CACACGTCGC GGTCATTAGC
ATGGACAGCG ATCCGGCGAA AATCTTTCAA TTAGATGATT CTCAAGGGCC GGAAAAAATA
AAATTATTTT CAATGCTAAA TCATGAAAAA GGTCTATACT ATTTGACCCG TGATTTGCAG
TGTTCTATTG ATCCCCATAA TTACCTTTTT ATTCTTGTTT GCGCAAATAA CGCATGGCAA
AACATTCCTG CCGAGCGGCT TCGCTCATGG TTGGATAAAA TGAATAAATG GAGCAGGTTA
AACCATTGTT CGCTTTTGGT AATTAATCCC GGAAATAATA ACGATAAACA ATTTTCATTG
TTGCTTGAGG AATACCGTTC ACTTTTTGGT CTTGCCAGTT TGCGTTTTCA GGGTGACCAA
CATTTGCTGG ATATTGCCTT CTGGTGCAAC GAAAAAGGGG TCAGCGCCCG TCAGCAGCTT
AGCGTTCAGC AACAAAATGG TATCTGGACA TTAGTTCAAA GCGAAGAGGC GGAGATCCAA
CCACGCAGCG ACGAAAAACG CATTCTGAGT AATGTTGCTG TACTGGAAGG TGCGCCGCCG
CTATCGGAAC ACTGGCAACT GTTCAACAAT AACGAAGTCC TGTTCAATGA AGCCCGTACC
GCTCAGGCGG CGACGGTGGT CTTTTCTTTA CAGCAAAATG CGCAAATCGA GCCACTGGCC
CGCAGCATTC ATACCCTGCG TCGCCAGCGC GGTAGTGCGA TGAAAATCCT CGTGCGGGAA
AATACCGCTA GCCTGCGCGC CACCGATGAA CGTTTGTTAT TGGCCTGCGG TGCAAATATG
GTTATTCCGT GGAATGCGCC ACTCTCCCGT TGTCTGACGA TGATCGAAAG CGTGCAAGGG
CAGAAGTTTA GTCGCTATGT GCCGGAAGAT ATCACTACCT TGCTGTCAAT GACCCAGCCG
CTCAAACTGC GTGGTTTCCA GAAGTGGGAT GTGTTCTGTA ATGCCGTCAA CAACATGATG
AATAACCCTC TATTACCTGC CCACGGTAAA GGCGTTCTGG TTGCCCTACG TCCGGTACCG
GGTATCCGCG TTGAACAAGC CCTGACGCTG TGTCGCCCTA ACCGTACCGG CGATATCATG
ACCATTGGCG GTAATCGGCT GGTGCTGTTT CTCTCATTCT GTCGGATTAA CGATCTGGAT
ACCGCGTTGA ATCATATTTT CCCATTGCCT ACTGGCGACA TTTTCTCAAA CCGTATGGTC
TGGTTTGAAG ATGATCAAAT CAGTGCCGAG CTGGTGCAGA TGCGCTTGCT TGCCCCAGAA
CAATGGGGCA TGCCGCTGCC TTTAACGCAA AGTTCTAAAC CGGTCATCAA TGCCGAGCAC
GATGGTCGCC ACTGGCGACG AATACCAGAA CCCATGCGAC TGTTAGATGA TGCTGTGGAG
CGCTCATCAT GA
 
Protein sequence
MRDIVDPVFS IGISSLWDEL RHMPAGGVWW FNVDRHEDAI SLANQTIASQ AETAHVAVIS 
MDSDPAKIFQ LDDSQGPEKI KLFSMLNHEK GLYYLTRDLQ CSIDPHNYLF ILVCANNAWQ
NIPAERLRSW LDKMNKWSRL NHCSLLVINP GNNNDKQFSL LLEEYRSLFG LASLRFQGDQ
HLLDIAFWCN EKGVSARQQL SVQQQNGIWT LVQSEEAEIQ PRSDEKRILS NVAVLEGAPP
LSEHWQLFNN NEVLFNEART AQAATVVFSL QQNAQIEPLA RSIHTLRRQR GSAMKILVRE
NTASLRATDE RLLLACGANM VIPWNAPLSR CLTMIESVQG QKFSRYVPED ITTLLSMTQP
LKLRGFQKWD VFCNAVNNMM NNPLLPAHGK GVLVALRPVP GIRVEQALTL CRPNRTGDIM
TIGGNRLVLF LSFCRINDLD TALNHIFPLP TGDIFSNRMV WFEDDQISAE LVQMRLLAPE
QWGMPLPLTQ SSKPVINAEH DGRHWRRIPE PMRLLDDAVE RSS