Gene SbBS512_E4000 details


Gene Information

Locus tag: SbBS512_E4000
Symbol: bcsE
ID: 6269876
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 3729398
End bp: 3730534
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 51%
IMG OID: 641727846
Product: cellulose biosynthesis protein BcsE
Protein accession: YP_001882278
Protein GI: 187732428
COG category:
COG ID:
TIGRFAM ID: [TIGR03369] cellulose biosynthesis protein BcsE
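
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of the database.

```python
# Values copied from the record above.
start_bp = 3729398
end_bp = 3730534


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # 1137 378
```

Under these assumptions the result agrees with the listed gene length (1137 bp) and protein length (378 aa).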


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value: 0.878993
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGTAATTA ATCCCGGAAA TAATAACGAT AAACAATTTT CATTGTTGCT TGAGGAATAC 
CGTTCACTTT TTGGTCTTGC CAGTTTGCGT TTTCAGGGTG ACCAACATTT GCTGGATATT
GCCTTCTGGT GCAACGAAAA AGGGGTCAGC GCCCGTCAGC AGCTTAGCGT TCAGCAACAA
AATGGTATCT GGACATTAGT TCAAAGCGAA GAGGCGGAGA TCCAACCACG CAGCGACGAA
AAACGCATTC TGAGTAATGT TGCTGTACTG GAAGGTGCGC CGCCGCTATC GGAACACTGG
CAACTGTTCA ACAATAACGA AGTCCTGTTC AATGAAGCCC GTACCGCTCA GGCGGCGACG
GTGGTCTTTT CTTTACAGCA AAATGCGCAA ATCGAGCCAC TGGCCCGCAG CATTCATACC
CTGCGTCGCC AGCGCGGTAG TGCGATGAAA ATCCTCGTGC GGGAAAATAC CGCTAGCCTG
CGCGCCACCG ATGAACGTTT GTTATTGGCC TGCGGTGCAA ATATGGTTAT TCCGTGGAAT
GCGCCACTCT CCCGTTGTCT GACGATGATC GAAAGCGTGC AAGGGCAGAA GTTTAGTCGC
TATGTGCCGG AAGATATCAC TACCTTGCTG TCAATGACCC AGCCGCTCAA ACTGCGTGGT
TTCCAGAAGT GGGATGTGTT CTGTAATGCC GTCAACAACA TGATGAATAA CCCTCTATTA
CCTGCCCACG GTAAAGGCGT TCTGGTTGCC CTACGTCCGG TACCGGGTAT CCGCGTTGAA
CAAGCCCTGA CGCTGTGTCG CCCTAACCGT ACCGGCGATA TCATGACCAT TGGCGGTAAT
CGGCTGGTGC TGTTTCTCTC ATTCTGTCGG ATTAACGATC TGGATACCGC GTTGAATCAT
ATTTTCCCAT TGCCTACTGG CGACATTTTC TCAAACCGTA TGGTCTGGTT TGAAGATGAT
CAAATCAGTG CCGAGCTGGT GCAGATGCGC CTGCTTGCCC CAGAACAATG GGGCATGCCG
CTGCCTTTAA CGCAAAGTTC TAAACCGGTC ATCAATGCCG AGCACGATGG TCGCCACTGG
CGACGAATAC CAGAACCCAT GCGACTGTTA GATGATGCTG TGGAGCGCTC ATCATGA
 
Protein sequence
MVINPGNNND KQFSLLLEEY RSLFGLASLR FQGDQHLLDI AFWCNEKGVS ARQQLSVQQQ 
NGIWTLVQSE EAEIQPRSDE KRILSNVAVL EGAPPLSEHW QLFNNNEVLF NEARTAQAAT
VVFSLQQNAQ IEPLARSIHT LRRQRGSAMK ILVRENTASL RATDERLLLA CGANMVIPWN
APLSRCLTMI ESVQGQKFSR YVPEDITTLL SMTQPLKLRG FQKWDVFCNA VNNMMNNPLL
PAHGKGVLVA LRPVPGIRVE QALTLCRPNR TGDIMTIGGN RLVLFLSFCR INDLDTALNH
IFPLPTGDIF SNRMVWFEDD QISAELVQMR LLAPEQWGMP LPLTQSSKPV INAEHDGRHW
RRIPEPMRLL DDAVERSS
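
The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, in which TTG is one of the permitted start codons and an initiating codon is read as methionine. The sketch below illustrates this on the first 30 bases of the gene sequence. It is an illustration only, not the database's own pipeline: `translate`, `CODON_TABLE`, and `START_CODONS` are hypothetical names, and the start-codon set is the one listed for NCBI table 11.

```python
from itertools import product

# Codon table in TCAG order. Table 11 assigns the same amino acids as
# the standard code; it differs only in which codons may initiate.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading a valid initiating codon as M."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("TTGGTAATTA ATCCCGGAAA TAATAACGAT"))  # MVINPGNNND
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should allow the remaining residues to be compared in the same way.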