Gene ECH74115_4896 details


Gene Information

Locus tag: ECH74115_4896
Symbol: bcsZ
ID: 6967104
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4534370
End bp: 4535482
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 54%
IMG OID: 643388584
Product: endo-1,4-D-glucanase
Protein accession: YP_002273012
Protein GI: 209396242
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3405] Endoglucanase Y
TIGRFAM ID:
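
The start, end, and length fields of this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not state this) and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the gene record.
START_BP = 4534370
END_BP = 4535482
GENE_LENGTH_BP = 1113
PROTEIN_LENGTH_AA = 370


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Number of codons in an unspliced CDS, minus the terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 4535482 - 4534370 + 1 gives 1113 bp, and 1113 / 3 - 1 gives 370 aa, matching the reported lengths.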


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 72
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAAGATGA ATGTGTTGCG TAGTGGACTC GTGACGATGC TGCTGTTGGC TGCCTTTAGT 
GTTCAGGCAG CCTGTACCTG GCCTGCCTGG GAGCAGTTTA AAAAGGATTA CATCAGTCAG
GAAGGGCGCG TCATCGACCC CAGCGACGCG CGCAAAATCA CCACCTCTGA AGGGCAAAGT
TACGGCATGT TCTTTGCCCT GGCGGCTAAC GACCGTGCAG CTTTCGATAA TATTCTCGAC
TGGACGCAGA ACAATCTCGC TCAGGGTTCT TTAAAAGAAC GTTTGCCCGC CTGGCTGTGG
GGCAAGAAAG AGAACAGTAA GTGGGAAGTG CTGGACAGCA ATTCGGCCTC CGATGGTGAT
GTCTGGATGG CTTGGTCGTT GCTGGAGGCG GGGCGTTTGT GGAAAGAGCA GCGTTATACC
GACATCGGCA GCGCGTTGCT AAAACGTATT GCGCGGGAGG AAGTGGTGAC GGTGCCTGGG
CTGGGTTCCA TGTTGTTACC GGGCAAAGTG GGTTTTGCTG AGGATAACAG CTGGCGTTTT
AACCCCAGCT ACCTGCCGCC GACGCTGGCG CAGTATTTCA CCCGCTTTGG CGCGCCGTGG
ACTACGCTGC GCGAAACCAA TCAACGTTTA TTGCTGGAAA CCGCCCCGAA AGGCTTTTCG
CCAGACTGGG TGCGCTATGA GAAAGACAAA GGCTGGCAGC TAAAAGCCGA AAAAACATTG
ATCAGCAGCT ACGACGCTAT CCGCGTTTAC ATGTGGGTAG GCATGATGCC TGACAGCGAT
CCGCAGAAAG CGCGGATGCT CAACCGGTTT AAACCGATGG CGACATTCAC TGAGAAAAAC
GGTTATCCGC CGGAAAAAGT GGATGTGGCT ACGGGGAAAG CGCAGGGTAA AGGACCGGTC
GGTTTTTCTG CCGCCATGCT GCCCTTTTTA CAAAACCGTG ATGCGCAGGC CGTTCAGCGC
CAGCGCGTGG CCGATAACTT TCCCGGCAGC GATGCCTATT ACAACTATGT GCTGACCCTG
TTTGGACAAG GCTGGGATCA ACACCGTTTC CGCTTCTCGA CAAAAGGTGA GTTATTACCT
GACTGGGGCC AGGAATGCGC AAATTCACAC TAA
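
The record reports a GC content of 54% for this gene. As a minimal sketch of how such a figure could be recomputed from the sequence as displayed, the hypothetical helper below ignores the spaces and line breaks used for layout and counts only A, C, G, and T. How the database itself computes or rounds the value is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of `seq`.

    Whitespace and any other characters (e.g. the 10-base spacing used
    in the display) are skipped.
    """
    bases = [c for c in seq.upper() if c in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for c in bases if c in "GC")
    return 100.0 * gc / len(bases)
```

To use it, paste the gene sequence block into a string and call `gc_content(sequence_text)`, then compare the result with the reported percentage.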
 
Protein sequence
MKMNVLRSGL VTMLLLAAFS VQAACTWPAW EQFKKDYISQ EGRVIDPSDA RKITTSEGQS 
YGMFFALAAN DRAAFDNILD WTQNNLAQGS LKERLPAWLW GKKENSKWEV LDSNSASDGD
VWMAWSLLEA GRLWKEQRYT DIGSALLKRI AREEVVTVPG LGSMLLPGKV GFAEDNSWRF
NPSYLPPTLA QYFTRFGAPW TTLRETNQRL LLETAPKGFS PDWVRYEKDK GWQLKAEKTL
ISSYDAIRVY MWVGMMPDSD PQKARMLNRF KPMATFTEKN GYPPEKVDVA TGKAQGKGPV
GFSAAMLPFL QNRDAQAVQR QRVADNFPGS DAYYNYVLTL FGQGWDQHRF RFSTKGELLP
DWGQECANSH
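
The gene sequence begins with TTG rather than ATG, while the protein begins with M. This is consistent with translation table 11, in which TTG is one of the permitted alternative start codons and is read as methionine at the initiation position. The sketch below illustrates this using only the Python standard library. The codon table is built from the standard compact encoding, and the `translate` function is an illustrative helper, not the tool used to produce this record.

```python
from itertools import product

# Table 11 shares the standard codon assignments; bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(seq: str, is_cds_start: bool = True) -> str:
    """Translate a DNA string, stopping at the first stop codon.

    Whitespace is removed first. If `is_cds_start` is true and the first
    codon is a table 11 start codon, it is translated as M.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("TTGAAGATGA ATGTGTTGCG TAGTGGACTC"))  # MKMNVLRSGL
```

The first ten residues produced this way match the start of the protein sequence shown in the record. Passing `is_cds_start=False` translates an internal fragment without the start-codon substitution.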