Gene Information |
Locus tag | ECH74115_4896 |
Symbol | bcsZ |
ID | 6967104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4534370 |
End bp | 4535482 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643388584 |
Product | endo-1,4-D-glucanase |
Protein accession | YP_002273012 |
Protein GI | 209396242 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
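The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4534370
end_bp = 4535482

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon, which is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1113 0 370
```

Under these assumptions the result agrees with the reported Gene Length (1113 bp) and Protein Length (370 aa).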
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGAAGATGA ATGTGTTGCG TAGTGGACTC GTGACGATGC TGCTGTTGGC TGCCTTTAGT GTTCAGGCAG CCTGTACCTG GCCTGCCTGG GAGCAGTTTA AAAAGGATTA CATCAGTCAG GAAGGGCGCG TCATCGACCC CAGCGACGCG CGCAAAATCA CCACCTCTGA AGGGCAAAGT TACGGCATGT TCTTTGCCCT GGCGGCTAAC GACCGTGCAG CTTTCGATAA TATTCTCGAC TGGACGCAGA ACAATCTCGC TCAGGGTTCT TTAAAAGAAC GTTTGCCCGC CTGGCTGTGG GGCAAGAAAG AGAACAGTAA GTGGGAAGTG CTGGACAGCA ATTCGGCCTC CGATGGTGAT GTCTGGATGG CTTGGTCGTT GCTGGAGGCG GGGCGTTTGT GGAAAGAGCA GCGTTATACC GACATCGGCA GCGCGTTGCT AAAACGTATT GCGCGGGAGG AAGTGGTGAC GGTGCCTGGG CTGGGTTCCA TGTTGTTACC GGGCAAAGTG GGTTTTGCTG AGGATAACAG CTGGCGTTTT AACCCCAGCT ACCTGCCGCC GACGCTGGCG CAGTATTTCA CCCGCTTTGG CGCGCCGTGG ACTACGCTGC GCGAAACCAA TCAACGTTTA TTGCTGGAAA CCGCCCCGAA AGGCTTTTCG CCAGACTGGG TGCGCTATGA GAAAGACAAA GGCTGGCAGC TAAAAGCCGA AAAAACATTG ATCAGCAGCT ACGACGCTAT CCGCGTTTAC ATGTGGGTAG GCATGATGCC TGACAGCGAT CCGCAGAAAG CGCGGATGCT CAACCGGTTT AAACCGATGG CGACATTCAC TGAGAAAAAC GGTTATCCGC CGGAAAAAGT GGATGTGGCT ACGGGGAAAG CGCAGGGTAA AGGACCGGTC GGTTTTTCTG CCGCCATGCT GCCCTTTTTA CAAAACCGTG ATGCGCAGGC CGTTCAGCGC CAGCGCGTGG CCGATAACTT TCCCGGCAGC GATGCCTATT ACAACTATGT GCTGACCCTG TTTGGACAAG GCTGGGATCA ACACCGTTTC CGCTTCTCGA CAAAAGGTGA GTTATTACCT GACTGGGGCC AGGAATGCGC AAATTCACAC TAA
Protein sequence | MKMNVLRSGL VTMLLLAAFS VQAACTWPAW EQFKKDYISQ EGRVIDPSDA RKITTSEGQS YGMFFALAAN DRAAFDNILD WTQNNLAQGS LKERLPAWLW GKKENSKWEV LDSNSASDGD VWMAWSLLEA GRLWKEQRYT DIGSALLKRI AREEVVTVPG LGSMLLPGKV GFAEDNSWRF NPSYLPPTLA QYFTRFGAPW TTLRETNQRL LLETAPKGFS PDWVRYEKDK GWQLKAEKTL ISSYDAIRVY MWVGMMPDSD PQKARMLNRF KPMATFTEKN GYPPEKVDVA TGKAQGKGPV GFSAAMLPFL QNRDAQAVQR QRVADNFPGS DAYYNYVLTL FGQGWDQHRF RFSTKGELLP DWGQECANSH
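The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, in which TTG can act as an initiation codon and the initiator residue is read as methionine. The following is a minimal sketch of how the two sequence fields relate, not the database's own method. It uses only the first three 10-base blocks copied from the Gene sequence field; the helper names `clean`, `gc_fraction`, and `translate_cds` are hypothetical, and the start-codon set is deliberately limited to the three most common bacterial start codons.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments; it differs in its allowed start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Simplifying assumption: only the most common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate a coding sequence, reading a recognised start codon as M."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the Gene sequence field above.
prefix = "TTGAAGATGA ATGTGTTGCG TAGTGGACTC"
print(translate_cds(prefix))  # MKMNVLRSGL
print(gc_fraction(prefix))
```

The translated prefix matches the first block of the Protein sequence field. Passing the full Gene sequence field to the same functions would be one way to check it against the reported protein and the GC content value.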