Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3840 |
Symbol | bcsZ |
ID | 6144897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3913693 |
End bp | 3914805 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618666 |
Product | endo-1,4-D-glucanase |
Protein accession | YP_001745806 |
Protein GI | 170683128 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.634821 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAGATGA ATGTGTTGCG TAGTGGACTC GTGACGATGC TGCTGCTGGC TGCCTTTAGT GTTCAGGCAG CCTGCAACTG GCCTGCCTGG GAGCAATTCA AAAAGGATTA CATCAGTCAG GAAGGGCGCG TCATTGACCC AAGTGACGCG CGTAAAATCA CCACTTCCGA AGGGCAAAGT TACGGCATGT TCTTTGCCCT GGCGGCTAAC GACCGTGCAG CTTTCGATAA TATTCTCGAC TGGACGCAGA ACAATCTCGC TCAGGGTTCT TTAAAAGAAC GTTTGCCCGC ATGGCTGTGG GGCAAGAAAG AGAACAGTAA GTGGGAAGTG CTGGACAGCA ATTCGGCCTC CGATGGTGAT GTCTGGATGG CCTGGTCGTT GCTGGAGGCG GGGCGTTTGT GGAAAGAGCA GCGTTATACC GACATCGGCA GCGCGTTGTT AAAACGTATT GCGCGGGAGG AAGTGGTGAC GGTGCCTGGG CTGGGTTCCA TGTTGTTGCC GGGCAAAGTG GGATTTGCAG AGGATAACAG CTGGCGTTTT AACCCCAGCT ACCTGCCGCC GACGCTGGCG CAGTATTTCA CCCGCTTTGG CGCGCCGTGG ACTACGCTGC GCGAAACCAA TCAACGTTTA TTGCTGGAAA CCGCCCCGAA AGGCTTTTCG CCAGACTGGG TGCGCTATGA GAAAGACAAA GGCTGGCAGC TAAAAGCCGA AAAAACATTG ATCAGCAGCT ACGACGCGAT CCGCGTTTAC ATGTGGGTAG GCATGATGCC TGACAGCGAT CCGCAAAAAG CGCGGATGCT CAACCGGTTT AAACCGATGG CGACATTCAC CGAGAAAAAC GGTTATCCGC CGGAAAAAGT GGATGTGGCT ACGGGGAAAG CACAGGGTAA AGGACCGGTC GGTTTTTCTG CCGCCATGCT GCCCTTTTTA CAAAACCGCG ATGCGCAGGC CGTTCAGCGC CAGCGCGTGG CCGATAACTT TCCCGGCAGC GATGCCTATT ACAACTATGT GCTGACCCTG TTTGGACAAG GCTGGGATCA ACACCGTTTC CGCTTCTCGA CAAAAGGTGA GTTATTACCT GACTGGGGCC AGGAATGCGC AAATTCACAC TAA
|
Protein sequence | MKMNVLRSGL VTMLLLAAFS VQAACNWPAW EQFKKDYISQ EGRVIDPSDA RKITTSEGQS YGMFFALAAN DRAAFDNILD WTQNNLAQGS LKERLPAWLW GKKENSKWEV LDSNSASDGD VWMAWSLLEA GRLWKEQRYT DIGSALLKRI AREEVVTVPG LGSMLLPGKV GFAEDNSWRF NPSYLPPTLA QYFTRFGAPW TTLRETNQRL LLETAPKGFS PDWVRYEKDK GWQLKAEKTL ISSYDAIRVY MWVGMMPDSD PQKARMLNRF KPMATFTEKN GYPPEKVDVA TGKAQGKGPV GFSAAMLPFL QNRDAQAVQR QRVADNFPGS DAYYNYVLTL FGQGWDQHRF RFSTKGELLP DWGQECANSH
|
| |