Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3889 |
Symbol | bcsZ |
ID | 6484312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 3771971 |
End bp | 3773080 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642739153 |
Product | endo-1,4-D-glucanase |
Protein accession | YP_002042864 |
Protein GI | 194444186 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0151883 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGACTA TGCTGCGCGG ATGGATAACG ATGATCGTCA TGCTGACGGC AATAAATGCG CAGGCGGCCT GTAGCTGGCC TGCGTGGGAA CAGTTCAAGA AAGATTACAT TAGCCAGCAG GGACGCGTTA TCGATCCGGG CGATGCGCGA AAAATTACCA CCTCCGAAGG GCAAAGCTAC GCCATGTTCT TTGCCCTGGC GGCGAACGAT CGACCGGCGT TCGCGCAACT GTTTAACTGG ACGCAAAACA ATCTGGCGCA GGGATCGCTG CGTGAACATC TGCCCGCCTG GCTGTGGGGA CAAAAAGATC CCGACACCTG GTCGGTGCTG GACAGCAACT CCGCGTCCGA CGGCGATATC TGGATGGCAT GGTCGCTGCT GGAGGCCGGT CGCCTGTGGA AAGAGACGCG TTATACCGAG GTGGGCACGG CGTTGCTAAA ACGCATCGCC CGCGAAGAGG TCGTGAATGT GCCGGGACTG GGCTCAATGC TGCTACCTGG CAAAATCGGC TTTGCCGAGG CGAATAACTG GCGTTTTAAC CCAAGCTATC TGCCGCCGCA GTTGGCGCAA TACTTTAGCC GTTTTGGCGC GCCGTGGTCG ACGCTACGGG AAACCAATTT GCGGCTTTTG CTGGAAACCG CGCCGAAAGG TTTCTCGCCG GACTGGGTGC GTTATGAAAG CAAGCAAGGC TGGCAGTTGA AAGCGGAAAA GACGCTGATC AGTAGCTACG ATGCGATTCG CGTCTATTTA TGGGCGGGAA TGATGCATGA TGGCGATCCG CAAAAAGCGC GTTTACTGGC GCGATTTAAA CCGATGGCGA CGTTAACGAT GAAAAACGGC GTTCCACCGG AGAAAGTGGA TGTCGTCAGC GGGAATGCGC AAGGGACGGG GCCGGTCGGG TTTTCCGCCG CTTTACTGCC TTTCCTGCAA AATCGCGACG CCCAGGCCGT GCAGCGACAG CGGGTCGCAG ACCATTTTCC TGGCAGCGAT GCCTATTACA ACTATGTGCT GGCTCTCTTT GGACAAGGCT GGGATCAGCA CCGTTTTCGC TTCACCGTCA AAGGTGAATT ATTACCTGAC TGGGGCCAGG AATGCGTAAG TTCACGTTAA
|
Protein sequence | MMTMLRGWIT MIVMLTAINA QAACSWPAWE QFKKDYISQQ GRVIDPGDAR KITTSEGQSY AMFFALAAND RPAFAQLFNW TQNNLAQGSL REHLPAWLWG QKDPDTWSVL DSNSASDGDI WMAWSLLEAG RLWKETRYTE VGTALLKRIA REEVVNVPGL GSMLLPGKIG FAEANNWRFN PSYLPPQLAQ YFSRFGAPWS TLRETNLRLL LETAPKGFSP DWVRYESKQG WQLKAEKTLI SSYDAIRVYL WAGMMHDGDP QKARLLARFK PMATLTMKNG VPPEKVDVVS GNAQGTGPVG FSAALLPFLQ NRDAQAVQRQ RVADHFPGSD AYYNYVLALF GQGWDQHRFR FTVKGELLPD WGQECVSSR
|
| |