Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2019 |
Symbol | nagZ |
ID | 6143482 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2041407 |
End bp | 2042432 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616895 |
Product | beta-hexosaminidase |
Protein accession | YP_001744071 |
Protein GI | 170683662 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.760849 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0000464707 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGGTCCAG TAATGTTGGA TGTCGAAGGT TACGAACTGG ACGCGGAAGA GCGTGAAATA CTGGCGCATC CGCTGGTGGG AGGGCTGATT CTCTTTACGC GTAACTATCA TGATCCTGCC CAGTTACGTG AACTGGTGCG CCAGATCCGC GCAGCTTCGC GCAATCATCT GGTGGTGGCG GTTGATCAGG AAGGTGGACG CGTGCAGCGT TTTCGTGAAG GTTTTACCCG CTTGCCAGCG GCACAATCAT TTGCTGCGCT GTTGGGAATG GAAGAAGGCG GAAAACTGGC GCAAGAGGCA GGTTGGTTGA TGGCCAGCGA AATGATTGCT ATGGATATTG ATATCAGCTT TGCGCCAGTG CTGGACGTCG GGCATATCAG CGCGGCGATT GGCGAGCGTT CTTATCATGC CGATCCCGAA AAAGCCCTGG CAATTGCCAG CCGGTTTATT GATGGTATGC ATGAAGCCGG AATGAAAACT ACCGGGAAAC ACTTCCCAGG TCACGGTGCA GTAACGGCAG ACTCACACAA AGAAACACCG TGCGACCCAC GTCCGCAAGC GGAGATTCGC GCTAAAGATA TGTCGGTCTT CAGTTCCTTA ATCCGCGAAA ATAAACTCGA CGCCATTATG CCTGCGCATG TGATCTACAG TGATGTTGAT CCGCGTCCGG CGAGCGGTTC TCCCTACTGG CTGAAAACCG TTTTGCGTCA GGAATTGGGT TTTGACGGCG TGATTTTCTC TGACGATTTA TCGATGGAAG GCGCAGCGAT TATGGGCAGT TATGCCGAGC GTGGACAGGC ATCACTGGAT GCAGGTTGCG ATATGATCCT GGTCTGCAAT AATCGTAAAG GGGCCGTCAG CGTGTTAGAT AATCTGTCAC CGATCAAGGC AGAACGTGTT ACACGTTTGT ATCATAAAGG TTCATTTTCG CGACAGGAAC TGATGGACTC GGCTCGCTGG AAAGCGATCA GCGCCCGTCT GAATCAGTTA CATGAACGCT GGCAGGAAGA GAAGGCAGGT CATTAA
|
Protein sequence | MGPVMLDVEG YELDAEEREI LAHPLVGGLI LFTRNYHDPA QLRELVRQIR AASRNHLVVA VDQEGGRVQR FREGFTRLPA AQSFAALLGM EEGGKLAQEA GWLMASEMIA MDIDISFAPV LDVGHISAAI GERSYHADPE KALAIASRFI DGMHEAGMKT TGKHFPGHGA VTADSHKETP CDPRPQAEIR AKDMSVFSSL IRENKLDAIM PAHVIYSDVD PRPASGSPYW LKTVLRQELG FDGVIFSDDL SMEGAAIMGS YAERGQASLD AGCDMILVCN NRKGAVSVLD NLSPIKAERV TRLYHKGSFS RQELMDSARW KAISARLNQL HERWQEEKAG H
|
| |