Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1230 |
Symbol | nagZ |
ID | 5592036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1227775 |
End bp | 1228800 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640920390 |
Product | beta-hexosaminidase |
Protein accession | YP_001457952 |
Protein GI | 157160634 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 0.644822 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGTCCAG TAATGTTGGA TGTCGAAGGT TACGAACTGG ACGCGGAAGA GCGTGAAATA CTGGCGCATC CGCTGGTGGG AGGGCTGATT CTCTTTACGC GTAACTATCA TGATCCTGCC CAGTTACGTG AACTGGTGCG CCAGATCCGC GCAGCTTCGC GCAATCATCT GGTGGTGGCG GTTGATCAGG AAGGTGGACG CGTGCAGCGT TTTCGTGAAG GTTTTACCCG CTTGCCAGCG GCGCAATCAT TCGCTGCGCT GTCAGGAATG GAAGAGGGTG GCAAACTGGC GCAGGAGGCA GGTTGGTTGA TGGCCAGCGA AATGATCGCT ATGGATATTG ATATCAGCTT TGCGCCTGTG CTGGATGTCG GGCATATCAG CGCGGCGATT GGCGAGCGTT CTTATCATGC CGATCCACAA AAAGCCCTGG CAATTGCCAG CCGGTTTATT GATGGTATGC ATGAAGCCGG AATGAAAACG ACCGGGAAAC ACTTCCCAGG ACACGGTGCA GTAACGGCAG ACTCACACAA AGAAACACCG TGCGATCCAC GTCCACAAGC GGAGATTCGC GCTAAAGATA TGTCGGTCTT CAGTTCCTTA ATCCGCGAAA ATAAACTCGA CGCCATTATG CCTGCGCATG TGATCTACAG TGATGTTGAT CCGCGTCCGG CGAGCGGTTC TCCCTACTGG CTGAAAACCG TTTTGCGTCA GGAACTGGGT TTTGACGGCG TGATTTTCTC TGACGATTTA TCGATGGAAG GTGCCGTGAT TATGGGCAGT TATGCCGAAC GCGGGCAGGC ATCACTGGAT GCGGGTTGCG ATATGATCCT GGTCTGCAAT AATCGTAAAG GGGCCGTCAG CGTGTTAGAT AATCTGTCAC CGATCAAGGC AGAACGTGTT ACACGTTTGT ATCATAAAGG TTCATTTTCG CGACAGGAAC TGATGGACTC GGCTCGCTGG AAAGCGATCA GCGCCCGTCT GAATCAGTTA CATGAACGCT GGCAGGAAGA GAAAGCAGGT CACTAA
|
Protein sequence | MGPVMLDVEG YELDAEEREI LAHPLVGGLI LFTRNYHDPA QLRELVRQIR AASRNHLVVA VDQEGGRVQR FREGFTRLPA AQSFAALSGM EEGGKLAQEA GWLMASEMIA MDIDISFAPV LDVGHISAAI GERSYHADPQ KALAIASRFI DGMHEAGMKT TGKHFPGHGA VTADSHKETP CDPRPQAEIR AKDMSVFSSL IRENKLDAIM PAHVIYSDVD PRPASGSPYW LKTVLRQELG FDGVIFSDDL SMEGAVIMGS YAERGQASLD AGCDMILVCN NRKGAVSVLD NLSPIKAERV TRLYHKGSFS RQELMDSARW KAISARLNQL HERWQEEKAG H
|
| |