Gene Information |
Locus tag | EcE24377A_1229 |
Symbol | nagZ |
ID | 5590708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 1241526 |
End bp | 1242551 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640924928 |
Product | beta-hexosaminidase |
Protein accession | YP_001462340 |
Protein GI | 157157822 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
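The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The record does not state either assumption, but both agree with the reported values. The function names are illustrative and do not come from the source database.

```python
# Values copied from the Gene Information table.
start_bp = 1241526
end_bp = 1242551
reported_gene_length = 1026      # bp
reported_protein_length = 341    # aa


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Codons in the CDS minus one stop codon (assumed to be present)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_length = span_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)

print(gene_length == reported_gene_length)        # True
print(protein_length == reported_protein_length)  # True
```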
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.000622374 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGTCCAG TAATGTTGGA TGTCAAAGGT TACGAACTGG ACGCGGAAGA GCGTGAAATA CTTGCGCATC CGCTGGTGGG AGGGCTGATT CTCTTTACGC GTAACTATCA TGATCCTGCC CAGTTACGTG AACTGGTGCG CCAGATCCGC GCAGCTTCGC GCAATCATCT GGTGGTGGCG GTAGATCAGG AAGGTGGACG CGTGCAGCGT TTTCGTGAAG GTTTTACCCG CTTGCCAGCG GCGCAATCAT TCGCTGCGCT GTCAGGAATG GAAGAGGGCG GCAAACTGGC GCAAGAGGCG GGTTGGCTGA TGGCCAGCGA AATGATCGCT ATGGATATTG ATATCAGCTT TGCGCCAGTG CTGGATGTCG GGCATATCAG CGCGGCGATT GGCGAGCGTT CTTATCATGC CGATCCACAA AAAGCCCTGG CAATTGCCAG CCGGTTTATT GATGGTATGC ATGAAGCCGG AATGAAAACG ACCGGGAAAC ACTTCCCAGG ACACGGTGCA GTAACGGCAG ACTCACACAA AGAAACACCG TGCGATCCAC GTCCACAAGC GGAGATTCGC GCTAAAGATA TGTCGGTCTT CAGTTCCTTA ATCCGCGAAA ATAAACTCGA CGCCATTATG CCTGCGCATG TGATCTACAG TGATGTTGAT TCGCGTCCGG CGAGCGGTTC TCCCTACTGG CTGAAAACCG TTTTGCGTCA GGAACTGGGT TTTGACGGCG TGATTTTCTC TGACGATTTA TCGATGGAAG GTGCCGCGAT TATGGGCAGT TATGCCGAAC GCGGGCAGGC ATCACTGGAT GCGGGTTGCG ATATGATCCT GGTCTGCAAT AATCGTAAAG GGGCCGTCAG CGTGTTAGAT AATCTGTCAC CGATCAAGGC AGAACGTGTT ACACGTTTGT ATCATAAAGG TTCATTTTCG CGACAGGAAC TGATGGACTC GGCTCGCTGG AAAGCGATCA GCACCCGTCT GAATCAGTTA CACGAACGCT GGCAGGAAGA GAAAGCAGGT CACTAA
|
Protein sequence | MGPVMLDVKG YELDAEEREI LAHPLVGGLI LFTRNYHDPA QLRELVRQIR AASRNHLVVA VDQEGGRVQR FREGFTRLPA AQSFAALSGM EEGGKLAQEA GWLMASEMIA MDIDISFAPV LDVGHISAAI GERSYHADPQ KALAIASRFI DGMHEAGMKT TGKHFPGHGA VTADSHKETP CDPRPQAEIR AKDMSVFSSL IRENKLDAIM PAHVIYSDVD SRPASGSPYW LKTVLRQELG FDGVIFSDDL SMEGAAIMGS YAERGQASLD AGCDMILVCN NRKGAVSVLD NLSPIKAERV TRLYHKGSFS RQELMDSARW KAISTRLNQL HERWQEEKAG H
|
| |
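The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. The example below builds the codon table for translation table 11 (bacterial, archaeal and plant plastid code) from the standard NCBI ordering. It treats an alternative start codon such as GTG as methionine at the first position, which is why the protein begins with M although the gene begins with GTG. The helper names (`clean`, `gc_content`, `translate_cds`) are hypothetical and are not part of the source database. Only the first 30 bases of the record are used here; the full sequence can be pasted in the same way, spaces included.

```python
# Codon table for NCBI translation table 11, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons listed for table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate a CDS with table 11, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")   # alternative start codons initiate with Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":        # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "GTGGGTCCAG TAATGTTGGA TGTCAAAGGT"
print(translate_cds(first_blocks))   # MGPVMLDVKG
```

Calling `gc_content` on the full gene sequence returns a fraction that can be compared with the GC content field (53%) after rounding.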