Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PG0043 |
Symbol | nahA |
ID | 2552216 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Porphyromonas gingivalis W83 |
Kingdom | Bacteria |
Replicon accession | NC_002950 |
Strand | + |
Start bp | 51206 |
End bp | 53539 |
Gene Length | 2334 bp |
Protein Length | 777 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637148858 |
Product | beta-hexosaminidase |
Protein accession | NP_904396 |
Protein GI | 34539917 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGAC TGACTTTCGG AGCATGCATT TGCTGCCTCC TGTCTCTTAT GGCCTGCTCA CAGAAAGCAA AGCAGGTGCA AATCCCCGAA TACGACAAGG GTATAAACAT CATTCCCTTG CCGATGCAGC TGACCGAATC GGACGACAGC TTTGAGGTCG ATGATAAGAC CACTATCTGC GTATCTGCCG AAGAGCTAAA GCCTATCGCT AAACTTCTTG CCGACAAGCT AAGAGCATCA GCCGACCTCT CTCTCCAGAT AGAGATAGGC GAGGAGCCTT CGGGGAATGC TATTTACATC GGTGTCGATA CGGCTCTTCC TCTTAAAGAA GAGGGTTATA TGCTCCGATC CGATAAGCGT GGTGTCAGTA TCATCGGCAA ATCTGCCCAT GGTGCTTTCT ACGGTATGCA GACTTTGCTC CAGCTCCTTC CTGCCGAAGT GGAATCTTCG AATGAGGTAC TGCTCCCCAT GACGGTGCCC GGCGTCGAGA TCAAGGACGA ACCGGCATTC GGCTATCGTG GCTTTATGCT GGATGTATGC CGTCATTTCC TTTCGGTGGA GGACATCAAG AAGCATATCG ACATCATGGC CATGTTCAAG ATCAATCGTT TCCATTGGCA CCTGACAGAG GATCAGGCAT GGCGTATCGA AATCAAGAAA TACCCACGAC TGACCGAAGT GGGGTCTACA AGGACGGAAG GGGACGGTAC GCAGTACTCC GGTTTCTACA CGCAGGAGCA AGTACGGGAT ATTGTACAAT ACGCATCGGA TCGTTTCATT ACGGTGATTC CCGAGATCGA AATGCCCGGA CATGCCATGG CTGCCCTCGC TGCTTATCCG CAGTTGGCTT GCTTCCCACG CGAATTCAAG CCACGGATTA TCTGGGGAGT GGAGCAGGAT GTTTATTGTG CCGGTAAGGA CAGCGTCTTC CGTTTTATCT CTGATGTTAT CGACGAGGTA GCACCCCTTT TCCCCGGCAC ATACTTCCAT ATCGGAGGGG ACGAATGCCC TAAAGATCGA TGGAAGGCTT GTTCGCTTTG TCAGAAGCGT ATGCGTGACA ATGGGTTGAA AGACGAACAC GAGCTGCAGA GTTATTTCAT CAAACAAGCT GAAAAGGTCT TACAAAAGCA CGGCAAGAGA CTGATCGGTT GGGATGAAAT CCTCGAAGGC GGGCTTGCAC CTTCTGCCAC CGTTATGAGC TGGCGTGGAG AGGATGGTGG CATCGCAGCG GCTAATATGA ATCACGATGT GATCATGACT CCGGGTAGCG GAGGTCTCTA CTTGGATCAT TATCAGGGAG ATCCGACCGT CGAGCCTGTT GCCATCGGAG GTTATGCTCC ATTGGAGCAA GTGTATGCTT ACAATCCTTT GCCGAAAGAA TTGCCGGCCG ATAAGCATCG CTACGTGCTC GGAGCACAGG CCAATCTGTG GGCAGAATAC CTCTATACTT CCGAACGATA CGACTATCAG GCCTATCCAA GGCTACTGGC TGTGGCAGAG CTTACCTGGA CACCGTTGGC CAAGAAAGAT TTTGCCGATT TCTGTCGCCG TTTGGATAAT GCCTGCGTTC GTCTGGACAT GCATGGTATC AATTACCACA TTCCGCTGCC CGAACAACCG GGTGGCTCTT CCGACTTTAT AGCCTTTACG GACAAGGCTA AGCTGACCTT CACGACATCG CGTCCGATGA AAATGGTCTA TACGCTGGAC GAAACCGAAC CGACCCTCAC ATCGACTCCT TACACGGTCC CTCTTGAATT TGCACAAACG GGCCTTCTGA AGATTCGTAC CGTCACGGCC GGTGGGAAGA TGAGTCCCGT ACGCCGCATT CGTGTGGAGA AACAACCCTT CAATATGTCA ATGGAAGTAC CGGCACCGAA ACCCGGACTG ACCATTCGTA CGGCTTACGG TGACTTATAT GATGTGCCTG ATCTGCAGCA GGTAGCCTCA TGGGAAGTAG GGACCGTTAG CTCTTTGGAG GAAATCATGC ACGGGAAAGA GAAGATAACT TCTCCTGAAG TACTGGAGCG CAGAGTTGTA GAGGCTACCG GTTATGTGCT TATTCCGGAG GATGGGGTAT ATGAGTTCTC TACGGAAAAC AACGAGTTTT GGATTGATAA TGTGAAGCTG ATCGACAATG TGGGCGAAGT AAAGAAATTC TCCCGTCGCA ATAGCAGTCG TGCCCTTCAG AAAGGCTACC ATCCGATCAA GACGATATGG GTCGGAGCCA TACAAGGTGG CTGGCCTACT TATTGGAACT ACAGCAGGGT AATGATACGG CTCAAGGGAG AAGAAAAGTT CAAGCCGATC TCGTCCGATA TGCTCTTTCA ATAA
|
Protein sequence | MKRLTFGACI CCLLSLMACS QKAKQVQIPE YDKGINIIPL PMQLTESDDS FEVDDKTTIC VSAEELKPIA KLLADKLRAS ADLSLQIEIG EEPSGNAIYI GVDTALPLKE EGYMLRSDKR GVSIIGKSAH GAFYGMQTLL QLLPAEVESS NEVLLPMTVP GVEIKDEPAF GYRGFMLDVC RHFLSVEDIK KHIDIMAMFK INRFHWHLTE DQAWRIEIKK YPRLTEVGST RTEGDGTQYS GFYTQEQVRD IVQYASDRFI TVIPEIEMPG HAMAALAAYP QLACFPREFK PRIIWGVEQD VYCAGKDSVF RFISDVIDEV APLFPGTYFH IGGDECPKDR WKACSLCQKR MRDNGLKDEH ELQSYFIKQA EKVLQKHGKR LIGWDEILEG GLAPSATVMS WRGEDGGIAA ANMNHDVIMT PGSGGLYLDH YQGDPTVEPV AIGGYAPLEQ VYAYNPLPKE LPADKHRYVL GAQANLWAEY LYTSERYDYQ AYPRLLAVAE LTWTPLAKKD FADFCRRLDN ACVRLDMHGI NYHIPLPEQP GGSSDFIAFT DKAKLTFTTS RPMKMVYTLD ETEPTLTSTP YTVPLEFAQT GLLKIRTVTA GGKMSPVRRI RVEKQPFNMS MEVPAPKPGL TIRTAYGDLY DVPDLQQVAS WEVGTVSSLE EIMHGKEKIT SPEVLERRVV EATGYVLIPE DGVYEFSTEN NEFWIDNVKL IDNVGEVKKF SRRNSSRALQ KGYHPIKTIW VGAIQGGWPT YWNYSRVMIR LKGEEKFKPI SSDMLFQ
|
| |