Gene PG0043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPG0043 
SymbolnahA 
ID2552216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePorphyromonas gingivalis W83 
KingdomBacteria 
Replicon accessionNC_002950 
Strand
Start bp51206 
End bp53539 
Gene Length2334 bp 
Protein Length777 aa 
Translation table11 
GC content51% 
IMG OID637148858 
Productbeta-hexosaminidase 
Protein accessionNP_904396 
Protein GI34539917 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGAC TGACTTTCGG AGCATGCATT TGCTGCCTCC TGTCTCTTAT GGCCTGCTCA 
CAGAAAGCAA AGCAGGTGCA AATCCCCGAA TACGACAAGG GTATAAACAT CATTCCCTTG
CCGATGCAGC TGACCGAATC GGACGACAGC TTTGAGGTCG ATGATAAGAC CACTATCTGC
GTATCTGCCG AAGAGCTAAA GCCTATCGCT AAACTTCTTG CCGACAAGCT AAGAGCATCA
GCCGACCTCT CTCTCCAGAT AGAGATAGGC GAGGAGCCTT CGGGGAATGC TATTTACATC
GGTGTCGATA CGGCTCTTCC TCTTAAAGAA GAGGGTTATA TGCTCCGATC CGATAAGCGT
GGTGTCAGTA TCATCGGCAA ATCTGCCCAT GGTGCTTTCT ACGGTATGCA GACTTTGCTC
CAGCTCCTTC CTGCCGAAGT GGAATCTTCG AATGAGGTAC TGCTCCCCAT GACGGTGCCC
GGCGTCGAGA TCAAGGACGA ACCGGCATTC GGCTATCGTG GCTTTATGCT GGATGTATGC
CGTCATTTCC TTTCGGTGGA GGACATCAAG AAGCATATCG ACATCATGGC CATGTTCAAG
ATCAATCGTT TCCATTGGCA CCTGACAGAG GATCAGGCAT GGCGTATCGA AATCAAGAAA
TACCCACGAC TGACCGAAGT GGGGTCTACA AGGACGGAAG GGGACGGTAC GCAGTACTCC
GGTTTCTACA CGCAGGAGCA AGTACGGGAT ATTGTACAAT ACGCATCGGA TCGTTTCATT
ACGGTGATTC CCGAGATCGA AATGCCCGGA CATGCCATGG CTGCCCTCGC TGCTTATCCG
CAGTTGGCTT GCTTCCCACG CGAATTCAAG CCACGGATTA TCTGGGGAGT GGAGCAGGAT
GTTTATTGTG CCGGTAAGGA CAGCGTCTTC CGTTTTATCT CTGATGTTAT CGACGAGGTA
GCACCCCTTT TCCCCGGCAC ATACTTCCAT ATCGGAGGGG ACGAATGCCC TAAAGATCGA
TGGAAGGCTT GTTCGCTTTG TCAGAAGCGT ATGCGTGACA ATGGGTTGAA AGACGAACAC
GAGCTGCAGA GTTATTTCAT CAAACAAGCT GAAAAGGTCT TACAAAAGCA CGGCAAGAGA
CTGATCGGTT GGGATGAAAT CCTCGAAGGC GGGCTTGCAC CTTCTGCCAC CGTTATGAGC
TGGCGTGGAG AGGATGGTGG CATCGCAGCG GCTAATATGA ATCACGATGT GATCATGACT
CCGGGTAGCG GAGGTCTCTA CTTGGATCAT TATCAGGGAG ATCCGACCGT CGAGCCTGTT
GCCATCGGAG GTTATGCTCC ATTGGAGCAA GTGTATGCTT ACAATCCTTT GCCGAAAGAA
TTGCCGGCCG ATAAGCATCG CTACGTGCTC GGAGCACAGG CCAATCTGTG GGCAGAATAC
CTCTATACTT CCGAACGATA CGACTATCAG GCCTATCCAA GGCTACTGGC TGTGGCAGAG
CTTACCTGGA CACCGTTGGC CAAGAAAGAT TTTGCCGATT TCTGTCGCCG TTTGGATAAT
GCCTGCGTTC GTCTGGACAT GCATGGTATC AATTACCACA TTCCGCTGCC CGAACAACCG
GGTGGCTCTT CCGACTTTAT AGCCTTTACG GACAAGGCTA AGCTGACCTT CACGACATCG
CGTCCGATGA AAATGGTCTA TACGCTGGAC GAAACCGAAC CGACCCTCAC ATCGACTCCT
TACACGGTCC CTCTTGAATT TGCACAAACG GGCCTTCTGA AGATTCGTAC CGTCACGGCC
GGTGGGAAGA TGAGTCCCGT ACGCCGCATT CGTGTGGAGA AACAACCCTT CAATATGTCA
ATGGAAGTAC CGGCACCGAA ACCCGGACTG ACCATTCGTA CGGCTTACGG TGACTTATAT
GATGTGCCTG ATCTGCAGCA GGTAGCCTCA TGGGAAGTAG GGACCGTTAG CTCTTTGGAG
GAAATCATGC ACGGGAAAGA GAAGATAACT TCTCCTGAAG TACTGGAGCG CAGAGTTGTA
GAGGCTACCG GTTATGTGCT TATTCCGGAG GATGGGGTAT ATGAGTTCTC TACGGAAAAC
AACGAGTTTT GGATTGATAA TGTGAAGCTG ATCGACAATG TGGGCGAAGT AAAGAAATTC
TCCCGTCGCA ATAGCAGTCG TGCCCTTCAG AAAGGCTACC ATCCGATCAA GACGATATGG
GTCGGAGCCA TACAAGGTGG CTGGCCTACT TATTGGAACT ACAGCAGGGT AATGATACGG
CTCAAGGGAG AAGAAAAGTT CAAGCCGATC TCGTCCGATA TGCTCTTTCA ATAA
 
Protein sequence
MKRLTFGACI CCLLSLMACS QKAKQVQIPE YDKGINIIPL PMQLTESDDS FEVDDKTTIC 
VSAEELKPIA KLLADKLRAS ADLSLQIEIG EEPSGNAIYI GVDTALPLKE EGYMLRSDKR
GVSIIGKSAH GAFYGMQTLL QLLPAEVESS NEVLLPMTVP GVEIKDEPAF GYRGFMLDVC
RHFLSVEDIK KHIDIMAMFK INRFHWHLTE DQAWRIEIKK YPRLTEVGST RTEGDGTQYS
GFYTQEQVRD IVQYASDRFI TVIPEIEMPG HAMAALAAYP QLACFPREFK PRIIWGVEQD
VYCAGKDSVF RFISDVIDEV APLFPGTYFH IGGDECPKDR WKACSLCQKR MRDNGLKDEH
ELQSYFIKQA EKVLQKHGKR LIGWDEILEG GLAPSATVMS WRGEDGGIAA ANMNHDVIMT
PGSGGLYLDH YQGDPTVEPV AIGGYAPLEQ VYAYNPLPKE LPADKHRYVL GAQANLWAEY
LYTSERYDYQ AYPRLLAVAE LTWTPLAKKD FADFCRRLDN ACVRLDMHGI NYHIPLPEQP
GGSSDFIAFT DKAKLTFTTS RPMKMVYTLD ETEPTLTSTP YTVPLEFAQT GLLKIRTVTA
GGKMSPVRRI RVEKQPFNMS MEVPAPKPGL TIRTAYGDLY DVPDLQQVAS WEVGTVSSLE
EIMHGKEKIT SPEVLERRVV EATGYVLIPE DGVYEFSTEN NEFWIDNVKL IDNVGEVKKF
SRRNSSRALQ KGYHPIKTIW VGAIQGGWPT YWNYSRVMIR LKGEEKFKPI SSDMLFQ