Gene Information |
Locus tag | PG0011 |
Symbol | |
ID | 2551515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Porphyromonas gingivalis W83 |
Kingdom | Bacteria |
Replicon accession | NC_002950 |
Strand | + |
Start bp | 11634 |
End bp | 14645 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637148831 |
Product | glycosyl hydrolase family protein |
Protein accession | NP_904370 |
Protein GI | 34539891 |
COG category | [G] Carbohydrate transport and metabolism [V] Defense mechanisms |
COG ID | [COG1472] Beta-glucosidase-related glycosidases [COG1680] Beta-lactamase class C and other penicillin binding proteins |
TIGRFAM ID | |
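
Note: the coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions fit the numbers in this record but are not stated by it. The function names are illustrative only.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length_bp: int) -> int:
    """Protein length for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record for locus tag PG0011.
gene_len = feature_length(11634, 14645)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)
```

With the values from this record, the computed lengths agree with the Gene Length (3012 bp) and Protein Length (1003 aa) fields.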
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.417282 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGAT TTCTTTTCTC CGCCATCACA ATACTCTCTC TCTCGGTGCT GTACACAGCT ACGGCACCGG CCCGCACGAA CATGGGTGCC ATGCATGCCC AGCAGTCCAA AGACTATCCC TTTCTCCTCT TCGGGGGAGT CGAGAGCAAA GATGTCAAAC GCTGGGTTGA CGACCGCATG AAGGCTATGA GCACGGAGGA AAAAGTAGGC CAGCTGCTCA TGCCGATCGT CTATCCCTCT TTACAGGAAG AAAAAGTAAA GCAAGCCGAA CAGCTGGTGC GCACCTGCCA CATCGGGGGC ATACTCTTCC AAAAGGGTAC ACTCTCGGAG CAATACACGA TGACTCGCCG CTTGCAGGAA GCAGCCGGCA CCCCTCTCCT CATAGCACTG GACGGTGAGT GGGGTTTGCA CATGCGTCTG AAAGATGCCC CACGCTTCCC TCGCAATATG GGCTTGGGAC ACCAAAAAGA CAATCAGCTC CTCTACAACT ATGGTCGGGA GGTAGCGCGC CAATGCCGGC TGATGGGGAT TCATATCAAT TTTGCTCCGG TGCTGGACGT GAACAACAAT CCGAAGAACC CTGTTATCGG CACGCGCAGC TTCGGCGACA ACCCACGCCG AGTAGCAGAA AGAGGGATTG CCTATGCACA AGGATTGGAG GACGGAGGAG TGATGGCCGT GGCCAAGCAT TTCCCCGGAC ACGGCAATAC CACAGAGGAC TCGCACAAGA CCTTGCCCAC GGTCTTTGCC TCCCGAGAGG AATTGGAGAA TACTGAATTG TTCCCCTTCA AGGAGTTTTT CCGAGCCGGC CTCAGCGGAG TGATGACCGC TCACCTCAAT GTTCCGGCTT TGGAAGCAAA GAAAAATACG CCCTCCTCCC TCAGTCATGC CATCTGCACC GATCTGCTTC GGCAGGAAAT GGGTTTCAAG GGGCTGATCT TTACGGACGG ACTGGCCATG CAGGGAGTAC AGACAGCCGG TTCTCAACCG ATCTCCGTCC GTGCCATATT GGCCGGCAAT GACATCCTCC TCGGTCCGGT GGACCCGGTC AAGACTTTCT CCGAGGTGCT GGCCGCAGTG GAAGACAGAA CGATAAGCAA AGAATTGCTG GACGAGAAAT GTCGCAAAAT TCTGGCCTTC AAGTATGCGC TCATCATTTG TAAGGGAATA TCCAAGGAGC TACCGGCAGA GGAAGTGGTA CGACAGGTAA ACAGCCGTGA AGCGGAACGG ATGTCGGAAG ATCTTTGGCA GGCATCCATT ACGATTCTCA AAAACAAGAA GCACTTCCTC CCCCTCTCGG AGGGAAACCG TATTGCATGC GTCAATCTAG ATGGTGCAGC CTCCAATACT TTTACACAAG AGCTGGGGCT TACGAGCAAG GACTGCTATT CCTATGCCAA GGGCAGCAAC AGTACACGTA CACAAGAGTT GCTCTCCAAA CTCAAGGGGT ACGATGCTGT GATCGTCACA GTTCGACATA CACAGCCGGA CTGGGCAGGA ACGTTTCTCC ACCGGCTGAC GGAACAAAAC CACACTGCCA TCGTCTTCTT CACATCGCCC TATGTAGCCG ATAGGATTCC GGCGGCTATG GACAAAGCTC GGGCTATAGT CGTGGCATAT GAAAACGTAA AGGAGGCTGC TCGAATGGCA GCCTACAAGA TTCGGGACAG AGTTGTGGTA TCCTCGGGCG GTGGGATCCC GGTCGTTCAG GAAGAGGAGG ACACTGATCC GACAGCGAAC ATGATGCCGC CGACCGAATT GTCGTCCGGC CTCATCCCGA TGCCGGCGGT AGACCGCATA 
GCGAAAGAAG CCCTAAGGCA AGGGGCTTTC CCCGGTTGTC GTATCCTCGC TGTCCATCGA GACAAGGTGG TATATGACAA AAGCTTCGGC ACTCTCGACG GATCGGCACG TGGAGGCAAG GTGTCTTCTT CCACCATCTA CGACTTGGCA TCCGTCACCA AGGTCGTAGC CACTACTCCG GCTGTGATGC TATTGGTTCA GGATGGAAAA TTGAAGCTCT CCGACCGGCT CGGAACGCTA TTGCCTCGTT TTGCCCGAAC AGACCTCAAA GACATCACCG TACAGCAGCT TCTACTTCAT GAAGCAGGCC TTCGGCCATC GATCAATTTC TACGAATCTC TGATAGACAG TAGCAGTTTG GACGGCAAGT TGCTCTCTCC CCGGCGGAGT TCGGGATGGG TTCGAGTAGA TACCAATATG TGGGGCAATC CATTCTTCGG ATTTCGCTCC GATTTGGTTT CCGGGCAATT TCGGCCGGAC TATCCGTTTC GTTTTTCTTC CAATCTGTAT CTATCGAAAG AGGTCAAAGA AATTGTCCTC AATACCATCG CTTCTACCCC TCGCAATGGA GTCGGACGCT ACAAGTATTC GGATCTGGGG TTTATCCTGC TACAGCAAAT TGTGGAAAAA GTATCGGGCA AGAGCTTGAA CGTCTTTGTC GAAGAACGTA TATTCCGCCC CATCGGAGCC GGTTCGCTCG GTTATCTTCC ACTCGATAAA TATTCCGTTT CGCGCATTGC ACCGGCACAG AACGACAAAT TTCTACGTAA AAGCATCGTC CGCGGTACCG TGGACGACGA AGCTGCTGCA TGCCTCGGAG GTATATCGGG CAATGCAGGT GTTTTCGGCA CTGCGGAGGA CGTTGCCCGT GTACTGGACA TGTTCATTCA TGAAGGCACT TACAAAGGCC ACAGAATCAT CGATCAAAAG ATCTTCCGAC TGTTCATAAC GACTCATGGC AAGGGCAACA GGCGTTGTCT CGGATTTGAC AAAGGCCGGG CGAGTATGGC AGAATCGGCG TCAGGCTCCA CCTACGGGCA CACAGGGTTT ACCGGCACAT GCGTTTGGGT CGATCCCGAG AACGAACTTA TCTTTGTATT CCTTTCCAAT CGGACTTACC CCAATCGGCT GAACAAGACA CTGATGACGG CAAGCATACG CCCCAGACTG CATCAAGCAA TATACGAGGC TTTGGGAATA GCGGAGCAAT GA
|
Protein sequence | MKRFLFSAIT ILSLSVLYTA TAPARTNMGA MHAQQSKDYP FLLFGGVESK DVKRWVDDRM KAMSTEEKVG QLLMPIVYPS LQEEKVKQAE QLVRTCHIGG ILFQKGTLSE QYTMTRRLQE AAGTPLLIAL DGEWGLHMRL KDAPRFPRNM GLGHQKDNQL LYNYGREVAR QCRLMGIHIN FAPVLDVNNN PKNPVIGTRS FGDNPRRVAE RGIAYAQGLE DGGVMAVAKH FPGHGNTTED SHKTLPTVFA SREELENTEL FPFKEFFRAG LSGVMTAHLN VPALEAKKNT PSSLSHAICT DLLRQEMGFK GLIFTDGLAM QGVQTAGSQP ISVRAILAGN DILLGPVDPV KTFSEVLAAV EDRTISKELL DEKCRKILAF KYALIICKGI SKELPAEEVV RQVNSREAER MSEDLWQASI TILKNKKHFL PLSEGNRIAC VNLDGAASNT FTQELGLTSK DCYSYAKGSN STRTQELLSK LKGYDAVIVT VRHTQPDWAG TFLHRLTEQN HTAIVFFTSP YVADRIPAAM DKARAIVVAY ENVKEAARMA AYKIRDRVVV SSGGGIPVVQ EEEDTDPTAN MMPPTELSSG LIPMPAVDRI AKEALRQGAF PGCRILAVHR DKVVYDKSFG TLDGSARGGK VSSSTIYDLA SVTKVVATTP AVMLLVQDGK LKLSDRLGTL LPRFARTDLK DITVQQLLLH EAGLRPSINF YESLIDSSSL DGKLLSPRRS SGWVRVDTNM WGNPFFGFRS DLVSGQFRPD YPFRFSSNLY LSKEVKEIVL NTIASTPRNG VGRYKYSDLG FILLQQIVEK VSGKSLNVFV EERIFRPIGA GSLGYLPLDK YSVSRIAPAQ NDKFLRKSIV RGTVDDEAAA CLGGISGNAG VFGTAEDVAR VLDMFIHEGT YKGHRIIDQK IFRLFITTHG KGNRRCLGFD KGRASMAESA SGSTYGHTGF TGTCVWVDPE NELIFVFLSN RTYPNRLNKT LMTASIRPRL HQAIYEALGI AEQ
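
Note: the sequences above are displayed in space-separated blocks of 10 residues. A minimal sketch of how such a field might be cleaned up, its GC content computed, and the nucleotide sequence translated is given below. It is not the database's own pipeline. It uses the amino acid assignments shared by the standard code and translation table 11, treats every codon (including the first) by plain table lookup, and stops at the first stop codon. Alternative start codons such as GTG or TTG, which table 11 allows, are not handled; this gene starts with ATG, so that does not matter here. All function names are illustrative.

```python
# Codon table built from the conventional TCAG ordering; the amino acid
# assignments are the same for the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the display."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    gc = sum(1 for base in seq if base in "GC")
    return 100 * gc / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
fragment = clean_sequence("ATGAAACGAT TTCTTTTCTC CGCCATCACA")
print(translate(fragment))   # MKRFLFSAIT
print(gc_percent(fragment))  # 40.0
```

The translated fragment matches the first block of the protein sequence field. The 40% GC value applies only to this 30-base fragment, not to the whole gene.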
|
| |