Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PG2204 |
Symbol | |
ID | 2551841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Porphyromonas gingivalis W83 |
Kingdom | Bacteria |
Replicon accession | NC_002950 |
Strand | + |
Start bp | 2310521 |
End bp | 2316247 |
Gene Length | 5727 bp |
Protein Length | 1908 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637150766 |
Product | hypothetical protein |
Protein accession | NP_906246 |
Protein GI | 34541767 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAACA TACGAATTAT TTTCAGTCTA TTTCTTGCTT TGACTATCGG TCTTACTGCA ATGAATGCAC AGAATCCTTA TGCGACCTTG GAGAAGGAGC TGGAACAATA CCGTCAGGAA CGCCGTCCGC GCAAGGTGGC CGAGACGCTG AAGAAGATCA GGGTGATGGC CGAATCTCGC CGTGATGTCC CCATGCTCCT GCACAGCATG TTCCTCTATG ACGATCCGGA GAACGACATC AACGACTATC CTGAGGAGCC GTTGATGAAC GACCTCGACA AACTCATGCA GGCCTCATGG CTCAGGCCTG TGGATCGGGC CATGATTCTC TTCATGCGAC TGAGAATGTA CGTCCGATAC GGACAGTTCG GGCCTGCGTA TATGATGGAT GCAGTCCATG GCGATCCGAA AGACAATCGT CTGAGCCTAT GGAGCGAAAA ACAGTTCGAA GACGCTTTCC TGCGAGATTT CGATGTATTT CTCTCCTATC GCAAAGCCCT TCTGCAAGCC CGAACGGAGG ACTTCCGTCC TTTGTTCGAA TCTCAACCGA AAAAATGTGC TGCACCTCCC CGTTCACTCT ATCAGAGCTG TATATCCGTC CTTGCCCTTG CTGTCGGTAG CAGTTCGGAA GCTGACGGAA AGATTTCGCA TGCTCTCGAA CAGGAACTGC GCCAAGCAGC ATCTTATCTC TCTGACAGAC GTGAGCGGCT CGAGATGGAG ACCGACCGAT TGGACTACGA GAAGCGGTGT CGAAAGCTCG ATGATGCAGC TTCCTATCGT CGGATGGACG AGCTGATAGC AGCGTATTCG GATCTCCCCG ATGTAATCGG TATGGTGGAC AGACGAGTGA ATGCCTATTG TGATGAAGAG GAGTATGTAT CTGCTCTCGA CCTGTGCAAC AGCTATCTGT CCAAGTATCC GGAGGCTCCT CGTATCAACC TGCTGAAAAG CTCTCGGGCT TCGCGCATTT TGGCTTCTTT CGTCAAGTGC GACTATCCCC TCCGGCTGCA TACACGCCTC CCGAAGAGCA TGCGGATCGC ATCGCGCAAT GTCGGCTCCG TCCTCATCAG TCTGTACTGG CTGGATATGA ATGGCTCGCA AGCTGCTGAT ATAAGGCAGG AGGATCTGCA CTTGTATGCG AAAGGAACAC CTTTTTGGAC GGAAAGCATC CGGACACAGG GCCGGGCCGA TATGAAAATG GATTCGATCC GCTGTACACT TCCGGATCTG CCCAAGGGGG TCTATCTGGT GAAAGCCTCG GCAGAGCAGT GTGTACTCGA AAAGACCAAG CCGGTGCCCC AATACAGTCT GCTCCATGTA TCGGATCTCT TCTTCCTCGC TCAGCAGGCC ATGCCGGGTG TGCAGGTACT CGATGCGCTG ACGGGCAAGC CCGTGGAAGG CGCATCGGTG GATGCTTCCA TGCGTTATTC TATAGACCGA TCGGTTCATA CCCTTTCTCC TACGGATCGG TATGGTCGGA TCGATCTGCC GAAAAGCTAC AATCGCTTCT TCCCCTATAC CGACGGTGAC CGCTCGTATC CCGTTGTTTC TGCCTATACC TACAAACAGG ATCAGGTGGA TCTTTCCAAG AAGAATGAGC GCAAGTATAT CGTCTCCACG GATCGCGCCA TCTATCGTCC CGGGCAGAAA GTCCATTTCT TCGGCCAGTG CGACCGGATA GGTTATGCCG TGGAGGACGC ACGCGCCATC GGTGGATCGG AAGTTGAAGT CGTGCTGGAA GATGCTAACT CCAAGGAGAT CGGTCGGCTG CTATGCCAGG CAGATGAGAT GGGACGTTTC TCCGGTAGCT TCGACCTCCC GACAGGGATC TTGAATGGCC TGTTCGGACT ACGAGTGGGA GGGGATACAT ATCCCTTTAG TGTGGAAGAG TACAAGCGGC CTACATTCGA AGTCGGCCTC CGGAGTCCCG ACGCTGCATA TGCTATGGGC GACACCCTCC GCATACGAGG CGAAGCCAAG ACCTTTACCG GAATCGGTAT GCGAGGAGCT ACGGTGAACT ATCGGCTGAG CCTGACCCCC TATACGCGTC GGTGGTGGGG TAGACCTGTG GCCGATAGGG TCGTACAGAC CGGTGAGGCA GTGGTAGATG AATCGGGGTA CTTCGTCATA CCTGTTTCGC TGTCTCGGCC GGAAGGACGG GAAGACTATT CCTACTGCCT CTACACGCTA TCGGCGGATG TGACCGCCCC CGGTGGCGAG ACGCAGGCCG CCGTACTCCG AATACCCGTG GGAAAAGAAC CCAAGAGGGT GGACGTCGAA GTGGGTAAAT ACATTCGTGC CAACGAAGAC AACTGGCTTG CCACCTCTCT GCCGGATCTG ACATTCTCCC TTACCAATAC TTCCGGACAG ACGGTGGAGG GATCCATCGC CTACTTCCTG TGCGATGCCG ACAATGAACG TATCGGTCGG CTTTATACCG CAGCTTCAGG TGTGGTGACG CCTGCTCCGG CCGAATGGGG CAAGTTGCCT TCGGGACAGT ACCGATTGCG TTTTGGCGAG AGTGGCGGAG AGGATTCCGA CTTTGTTACG AAGGACGTTT ATCTCTTCCG TCCGAAAGAT CGTCAGTTGT CGAATCCCTC CCAAGGCCTT TGGACATATG TGGCGGAGGA GAAATACGAC AGTCACCGCC CTGCCCGCAT CCTGGTCGGT GCTACTCCCA AGGATGCGGA TGCTCAGGAG TTGTATCTGT TCTACGACCT GACTCAGGCA GGCCGATTCA TCGAAAGGAA GATGATCGCT TGCCGGCCGG GTGAGATCGT CGAGATAGTG CCGATGCTTC CGGCAGCTCC GTTGCCGGAG AGTATGAATG TTTCTCTCTA TTATGTATAT GGAGGGAGGC TGTATAGTCG AAGCATAGAT CTGGAACGCA GACTGCCCAA GCGAGAGATA AAGATGTCGT GGAGTACTTT CCGCGACAAG CTCCGACCTG GTGAGAAAGA GACATGGAAG CTCCGTCTCG CCGATGCGGA GGGCAAACCG CTCTCTCATA CCATGATGGC TGCTTGGATG TACGATGCTG CTCTTGATAA GATCGTGCAC AGTTCTGTTT TCTCCTCCCG ATTCTTCTCT TTTGCCGACT CTCCGGCTTC TTTGGTTAGG TTGTCTTTCC CGAATAGCGG GTATCGAGTG TCGGGGTTTG TGGAGGATTC TTACGTGCAA GTCGGGTTCA AACACCCTGT TTTCAAAACT CCCCAACTCT GCCTCTGGTC TGCTACCGGT TATAGATATG TGTCTTCGGA TTATTATTAT GATGGACCGA TGATATTTGT GGGATATGGA GTTCCGGAAG CCAAATCGGT CGATGCTTTC CCGATGGAGG AGAAAGCTGC TCGGGCAGAT GCGGTAAATG AAGCAGAACC CGAATCGAAG GAATCCGGGA TTCAAGAATC CTCCGTGCGT ACCAATTTCG CTGAGACTGC ATTCTTTGAG CCGGCATTGC TGACCGATGA ACGGGGCGAA GTTTCTTGGT CTTTCACTCT GCCCGAGACA CTGACGCGCT GGCACCTGCT CCTCTTTGCA CATACGAAGG ATATGCGGTT GGGCATGAAA GACGAAAGCG TGGAGGTGCG GAAAGACTTT ATGCTTACAC CCAATTTGCC ACGATTCCTG AGAATGGGTG ACAAGGGTAC GGCTTCGGCT TCGATTCGGA ACGGGAGCGA GACGATGCAG CAGGGCTTCG TCCGTATGGA ACTGTTCGAC CCTGCTACCG ACAAGCTATT GGGCGGCGAG AGGCTTCCGT TCTCGGTGGA AGCCGGTGGT ACGGTTACGG TGTCGTTTCC GCTCGATCCT GTCTCCGGAT ATGATGCTCT GGGTGTACGT TTGTTAGCCG AAAGCCATGA CTTCAGCGAT GGAGAGCAAC ACTCGATCGT GCAACTGCCG GCTACGGAGC GAGTGGTGGA GACCATACCG TTGATTCTCT ACGGCGGACA GTCCCAAACG GTGGATCTGG ATAGTCTATT TCCCCATCGC AGCGGCAGAC CGGCTATGGG TACAATGGCA CTTCAGGTGG TGAGCAATCC GCTTTGGGTG GCCGTACAGG CCTTGCCCGT GATGACTGAT GTACACGAGG AAGATGCCGT TTCGGTGGCT TCGGCTCTTT ATGCCAATAG TATTGCTGCT GCCCTTGTTT CGGGTCGCTT GCTGCATACT CCTGCTTTGG GTGAGAGTTT GCGGCACTAC TTGGCGCAGC CCTTGGATAC ACTCGCTTTA CAAAAATCGC CCTTGTCCTC GGACGAATTG CCTTGGCGCG CGGAGATGCT GGCCGAACAG AACAATCGTC AGCGTTTGCA GGCTCTCGTG GCTTCGGATC GTCTGACATG GACGGAGCAG ATGCTGACTG ACAAACTGAA AAGGCTGCAA AAGTCTGACG GTTCGTGGGC TTGGCATCCT GAGATGTCTT CCAACGATTA TCTGACGGAC TATGTGATGA CGATGCTGGT ACGTCTCTCC GCTCTTACCG CTGCACCGGA GCATAAGGAG TTTCAGGCGG TGAAGCGTAT GGGGTGGAAT GCTCTCGATG ATGCTGCCTC ACGACTGATG GATAGAATGA AGGAATACGA AAAGAAAACC AAGAGTAAGT ACAAAATACT GCCGGAACGA GCACTCAACT ATCTCTATCT GCTTACGATC GACAGTCGCA ACCCAAGCTC TCGTGGCCAT GCTGCCCGCA TGTATTTCCT CGATATTCTC GGCCGATCCC TTCCGCATAT TTCCATTTTG GATATGCCGC GAGCTGCCAT CGTACTCCAC GGTGTAGGGC GTAAGGCACT GGCCGATGAT TTCCTCCGTT CTATCCGTGA GCATATCATC CGCACACCGG ATCAGGGTGC GCACTTTGCG CTGCCTTCAG GCGGATATTA TTGGTGCGAC CGTCGCTATG GTATGCAGAC GGAAGCGATC GAGGCCTTCG CTCGTATCGG TACGTCTCAG GACAGCTCTC ATATCGAAGG CTTGCAGATG TGGCTCCTCA ATCAGAAAAG AACTCAGCTT TGGCCGAGTC TGCCGGCTTC TTCCGATGCG ATATATGCTC TCTTGTTAGG TGCAGGAAAA GATCGAATGG TAGAAGCCAC TGTGTCTGTC CGGGCTCCTG TACCTGCCTT GTCAGGTACA TTTGCCGGTG CAGGCGAAAG TCGGATCGTT ACGGTGGAAG ATCTTCCGCA AGGCAAACTA AGTGCAGAGC TTGCTCGTTC CGGAAAAGGA TTTGCTTGGG CTTCCTTCTT GGCGGAATAC GATGCGCCGG CGGCGGATCT CGTAGCTACG GGTAATGGCT TGTCGGTAGA GAAGCATCTT TTCTCCGAGC AGGTGATAGA TGGCAAGCTA ACGCTCGTAC CTCTCCGAGA GGGCGATCGG CTGGAAGTCG GTAAACGGCT CGTGACGCAA CTGACCATTA CGCTCGATCG CGATATGGAC TTCATTGTCC TGACGGACAA GCGAACGGCT GCGGTGGAAC CTATCGGCCA ACTTTCGGGT TATGATTATG CAGCCGGAAC GTTCTACTAC AGAGAAATAA AGGATAGCAG TACGCGTTTC TATTTCGATC GTCTGGTAAG AGGATCCTAC AAGCTCCAAT ACTCCACCGT TGTGGTTCGT TCGGGAGCTT ATGCCTCAGG TATCGCTACC GTGAGCAGTG CCTATGCACC GGAGTTTACC GGTCATACCG ATGGCGGCAG ACAGCTGCAA ACAGTTCCGG TCGCAAATAC TCAATAA
|
Protein sequence | MMNIRIIFSL FLALTIGLTA MNAQNPYATL EKELEQYRQE RRPRKVAETL KKIRVMAESR RDVPMLLHSM FLYDDPENDI NDYPEEPLMN DLDKLMQASW LRPVDRAMIL FMRLRMYVRY GQFGPAYMMD AVHGDPKDNR LSLWSEKQFE DAFLRDFDVF LSYRKALLQA RTEDFRPLFE SQPKKCAAPP RSLYQSCISV LALAVGSSSE ADGKISHALE QELRQAASYL SDRRERLEME TDRLDYEKRC RKLDDAASYR RMDELIAAYS DLPDVIGMVD RRVNAYCDEE EYVSALDLCN SYLSKYPEAP RINLLKSSRA SRILASFVKC DYPLRLHTRL PKSMRIASRN VGSVLISLYW LDMNGSQAAD IRQEDLHLYA KGTPFWTESI RTQGRADMKM DSIRCTLPDL PKGVYLVKAS AEQCVLEKTK PVPQYSLLHV SDLFFLAQQA MPGVQVLDAL TGKPVEGASV DASMRYSIDR SVHTLSPTDR YGRIDLPKSY NRFFPYTDGD RSYPVVSAYT YKQDQVDLSK KNERKYIVST DRAIYRPGQK VHFFGQCDRI GYAVEDARAI GGSEVEVVLE DANSKEIGRL LCQADEMGRF SGSFDLPTGI LNGLFGLRVG GDTYPFSVEE YKRPTFEVGL RSPDAAYAMG DTLRIRGEAK TFTGIGMRGA TVNYRLSLTP YTRRWWGRPV ADRVVQTGEA VVDESGYFVI PVSLSRPEGR EDYSYCLYTL SADVTAPGGE TQAAVLRIPV GKEPKRVDVE VGKYIRANED NWLATSLPDL TFSLTNTSGQ TVEGSIAYFL CDADNERIGR LYTAASGVVT PAPAEWGKLP SGQYRLRFGE SGGEDSDFVT KDVYLFRPKD RQLSNPSQGL WTYVAEEKYD SHRPARILVG ATPKDADAQE LYLFYDLTQA GRFIERKMIA CRPGEIVEIV PMLPAAPLPE SMNVSLYYVY GGRLYSRSID LERRLPKREI KMSWSTFRDK LRPGEKETWK LRLADAEGKP LSHTMMAAWM YDAALDKIVH SSVFSSRFFS FADSPASLVR LSFPNSGYRV SGFVEDSYVQ VGFKHPVFKT PQLCLWSATG YRYVSSDYYY DGPMIFVGYG VPEAKSVDAF PMEEKAARAD AVNEAEPESK ESGIQESSVR TNFAETAFFE PALLTDERGE VSWSFTLPET LTRWHLLLFA HTKDMRLGMK DESVEVRKDF MLTPNLPRFL RMGDKGTASA SIRNGSETMQ QGFVRMELFD PATDKLLGGE RLPFSVEAGG TVTVSFPLDP VSGYDALGVR LLAESHDFSD GEQHSIVQLP ATERVVETIP LILYGGQSQT VDLDSLFPHR SGRPAMGTMA LQVVSNPLWV AVQALPVMTD VHEEDAVSVA SALYANSIAA ALVSGRLLHT PALGESLRHY LAQPLDTLAL QKSPLSSDEL PWRAEMLAEQ NNRQRLQALV ASDRLTWTEQ MLTDKLKRLQ KSDGSWAWHP EMSSNDYLTD YVMTMLVRLS ALTAAPEHKE FQAVKRMGWN ALDDAASRLM DRMKEYEKKT KSKYKILPER ALNYLYLLTI DSRNPSSRGH AARMYFLDIL GRSLPHISIL DMPRAAIVLH GVGRKALADD FLRSIREHII RTPDQGAHFA LPSGGYYWCD RRYGMQTEAI EAFARIGTSQ DSSHIEGLQM WLLNQKRTQL WPSLPASSDA IYALLLGAGK DRMVEATVSV RAPVPALSGT FAGAGESRIV TVEDLPQGKL SAELARSGKG FAWASFLAEY DAPAADLVAT GNGLSVEKHL FSEQVIDGKL TLVPLREGDR LEVGKRLVTQ LTITLDRDMD FIVLTDKRTA AVEPIGQLSG YDYAAGTFYY REIKDSSTRF YFDRLVRGSY KLQYSTVVVR SGAYASGIAT VSSAYAPEFT GHTDGGRQLQ TVPVANTQ
|
| |