Gene PG2204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPG2204 
Symbol 
ID2551841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePorphyromonas gingivalis W83 
KingdomBacteria 
Replicon accessionNC_002950 
Strand
Start bp2310521 
End bp2316247 
Gene Length5727 bp 
Protein Length1908 aa 
Translation table11 
GC content54% 
IMG OID637150766 
Producthypothetical protein 
Protein accessionNP_906246 
Protein GI34541767 
COG category[R] General function prediction only 
COG ID[COG2373] Large extracellular alpha-helical protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAACA TACGAATTAT TTTCAGTCTA TTTCTTGCTT TGACTATCGG TCTTACTGCA 
ATGAATGCAC AGAATCCTTA TGCGACCTTG GAGAAGGAGC TGGAACAATA CCGTCAGGAA
CGCCGTCCGC GCAAGGTGGC CGAGACGCTG AAGAAGATCA GGGTGATGGC CGAATCTCGC
CGTGATGTCC CCATGCTCCT GCACAGCATG TTCCTCTATG ACGATCCGGA GAACGACATC
AACGACTATC CTGAGGAGCC GTTGATGAAC GACCTCGACA AACTCATGCA GGCCTCATGG
CTCAGGCCTG TGGATCGGGC CATGATTCTC TTCATGCGAC TGAGAATGTA CGTCCGATAC
GGACAGTTCG GGCCTGCGTA TATGATGGAT GCAGTCCATG GCGATCCGAA AGACAATCGT
CTGAGCCTAT GGAGCGAAAA ACAGTTCGAA GACGCTTTCC TGCGAGATTT CGATGTATTT
CTCTCCTATC GCAAAGCCCT TCTGCAAGCC CGAACGGAGG ACTTCCGTCC TTTGTTCGAA
TCTCAACCGA AAAAATGTGC TGCACCTCCC CGTTCACTCT ATCAGAGCTG TATATCCGTC
CTTGCCCTTG CTGTCGGTAG CAGTTCGGAA GCTGACGGAA AGATTTCGCA TGCTCTCGAA
CAGGAACTGC GCCAAGCAGC ATCTTATCTC TCTGACAGAC GTGAGCGGCT CGAGATGGAG
ACCGACCGAT TGGACTACGA GAAGCGGTGT CGAAAGCTCG ATGATGCAGC TTCCTATCGT
CGGATGGACG AGCTGATAGC AGCGTATTCG GATCTCCCCG ATGTAATCGG TATGGTGGAC
AGACGAGTGA ATGCCTATTG TGATGAAGAG GAGTATGTAT CTGCTCTCGA CCTGTGCAAC
AGCTATCTGT CCAAGTATCC GGAGGCTCCT CGTATCAACC TGCTGAAAAG CTCTCGGGCT
TCGCGCATTT TGGCTTCTTT CGTCAAGTGC GACTATCCCC TCCGGCTGCA TACACGCCTC
CCGAAGAGCA TGCGGATCGC ATCGCGCAAT GTCGGCTCCG TCCTCATCAG TCTGTACTGG
CTGGATATGA ATGGCTCGCA AGCTGCTGAT ATAAGGCAGG AGGATCTGCA CTTGTATGCG
AAAGGAACAC CTTTTTGGAC GGAAAGCATC CGGACACAGG GCCGGGCCGA TATGAAAATG
GATTCGATCC GCTGTACACT TCCGGATCTG CCCAAGGGGG TCTATCTGGT GAAAGCCTCG
GCAGAGCAGT GTGTACTCGA AAAGACCAAG CCGGTGCCCC AATACAGTCT GCTCCATGTA
TCGGATCTCT TCTTCCTCGC TCAGCAGGCC ATGCCGGGTG TGCAGGTACT CGATGCGCTG
ACGGGCAAGC CCGTGGAAGG CGCATCGGTG GATGCTTCCA TGCGTTATTC TATAGACCGA
TCGGTTCATA CCCTTTCTCC TACGGATCGG TATGGTCGGA TCGATCTGCC GAAAAGCTAC
AATCGCTTCT TCCCCTATAC CGACGGTGAC CGCTCGTATC CCGTTGTTTC TGCCTATACC
TACAAACAGG ATCAGGTGGA TCTTTCCAAG AAGAATGAGC GCAAGTATAT CGTCTCCACG
GATCGCGCCA TCTATCGTCC CGGGCAGAAA GTCCATTTCT TCGGCCAGTG CGACCGGATA
GGTTATGCCG TGGAGGACGC ACGCGCCATC GGTGGATCGG AAGTTGAAGT CGTGCTGGAA
GATGCTAACT CCAAGGAGAT CGGTCGGCTG CTATGCCAGG CAGATGAGAT GGGACGTTTC
TCCGGTAGCT TCGACCTCCC GACAGGGATC TTGAATGGCC TGTTCGGACT ACGAGTGGGA
GGGGATACAT ATCCCTTTAG TGTGGAAGAG TACAAGCGGC CTACATTCGA AGTCGGCCTC
CGGAGTCCCG ACGCTGCATA TGCTATGGGC GACACCCTCC GCATACGAGG CGAAGCCAAG
ACCTTTACCG GAATCGGTAT GCGAGGAGCT ACGGTGAACT ATCGGCTGAG CCTGACCCCC
TATACGCGTC GGTGGTGGGG TAGACCTGTG GCCGATAGGG TCGTACAGAC CGGTGAGGCA
GTGGTAGATG AATCGGGGTA CTTCGTCATA CCTGTTTCGC TGTCTCGGCC GGAAGGACGG
GAAGACTATT CCTACTGCCT CTACACGCTA TCGGCGGATG TGACCGCCCC CGGTGGCGAG
ACGCAGGCCG CCGTACTCCG AATACCCGTG GGAAAAGAAC CCAAGAGGGT GGACGTCGAA
GTGGGTAAAT ACATTCGTGC CAACGAAGAC AACTGGCTTG CCACCTCTCT GCCGGATCTG
ACATTCTCCC TTACCAATAC TTCCGGACAG ACGGTGGAGG GATCCATCGC CTACTTCCTG
TGCGATGCCG ACAATGAACG TATCGGTCGG CTTTATACCG CAGCTTCAGG TGTGGTGACG
CCTGCTCCGG CCGAATGGGG CAAGTTGCCT TCGGGACAGT ACCGATTGCG TTTTGGCGAG
AGTGGCGGAG AGGATTCCGA CTTTGTTACG AAGGACGTTT ATCTCTTCCG TCCGAAAGAT
CGTCAGTTGT CGAATCCCTC CCAAGGCCTT TGGACATATG TGGCGGAGGA GAAATACGAC
AGTCACCGCC CTGCCCGCAT CCTGGTCGGT GCTACTCCCA AGGATGCGGA TGCTCAGGAG
TTGTATCTGT TCTACGACCT GACTCAGGCA GGCCGATTCA TCGAAAGGAA GATGATCGCT
TGCCGGCCGG GTGAGATCGT CGAGATAGTG CCGATGCTTC CGGCAGCTCC GTTGCCGGAG
AGTATGAATG TTTCTCTCTA TTATGTATAT GGAGGGAGGC TGTATAGTCG AAGCATAGAT
CTGGAACGCA GACTGCCCAA GCGAGAGATA AAGATGTCGT GGAGTACTTT CCGCGACAAG
CTCCGACCTG GTGAGAAAGA GACATGGAAG CTCCGTCTCG CCGATGCGGA GGGCAAACCG
CTCTCTCATA CCATGATGGC TGCTTGGATG TACGATGCTG CTCTTGATAA GATCGTGCAC
AGTTCTGTTT TCTCCTCCCG ATTCTTCTCT TTTGCCGACT CTCCGGCTTC TTTGGTTAGG
TTGTCTTTCC CGAATAGCGG GTATCGAGTG TCGGGGTTTG TGGAGGATTC TTACGTGCAA
GTCGGGTTCA AACACCCTGT TTTCAAAACT CCCCAACTCT GCCTCTGGTC TGCTACCGGT
TATAGATATG TGTCTTCGGA TTATTATTAT GATGGACCGA TGATATTTGT GGGATATGGA
GTTCCGGAAG CCAAATCGGT CGATGCTTTC CCGATGGAGG AGAAAGCTGC TCGGGCAGAT
GCGGTAAATG AAGCAGAACC CGAATCGAAG GAATCCGGGA TTCAAGAATC CTCCGTGCGT
ACCAATTTCG CTGAGACTGC ATTCTTTGAG CCGGCATTGC TGACCGATGA ACGGGGCGAA
GTTTCTTGGT CTTTCACTCT GCCCGAGACA CTGACGCGCT GGCACCTGCT CCTCTTTGCA
CATACGAAGG ATATGCGGTT GGGCATGAAA GACGAAAGCG TGGAGGTGCG GAAAGACTTT
ATGCTTACAC CCAATTTGCC ACGATTCCTG AGAATGGGTG ACAAGGGTAC GGCTTCGGCT
TCGATTCGGA ACGGGAGCGA GACGATGCAG CAGGGCTTCG TCCGTATGGA ACTGTTCGAC
CCTGCTACCG ACAAGCTATT GGGCGGCGAG AGGCTTCCGT TCTCGGTGGA AGCCGGTGGT
ACGGTTACGG TGTCGTTTCC GCTCGATCCT GTCTCCGGAT ATGATGCTCT GGGTGTACGT
TTGTTAGCCG AAAGCCATGA CTTCAGCGAT GGAGAGCAAC ACTCGATCGT GCAACTGCCG
GCTACGGAGC GAGTGGTGGA GACCATACCG TTGATTCTCT ACGGCGGACA GTCCCAAACG
GTGGATCTGG ATAGTCTATT TCCCCATCGC AGCGGCAGAC CGGCTATGGG TACAATGGCA
CTTCAGGTGG TGAGCAATCC GCTTTGGGTG GCCGTACAGG CCTTGCCCGT GATGACTGAT
GTACACGAGG AAGATGCCGT TTCGGTGGCT TCGGCTCTTT ATGCCAATAG TATTGCTGCT
GCCCTTGTTT CGGGTCGCTT GCTGCATACT CCTGCTTTGG GTGAGAGTTT GCGGCACTAC
TTGGCGCAGC CCTTGGATAC ACTCGCTTTA CAAAAATCGC CCTTGTCCTC GGACGAATTG
CCTTGGCGCG CGGAGATGCT GGCCGAACAG AACAATCGTC AGCGTTTGCA GGCTCTCGTG
GCTTCGGATC GTCTGACATG GACGGAGCAG ATGCTGACTG ACAAACTGAA AAGGCTGCAA
AAGTCTGACG GTTCGTGGGC TTGGCATCCT GAGATGTCTT CCAACGATTA TCTGACGGAC
TATGTGATGA CGATGCTGGT ACGTCTCTCC GCTCTTACCG CTGCACCGGA GCATAAGGAG
TTTCAGGCGG TGAAGCGTAT GGGGTGGAAT GCTCTCGATG ATGCTGCCTC ACGACTGATG
GATAGAATGA AGGAATACGA AAAGAAAACC AAGAGTAAGT ACAAAATACT GCCGGAACGA
GCACTCAACT ATCTCTATCT GCTTACGATC GACAGTCGCA ACCCAAGCTC TCGTGGCCAT
GCTGCCCGCA TGTATTTCCT CGATATTCTC GGCCGATCCC TTCCGCATAT TTCCATTTTG
GATATGCCGC GAGCTGCCAT CGTACTCCAC GGTGTAGGGC GTAAGGCACT GGCCGATGAT
TTCCTCCGTT CTATCCGTGA GCATATCATC CGCACACCGG ATCAGGGTGC GCACTTTGCG
CTGCCTTCAG GCGGATATTA TTGGTGCGAC CGTCGCTATG GTATGCAGAC GGAAGCGATC
GAGGCCTTCG CTCGTATCGG TACGTCTCAG GACAGCTCTC ATATCGAAGG CTTGCAGATG
TGGCTCCTCA ATCAGAAAAG AACTCAGCTT TGGCCGAGTC TGCCGGCTTC TTCCGATGCG
ATATATGCTC TCTTGTTAGG TGCAGGAAAA GATCGAATGG TAGAAGCCAC TGTGTCTGTC
CGGGCTCCTG TACCTGCCTT GTCAGGTACA TTTGCCGGTG CAGGCGAAAG TCGGATCGTT
ACGGTGGAAG ATCTTCCGCA AGGCAAACTA AGTGCAGAGC TTGCTCGTTC CGGAAAAGGA
TTTGCTTGGG CTTCCTTCTT GGCGGAATAC GATGCGCCGG CGGCGGATCT CGTAGCTACG
GGTAATGGCT TGTCGGTAGA GAAGCATCTT TTCTCCGAGC AGGTGATAGA TGGCAAGCTA
ACGCTCGTAC CTCTCCGAGA GGGCGATCGG CTGGAAGTCG GTAAACGGCT CGTGACGCAA
CTGACCATTA CGCTCGATCG CGATATGGAC TTCATTGTCC TGACGGACAA GCGAACGGCT
GCGGTGGAAC CTATCGGCCA ACTTTCGGGT TATGATTATG CAGCCGGAAC GTTCTACTAC
AGAGAAATAA AGGATAGCAG TACGCGTTTC TATTTCGATC GTCTGGTAAG AGGATCCTAC
AAGCTCCAAT ACTCCACCGT TGTGGTTCGT TCGGGAGCTT ATGCCTCAGG TATCGCTACC
GTGAGCAGTG CCTATGCACC GGAGTTTACC GGTCATACCG ATGGCGGCAG ACAGCTGCAA
ACAGTTCCGG TCGCAAATAC TCAATAA
 
Protein sequence
MMNIRIIFSL FLALTIGLTA MNAQNPYATL EKELEQYRQE RRPRKVAETL KKIRVMAESR 
RDVPMLLHSM FLYDDPENDI NDYPEEPLMN DLDKLMQASW LRPVDRAMIL FMRLRMYVRY
GQFGPAYMMD AVHGDPKDNR LSLWSEKQFE DAFLRDFDVF LSYRKALLQA RTEDFRPLFE
SQPKKCAAPP RSLYQSCISV LALAVGSSSE ADGKISHALE QELRQAASYL SDRRERLEME
TDRLDYEKRC RKLDDAASYR RMDELIAAYS DLPDVIGMVD RRVNAYCDEE EYVSALDLCN
SYLSKYPEAP RINLLKSSRA SRILASFVKC DYPLRLHTRL PKSMRIASRN VGSVLISLYW
LDMNGSQAAD IRQEDLHLYA KGTPFWTESI RTQGRADMKM DSIRCTLPDL PKGVYLVKAS
AEQCVLEKTK PVPQYSLLHV SDLFFLAQQA MPGVQVLDAL TGKPVEGASV DASMRYSIDR
SVHTLSPTDR YGRIDLPKSY NRFFPYTDGD RSYPVVSAYT YKQDQVDLSK KNERKYIVST
DRAIYRPGQK VHFFGQCDRI GYAVEDARAI GGSEVEVVLE DANSKEIGRL LCQADEMGRF
SGSFDLPTGI LNGLFGLRVG GDTYPFSVEE YKRPTFEVGL RSPDAAYAMG DTLRIRGEAK
TFTGIGMRGA TVNYRLSLTP YTRRWWGRPV ADRVVQTGEA VVDESGYFVI PVSLSRPEGR
EDYSYCLYTL SADVTAPGGE TQAAVLRIPV GKEPKRVDVE VGKYIRANED NWLATSLPDL
TFSLTNTSGQ TVEGSIAYFL CDADNERIGR LYTAASGVVT PAPAEWGKLP SGQYRLRFGE
SGGEDSDFVT KDVYLFRPKD RQLSNPSQGL WTYVAEEKYD SHRPARILVG ATPKDADAQE
LYLFYDLTQA GRFIERKMIA CRPGEIVEIV PMLPAAPLPE SMNVSLYYVY GGRLYSRSID
LERRLPKREI KMSWSTFRDK LRPGEKETWK LRLADAEGKP LSHTMMAAWM YDAALDKIVH
SSVFSSRFFS FADSPASLVR LSFPNSGYRV SGFVEDSYVQ VGFKHPVFKT PQLCLWSATG
YRYVSSDYYY DGPMIFVGYG VPEAKSVDAF PMEEKAARAD AVNEAEPESK ESGIQESSVR
TNFAETAFFE PALLTDERGE VSWSFTLPET LTRWHLLLFA HTKDMRLGMK DESVEVRKDF
MLTPNLPRFL RMGDKGTASA SIRNGSETMQ QGFVRMELFD PATDKLLGGE RLPFSVEAGG
TVTVSFPLDP VSGYDALGVR LLAESHDFSD GEQHSIVQLP ATERVVETIP LILYGGQSQT
VDLDSLFPHR SGRPAMGTMA LQVVSNPLWV AVQALPVMTD VHEEDAVSVA SALYANSIAA
ALVSGRLLHT PALGESLRHY LAQPLDTLAL QKSPLSSDEL PWRAEMLAEQ NNRQRLQALV
ASDRLTWTEQ MLTDKLKRLQ KSDGSWAWHP EMSSNDYLTD YVMTMLVRLS ALTAAPEHKE
FQAVKRMGWN ALDDAASRLM DRMKEYEKKT KSKYKILPER ALNYLYLLTI DSRNPSSRGH
AARMYFLDIL GRSLPHISIL DMPRAAIVLH GVGRKALADD FLRSIREHII RTPDQGAHFA
LPSGGYYWCD RRYGMQTEAI EAFARIGTSQ DSSHIEGLQM WLLNQKRTQL WPSLPASSDA
IYALLLGAGK DRMVEATVSV RAPVPALSGT FAGAGESRIV TVEDLPQGKL SAELARSGKG
FAWASFLAEY DAPAADLVAT GNGLSVEKHL FSEQVIDGKL TLVPLREGDR LEVGKRLVTQ
LTITLDRDMD FIVLTDKRTA AVEPIGQLSG YDYAAGTFYY REIKDSSTRF YFDRLVRGSY
KLQYSTVVVR SGAYASGIAT VSSAYAPEFT GHTDGGRQLQ TVPVANTQ