Gene PG1651 details

Gene Information

Locus tag: PG1651
Symbol:
ID: 2552718
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Porphyromonas gingivalis W83
Kingdom: Bacteria
Replicon accession: NC_002950
Strand:
Start bp: 1730466
End bp: 1733453
Gene Length: 2988 bp
Protein Length: 995 aa
Translation table: 11
GC content: 52%
IMG OID: 637150271
Product: TPR domain-containing protein
Protein accession: NP_905771
Protein GI: 34541292
COG category: [S] Function unknown
COG ID: [COG1729] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
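
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record. The function name `check_cds_lengths` is hypothetical and used only for illustration.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Check that the coordinate, gene length and protein length fields agree.

    Assumes 1-based inclusive coordinates and one untranslated stop codon.
    """
    span_bp = end_bp - start_bp + 1  # inclusive span on the replicon
    return {
        "span_matches": span_bp == gene_length_bp,
        "whole_codons": gene_length_bp % 3 == 0,
        # One codon is the stop codon, so it adds no residue to the protein.
        "protein_matches": gene_length_bp // 3 - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields of this record.
result = check_cds_lengths(1730466, 1733453, 2988, 995)
print(result)
```

Under these assumptions all three checks hold for this record: 1733453 - 1730466 + 1 = 2988 bp, and 2988 / 3 - 1 = 995 aa.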


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.874566
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAC TGATATTATC GGCTCTTGTG CTGGGTGGAG CAATGACTTT CACGGCACAA 
GCCCAAAATA AAGCGGCAGA GCCCGCCGAA GAGTTTTTCC TCGAAGGGAG AGCGATGTTT
ATCCGCCAAA ATTATGTGGG TGCCTTAGAC CAATTCCGTC TCTATCGCTT GCACGTAGGG
CAAGAGCACG CTTTGGAGGT GGATTATTAT AGTATCATCG CGGACTATCT ATTGGGAGCG
GATCACGATT ACGTCCTCAG CGCGGTAGGC AATTTCTTGG ACAATTTCCC CGAATCGATC
GACAAATCCC GTGTGCGACT CCTCACAGGG GAGCTGATGA TCCGGCATGA ACAGTATTTG
CGTGCCAATA CCATTTTCTC CGACATCGAC GATCGCACGC TCAACGATGA AGACTTGGCT
GCATACCTGC TTTATTATGC ATACGTGCGT ATGCAGTTGA GCGAGAGCAA TGCTTTGGCC
GAGCAGATGC TGCTGAAGGC TTCCGAGAGC AGAGGTGAGC TGGGCGGACG TGCTTGTCTC
CTCCTTGCTG CTATCCAAAT AGAGGATGGC CGAATCGACG AAGCCGAACA GACTCTTTCG
CGACTGAAGA ATCGCCCGCA ATTTGCGGAT GAAGCCGATG CTTATATCGC GGAAATCCAA
CTGTTGCGAG GCGACTACCG CGAAGCGGCC GAAATAGCCG ACAGACTCTT GTCGCGCAAT
TCGCCCATGA GACAGCGACC TCAACTCCTG CGTGTGGCAG GCAATGCTTA CTACCGTTTG
GGCGATTCGA ACAAAACCAT CGACTATCTT TCCGACTACA GCGAAAAGGT GGGGGATCGT
ATCGCGCCTG CCGATGCTTA TGCACTTGGT GTCACTTACT ATAAGCAAGG TCTGATGAAA
GAAGCACTTC GGCCACTTGC CGCTGCTACC ACTGACGCCG GTTCTCTCGG TGCAGAGTCG
GCTCTCTATC TCGGACAGGC ACAGCTGGCC GAGGGTATGA CGAGCGAGGC CCTTATGGCT
TTCGAGAAGG CTGCGACTCA GGACGTCAAT CGGCCGGTAC GGGAGGTCGG AATGTACAAT
ATGGCTATGC TCATGCGTAG TACGGGACAG TCGAGCTTCG GTCAGTCCGT ACGCATTGCG
GAGAACTTCC TCAATGAGTT TCCCCGATCT TCCCACCGCG AACAGATGGC TGCTATTCTC
GTCGAATCAT ATTTCACCGG CAAGGATTAC AATTCTTCCC TTAGATCGAT CCAAAAGATT
GCACAGCCGA CGGCTTCCAT TCTGGCCGCC AAACAGTTTG TGCTGAACCG TATGGCAGAA
CAAAAGGAAG CAGCGGGCTA TGACAGTGAA GCACTTTCTT TTGTGTCGAG CAGTATATCT
ATGGGAAACA AAGGGGAATA TTTCCCTGAA GCATATTTCC TCCGAGGCAA TCTGCGGTAT
AGAGCCGGTG ACTTTCCGAC TGCAGCTGCG GATTACAGAG CCTATATCTC GGCCGCCGGT
GATCGTGATG CGGCGAATCT TCCTTTAGGA TATTATCGTC TGGGCTATTC TCTATTCAAT
GCCGAACGTT ACGATATGGC TTTGGAGGCC TTCAAAGAGT ATGTATCCCG ATCCGGTATT
GCCCCGAATC TCTCGGCTGA TGCCTATGCC CGTATCGGAG ACTGTAGGTA TATGAAGCGC
GACTTCCACG GTGCACGCGA AGCCTATTCC ATGGCTTACC GCGTCTATCC TTCCGGAGGT
GATTATGCCC TTTTGCGTCG TGCTCGGTTG GAGGGACTGG CCAAACAGTA TGCCGATCAG
ATCCAAACGT TGGACAAGCT CATTCGGGAA TTCCCCGATA GCCGCCATCT GACAGCTGCT
CTCTATGAAA AGGGGTGTGG TGCAGTGCTT AGCGGTAAGC ATAACGTGGC AGAAGAAGCT
TTCAATGCAG TTGTCAAAAG AAGCCCCGAC AGCCGTGAAG CCCGACAGTC GTCCTTGCAG
TTGGGTCTGC TCTATTATAA TACGGGTCGA ACGAAAGAGG CTATTCGCAC ATACCAACGG
ATCATTGACC GCTACCCTCG TAGCGAAGAG ACGACTGTAG CCCTCTCGGA TCTCCGTTCT
ATCTATCTGG AAGAGGATCG GATAGATGAG TATTCTACCT ATGTGCGTGG TCTGAAAGAT
AAGGTGTCCA TTGCCCCATC CGAGACGGAA CAGCTCGGTT TCCTTTCGGC CGAGCGCAAG
TATCGCCGTC GCCAGCCCGA TGCTCGTCGA GATTTGGAGG CTTATCTGGA GCGCTATCCA
CAGGGAAGCG ATCGTCACAA AGCAGAATTG TATTTGGCCG ATCTCGATTA TCAGGCCGGC
AATGCTGATG CGGCTTACAA TCGTTATTCA CGGTTGGTGA ACAGCCCCGG TTTGCCCGAA
GACTATAAGA TCGATGCTCG CCTGCGTCTC GGTCGTATGC AATACGAACG CAAAGAGTAC
AAGGCTGCGC TGCAATCATT CCAATCCGTA CTCGATACGG ATGGTGCCGA AGCTGTCCGC
GATCAGGCGG TGCAGGGAGT AACCGAGTCT GCCTATGCCG ATAAGGATTA CCGGAGGGTG
ATCGACGTAA TTGCCGGATT GAAGAATCAG GCAGCTCTGC CTCATACTCT TCGTCTCTAT
CGTGCCAAAA GTTATCAGGC TCTGAAGATG AATCGGGAAG CGATTGCCGA CTATGAACTC
CTTGCCGAGG ATTTCTCAAC GGCCACAGGT GCCGAAGCCG TCGTGATGCA GGCACAGCTG
GAAATGGAAG CCAAGCGTTT GTCCAAGGCG AAGTCTATCC TCGAAAAATT TATTGCCAAG
AGTACTCCTC AGCAGTATTG GTTGGCACGA GGTTTCATCC TCTTGTCCGA TATTTATAAG
AAAGAAGGGG ATACGTTTAC GGCAAGGCAG TATTTGGAAA GCTTGGAAAA GAACTATCCT
AATCATGAGG ATGACATTCA CGAACAGATA GCCCAACGGC TTGAATAA
 
Protein sequence
MKKLILSALV LGGAMTFTAQ AQNKAAEPAE EFFLEGRAMF IRQNYVGALD QFRLYRLHVG 
QEHALEVDYY SIIADYLLGA DHDYVLSAVG NFLDNFPESI DKSRVRLLTG ELMIRHEQYL
RANTIFSDID DRTLNDEDLA AYLLYYAYVR MQLSESNALA EQMLLKASES RGELGGRACL
LLAAIQIEDG RIDEAEQTLS RLKNRPQFAD EADAYIAEIQ LLRGDYREAA EIADRLLSRN
SPMRQRPQLL RVAGNAYYRL GDSNKTIDYL SDYSEKVGDR IAPADAYALG VTYYKQGLMK
EALRPLAAAT TDAGSLGAES ALYLGQAQLA EGMTSEALMA FEKAATQDVN RPVREVGMYN
MAMLMRSTGQ SSFGQSVRIA ENFLNEFPRS SHREQMAAIL VESYFTGKDY NSSLRSIQKI
AQPTASILAA KQFVLNRMAE QKEAAGYDSE ALSFVSSSIS MGNKGEYFPE AYFLRGNLRY
RAGDFPTAAA DYRAYISAAG DRDAANLPLG YYRLGYSLFN AERYDMALEA FKEYVSRSGI
APNLSADAYA RIGDCRYMKR DFHGAREAYS MAYRVYPSGG DYALLRRARL EGLAKQYADQ
IQTLDKLIRE FPDSRHLTAA LYEKGCGAVL SGKHNVAEEA FNAVVKRSPD SREARQSSLQ
LGLLYYNTGR TKEAIRTYQR IIDRYPRSEE TTVALSDLRS IYLEEDRIDE YSTYVRGLKD
KVSIAPSETE QLGFLSAERK YRRRQPDARR DLEAYLERYP QGSDRHKAEL YLADLDYQAG
NADAAYNRYS RLVNSPGLPE DYKIDARLRL GRMQYERKEY KAALQSFQSV LDTDGAEAVR
DQAVQGVTES AYADKDYRRV IDVIAGLKNQ AALPHTLRLY RAKSYQALKM NREAIADYEL
LAEDFSTATG AEAVVMQAQL EMEAKRLSKA KSILEKFIAK STPQQYWLAR GFILLSDIYK
KEGDTFTARQ YLESLEKNYP NHEDDIHEQI AQRLE
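
The relationship between the two sequences above can be illustrated by translating the first and last lines of the gene sequence. This is a minimal sketch, not the database's own method. It uses the standard codon assignments, which translation table 11 shares for all sense codons (the tables differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing used in the record
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 48 bases of the gene sequence, copied from the record.
first_30 = "ATGAAAAAAC TGATATTATC GGCTCTTGTG"
last_48 = "AATCATGAGG ATGACATTCA CGAACAGATA GCCCAACGGC TTGAATAA"

print(translate(first_30))  # MKKLILSALV
print(translate(last_48))   # NHEDDIHEQIAQRLE
```

Both outputs match the start and end of the protein sequence listed above. The last line of the gene sequence begins at base 2941, which is in frame, and ends in the stop codon TAA.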