Gene PG1837 details


Gene Information

Locus tag: PG1837
Symbol:
ID: 2551934
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Porphyromonas gingivalis W83
Kingdom: Bacteria
Replicon accession: NC_002950
Strand:
Start bp: 1930477
End bp: 1936794
Gene Length: 6318 bp
Protein Length: 2105 aa
Translation table: 11
GC content: 50%
IMG OID: 637150442
Product: hemagglutinin protein HagA
Protein accession: NP_905932
Protein GI: 34541453
COG category:
COG ID:
TIGRFAM ID:
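
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields above.
start_bp = 1930477
end_bp = 1936794
reported_gene_length = 6318     # bp
reported_protein_length = 2105  # aa


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of a complete CDS, assuming the last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(start_bp, end_bp)
print(length, length == reported_gene_length)
print(protein_length(length), protein_length(length) == reported_protein_length)
```

Under these assumptions both computed values agree with the reported 6318 bp and 2105 aa.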


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACGAA TTATCTTGGA GGCTCACGAT GTATGGGAAG ACGGCACAGG CTATCAAATG 
CTCTGGGATG CAGATCACAA TCAGTACGGC GCATCCATTC CCGAAGAATC TTTTTGGTTT
GCCAACGGAA CGATCCCGGC CGGTCTTTAC GATCCTTTCG AGTATAAAGT TCCGGTCAAT
GCCGATGCAT CTTTTTCTCC CACGAATTTC GTGCTTGATG GAACAGCATC AGCCGATATT
CCTGCCGGCA CTTATGACTA TGTAATCATT AACCCCAATC CTGGCATAAT ATATATAGTA
GGAGAGGGTG TCTCCAAAGG TAACGATTAT GTGGTAGAGG CCGGTAAGAC TTATCATTTC
ACTGTCCAAC GACAAGGCCC CGGCGATGCT GCGTCCGTTG TAGTGACCGG AGAAGGTGGC
AATGAATTCG CTCCCGTACA GAATCTCCAA TGGTCTGTAT CCGGGCAGAC AGTGACCCTC
ACTTGGCAAG CCCCCGCATC CGACAAACGG ACTTATGTGT TGAACGAAAG CTTCGATACG
CAAACGCTTC CTAACGGCTG GACAATGATC GATGCTGATG GTGATGGTCA CAATTGGCTA
TCTACAATAA ACGTTTACAA CACTGCTACT CATACAGGTG ACGGTGCTAT GTTTAGCAAA
TCATGGACAG CTAGCAGTGG TGCAAAAATT GATTTGAGTC CTGACAACTA TTTGGTAACT
CCTAAGTTTA CGGTTCCTGA GAATGGTAAA CTTTCTTATT GGGTTTCATC TCAAGAGCCT
TGGACTAATG AGCATTATGG AGTGTTCTTG TCCACAACCG GAAACGAGGC TGCAAACTTT
ACGATAAAGC TGCTGGAAGA AACCCTCGGA TCCGGCAAAC CTGCTCCGAT GAACTTGGTG
AAGAGTGAAG GAGTAAAGGC TCCGGCACCT TATCAGGAAA GAACCATCGA TCTCTCTGCC
TATGCCGGAC AACAGGTGTA CTTGGCATTC CGTCATTTCG GCTGTACAGG TATATTCCGT
CTTTATCTTG ATGACGTGGC TGTTTCTGGT GAAGGTTCTT CCAACGACTA CACGTACACG
GTATATCGTG ACAATGTTGT TATCGCCCAG AATCTCACGG CAACGACATT CAATCAGGAA
AATGTAGCTC CCGGCCAGTA CAACTACTGT GTTGAAGTTA AGTACACAGC CGGCGTATCT
CCGAAGGTAT GTAAAGACGT TACGGTAGAA GGATCCAATG AATTTGCTCC TGTACAGAAC
CTGACCGGTA GTGCAGTCGG CCAGAAAGTA ACGCTCAAGT GGGATGCACC TAATGGTACC
CCGAATCCGA ATCCGGGAAC AACAACACTT TCCGAATCAT TCGAAAATGG TATTCCTGCC
TCATGGAAGA CGATCGATGC AGACGGTGAC GGCAACAATT GGACGACGAC CCCTCCTCCC
GGAGGCTCCT CTTTTGCAGG TCACAACAGT GCAATCTGTG TCTCTTCGGC TTCTTATATC
AACTTTGAAG GCCCTCAGAA CCCTGATAAC TATCTGGTTA CACCGGAGCT TTCTCTTCCT
AACGGAGGAA CGCTTACTTT CTGGGTATGT GCACAAGATG CCAATTATGC ATCAGAGCAC
TATGCCGTGT ATGCATCTTC TACGGGTAAC GACGCTTCCA ACTTCGCCAA CGCTTTGTTG
GAAGAAGTGC TGACGGCCAA GACAGTTGTT ACGGCACCGG AAGCCATTCG TGGCACTCGT
GTTCAGGGCA CCTGGTATCA AAAGACGGTA CAGTTGCCTG CGGGTACTAA GTATGTTGCC
TTCCGTCACT TCGGCTGTAC GGACTTCTTC TGGATCAACC TCGATGATGT TGAGATCAAG
GCCAACGGCA AGCGCGCAGA CTTCACGGAA ACGTTCGAGT CTTCTACTCA TGGAGAGGCA
CCAGCGGAAT GGACTACTAT CGATGCCGAT GGCGATGGTC AGGGTTGGCT CTGTCTGTCT
TCCGGACAAT TGGGATGGCT GACAGCTCAT GGCGGCACCA ACGTAGTAGC CTCTTTCTCA
TGGAATGGAA TGGCTTTGAA TCCTGATAAC TATCTCATCT CAAAGGATGT TACAGGCGCA
ACGAAGGTAA AGTACTACTA TGCAGTCAAC GACGGTTTTC CCGGGGATCA CTATGCGGTG
ATGATCTCCA AGACGGGCAC GAACGCCGGA GACTTCACGG TCGTTTTCGA AGAAACGCCT
AACGGAATAA ATAAGGGCGG AGCAAGATTC GGTCTTTCCA CGGAAGCCAA TGGCGCCAAA
CCTCAAAGTG TATGGATCGA GCGTACGGTA GATTTGCCTG CGGGCACGAA GTATGTTGCT
TTCCGTCACT ACAATTGCTC GGATTTGAAC TACATTCTTT TGGATGATAT TCAGTTCACC
ATGGGTGGCA GCCCCACCCC GACCGATTAT ACCTACACGG TGTATCGTGA TGGTACGAAG
ATCAAGGAAG GTTTGACCGA AACGACCTTC GAAGAAGACG GCGTAGCTAC GGGCAATCAT
GAGTATTGCG TGGAAGTGAA GTACACAGCC GGCGTATCTC CGAAAGAGTG TGTAAACGTA
ACTGTTGATC CTGTGCAGTT TAATCCTGTA CAGAACCTGA CCGGTAGTGC AGTAGGTCAG
AAAGTAACGC TTAAGTGGGA TGCACCTAAT GGTACCCCGA ATCCGAATCC CGGAACAACT
ACACTTTCCG AATCATTCGA AAATGGTATT CCTGCCTCAT GGAAGACGAT CGATGCAGAC
GGTGACGGCA ACAATTGGAC GACGACCCCT CCTCCCGGAG GCACCTCTTT TGCAGGTCAC
AACAGTGCGA TCTGTGTCTC TTCGGCTTCT TATATCAACT TTGAAGGCCC TCAGAACCCT
GATAACTATC TGGTTACACC GGAGCTATCT CTTCCTAACG GAGGAACGCT TACTTTCTGG
GTATGTGCAC AAGATGCCAA TTATGCATCA GAGCACTATG CCGTGTATGC ATCTTCTACG
GGTAACGACG CTTCCAACTT CGCCAACGCT TTGTTGGAAG AAGTGCTGAC GGCCAAGACA
GTTGTTACGG CACCTGAAGC CATTCGTGGC ACTCGTGTTC AGGGCACCTG GTATCAAAAG
ACGGTACAGT TGCCTGCGGG TACTAAGTAT GTTGCCTTCC GTCACTTCGG CTGTACGGAC
TTCTTCTGGA TCAACCTCGA TGATGTTGAG ATCAAGGCCA ACGGCAAGCG CGCAGACTTC
ACGGAAACGT TCGAGTCTTC TACTCATGGA GAGGCACCAG CGGAATGGAC TACTATCGAT
GCCGATGGCG ATGGTCAGGG TTGGCTCTGT CTGTCTTCCG GACAATTGGA CTGGCTGACA
GCTCATGGCG GCACCAACGT AGTAGCCTCT TTCTCATGGA ATGGAATGGC TTTGAATCCT
GATAACTATC TCATCTCAAA GGATGTTACA GGCGCAACGA AGGTAAAGTA CTACTATGCA
GTCAACGACG GTTTTCCCGG GGATCACTAT GCGGTGATGA TCTCCAAGAC GGGCACGAAC
GCCGGAGACT TCACGGTTGT TTTCGAAGAA ACGCCTAACG GAATAAATAA GGGCGGAGCA
AGATTCGGTC TTTCCACGGA AGCCAATGGC GCCAAACCTC AAAGTGTATG GATCGAGCGT
ACGGTAGATT TGCCTGCGGG CACGAAGTAT GTTGCTTTCC GTCACTACAA TTGCTCGGAT
TTGAACTACA TTCTTTTGGA TGATATTCAG TTCACCATGG GTGGCAGCCC CACCCCGACC
GATTATACCT ACACGGTGTA TCGTGATGGT ACGAAGATCA AGGAAGGTTT GACCGAAACG
ACCTTCGAAG AAGACGGCGT AGCTACGGGC AATCATGAGT ATTGCGTGGA AGTGAAGTAC
ACAGCCGGCG TATCTCCGAA AGAGTGCGTA AACGTAACTG TTGATCCTGT GCAGTTCAAT
CCTGTACAGA ACCTGACCGG TAGTGCAGTC GGCCAGAAAG TAACGCTCAA GTGGGATGCA
CCTAATGGTA CCCCGAATCC GAATCCGGGA ACAACAACAC TTTCCGAATC ATTCGAAAAT
GGTATTCCTG CCTCATGGAA GACGATCGAT GCAGACGGTG ACGGCAACAA TTGGACGACG
ACCCCTCCTC CCGGAGGCAC CTCTTTTGCA GGTCACAACA GTGCGATCTG TGTCTCTTCG
GCTTCTTATA TCAACTTTGA AGGCCCTCAG AACCCTGATA ACTATCTGGT TACACCGGAG
CTTTCTCTTC CTAACGGAGG AACGCTTACT TTCTGGGTAT GTGCACAAGA TGCCAATTAT
GCATCAGAGC ACTATGCCGT GTATGCATCT TCTACGGGTA ACGACGCTTC CAACTTCGCC
AACGCTTTGT TGGAAGAAGT GCTGACGGCC AAGACAGTTG TTACGGCACC GGAAGCCATT
CGTGGTACTC GTGTTCAGGG CACCTGGTAT CAAAAGACGG TACAGTTGCC TGCGGGTACT
AAGTATGTTG CCTTCCGTCA CTTCGGCTGT ACGGACTTCT TCTGGATCAA CCTCGATGAT
GTTGAGATCA AGGCCAACGG CAAGCGCGCA GACTTCACGG AAACGTTCGA GTCTTCTACT
CATGGAGAGG CACCAGCGGA ATGGACTACT ATCGATGCCG ATGGCGATGG TCAGGGTTGG
CTCTGTCTGT CTTCCGGACA ATTGGGATGG CTGACAGCTC ATGGCGGCAC CAACGTAGTA
GCCTCTTTCT CATGGAATGG AATGGCTTTG AATCCTGATA ACTATCTCAT CTCAAAGGAT
GTTACAGGCG CAACGAAGGT AAAGTACTAC TATGCAGTCA ACGACGGTTT TCCCGGGGAT
CACTATGCGG TGATGATCTC CAAGACGGGC ACGAACGCCG GAGACTTCAC GGTCGTTTTC
GAAGAAACGC CTAACGGAAT AAATAAGGGC GGAGCAAGAT TCGGTCTTTC CACGGAAGCC
AATGGCGCCA AACCTCAAAG TGTATGGATC GAGCGTACGG TAGATTTGCC TGCGGGCACG
AAGTATGTTG CTTTCCGTCA CTACAATTGC TCGGATTTGA ACTACATTCT TTTGGATGAT
ATTCAGTTCA CCATGGGTGG CAGCCCCACC CCGACCGATT ATACCTACAC GGTGTATCGT
GATGGTACGA AGATCAAGGA AGGTTTGACC GAAACGACCT TCGAAGAAGA CGGCGTAGCT
ACGGGCAATC ATGAGTATTG CGTGGAAGTG AAGTACACAG CCGGCGTATC TCCGAAAGAG
TGCGTAAACG TAACTATTAA TCCGACACAG TTCAATCCTG TACAGAACCT GACGGCAGAA
CAAGCTCCTA ACAGCATGGA TGCAATCCTT AAATGGAATG CACCGGCATC TAAGCGTGCG
GAAGTTCTGA ACGAAGACTT CGAAAATGGT ATTCCTGCCT CATGGAAGAC GATCGATGCA
GACGGTGACG GCAACAATTG GACGACGACC CCTCCTCCCG GAGGCTCCTC TTTTGCAGGT
CACAACAGTG CGATCTGTGT CTCTTCGGCT TCTTATATCA ACTTTGAAGG TCCTCAGAAC
CCTGATAACT ATCTGGTTAC ACCGGAGCTT TCTCTTCCTG GCGGAGGAAC GCTTACTTTC
TGGGTATGTG CACAAGATGC CAATTATGCA TCAGAGCACT ATGCCGTGTA TGCATCTTCT
ACGGGTAACG ACGCTTCCAA CTTCGCCAAC GCTTTGTTGG AAGAAGTGCT GACGGCCAAG
ACAGTTGTTA CGGCACCGGA AGCCATTCGT GGTACTCGTG TTCAGGGCAC CTGGTATCAA
AAGACGGTAC AGTTGCCTGC GGGTACTAAG TATGTTGCCT TCCGTCACTT CGGCTGTACG
GACTTCTTCT GGATCAACCT TGATGATGTT GTAATCACTT CAGGGAACGC TCCGTCTTAC
ACCTATACGA TCTATCGTAA TAATACACAG ATAGCATCAG GCGTAACGGA GACTACTTAC
CGAGATCCGG ACTTGGCTAC CGGTTTTTAC ACGTACGGTG TTAAGGTTGT TTACCCGAAC
GGAGAATCAG CTATCGAAAC TGCTACGTTG AATATCACTT CGTTGGCAGA CGTAACGGCT
CAGAAGCCTT ACACGCTGAC AGTTGTAGGA AAGACGATCA CGGTAACTTG CCAAGGCGAA
GCTATGATCT ACGACATGAA CGGTCGTCGT CTGGCAGCGG GTCGCAACAC GGTTGTTTAC
ACGGCTCAGG GCGGCCACTA TGCAGTCATG GTTGTCGTTG ACGGCAAGTC CTACGTAGAG
AAACTCGCTG TAAAGTAA
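
The gene sequence above is displayed in space-separated blocks of ten bases. Before it can be used for anything such as a GC-content calculation, the grouping has to be stripped. The sketch below shows one way this might be done. The helper names are hypothetical. The `fragment` is only a stand-in built from the first and last displayed lines, not the full gene. The start-codon check assumes ATG, as this record shows, although translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, the sequence starts with ATG
    (an assumption, see above) and it ends with a table 11 stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Stand-in input: only the first and last displayed lines of the record.
# Paste the full listing here to check the whole gene.
first_line = "ATGGCACGAA TTATCTTGGA GGCTCACGAT GTATGGGAAG ACGGCACAGG CTATCAAATG"
last_line = "AAACTCGCTG TAAAGTAA"
fragment = clean_sequence(first_line + "\n" + last_line)

print(len(fragment), looks_like_complete_cds(fragment))
print(round(gc_fraction(fragment), 3))
```

Running the same functions on the complete listing should allow the reported gene length and GC content to be compared against the sequence itself.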
 
Protein sequence
MARIILEAHD VWEDGTGYQM LWDADHNQYG ASIPEESFWF ANGTIPAGLY DPFEYKVPVN 
ADASFSPTNF VLDGTASADI PAGTYDYVII NPNPGIIYIV GEGVSKGNDY VVEAGKTYHF
TVQRQGPGDA ASVVVTGEGG NEFAPVQNLQ WSVSGQTVTL TWQAPASDKR TYVLNESFDT
QTLPNGWTMI DADGDGHNWL STINVYNTAT HTGDGAMFSK SWTASSGAKI DLSPDNYLVT
PKFTVPENGK LSYWVSSQEP WTNEHYGVFL STTGNEAANF TIKLLEETLG SGKPAPMNLV
KSEGVKAPAP YQERTIDLSA YAGQQVYLAF RHFGCTGIFR LYLDDVAVSG EGSSNDYTYT
VYRDNVVIAQ NLTATTFNQE NVAPGQYNYC VEVKYTAGVS PKVCKDVTVE GSNEFAPVQN
LTGSAVGQKV TLKWDAPNGT PNPNPGTTTL SESFENGIPA SWKTIDADGD GNNWTTTPPP
GGSSFAGHNS AICVSSASYI NFEGPQNPDN YLVTPELSLP NGGTLTFWVC AQDANYASEH
YAVYASSTGN DASNFANALL EEVLTAKTVV TAPEAIRGTR VQGTWYQKTV QLPAGTKYVA
FRHFGCTDFF WINLDDVEIK ANGKRADFTE TFESSTHGEA PAEWTTIDAD GDGQGWLCLS
SGQLGWLTAH GGTNVVASFS WNGMALNPDN YLISKDVTGA TKVKYYYAVN DGFPGDHYAV
MISKTGTNAG DFTVVFEETP NGINKGGARF GLSTEANGAK PQSVWIERTV DLPAGTKYVA
FRHYNCSDLN YILLDDIQFT MGGSPTPTDY TYTVYRDGTK IKEGLTETTF EEDGVATGNH
EYCVEVKYTA GVSPKECVNV TVDPVQFNPV QNLTGSAVGQ KVTLKWDAPN GTPNPNPGTT
TLSESFENGI PASWKTIDAD GDGNNWTTTP PPGGTSFAGH NSAICVSSAS YINFEGPQNP
DNYLVTPELS LPNGGTLTFW VCAQDANYAS EHYAVYASST GNDASNFANA LLEEVLTAKT
VVTAPEAIRG TRVQGTWYQK TVQLPAGTKY VAFRHFGCTD FFWINLDDVE IKANGKRADF
TETFESSTHG EAPAEWTTID ADGDGQGWLC LSSGQLDWLT AHGGTNVVAS FSWNGMALNP
DNYLISKDVT GATKVKYYYA VNDGFPGDHY AVMISKTGTN AGDFTVVFEE TPNGINKGGA
RFGLSTEANG AKPQSVWIER TVDLPAGTKY VAFRHYNCSD LNYILLDDIQ FTMGGSPTPT
DYTYTVYRDG TKIKEGLTET TFEEDGVATG NHEYCVEVKY TAGVSPKECV NVTVDPVQFN
PVQNLTGSAV GQKVTLKWDA PNGTPNPNPG TTTLSESFEN GIPASWKTID ADGDGNNWTT
TPPPGGTSFA GHNSAICVSS ASYINFEGPQ NPDNYLVTPE LSLPNGGTLT FWVCAQDANY
ASEHYAVYAS STGNDASNFA NALLEEVLTA KTVVTAPEAI RGTRVQGTWY QKTVQLPAGT
KYVAFRHFGC TDFFWINLDD VEIKANGKRA DFTETFESST HGEAPAEWTT IDADGDGQGW
LCLSSGQLGW LTAHGGTNVV ASFSWNGMAL NPDNYLISKD VTGATKVKYY YAVNDGFPGD
HYAVMISKTG TNAGDFTVVF EETPNGINKG GARFGLSTEA NGAKPQSVWI ERTVDLPAGT
KYVAFRHYNC SDLNYILLDD IQFTMGGSPT PTDYTYTVYR DGTKIKEGLT ETTFEEDGVA
TGNHEYCVEV KYTAGVSPKE CVNVTINPTQ FNPVQNLTAE QAPNSMDAIL KWNAPASKRA
EVLNEDFENG IPASWKTIDA DGDGNNWTTT PPPGGSSFAG HNSAICVSSA SYINFEGPQN
PDNYLVTPEL SLPGGGTLTF WVCAQDANYA SEHYAVYASS TGNDASNFAN ALLEEVLTAK
TVVTAPEAIR GTRVQGTWYQ KTVQLPAGTK YVAFRHFGCT DFFWINLDDV VITSGNAPSY
TYTIYRNNTQ IASGVTETTY RDPDLATGFY TYGVKVVYPN GESAIETATL NITSLADVTA
QKPYTLTVVG KTITVTCQGE AMIYDMNGRR LAAGRNTVVY TAQGGHYAVM VVVDGKSYVE
KLAVK