Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PG1837 |
Symbol | |
ID | 2551934 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Porphyromonas gingivalis W83 |
Kingdom | Bacteria |
Replicon accession | NC_002950 |
Strand | + |
Start bp | 1930477 |
End bp | 1936794 |
Gene Length | 6318 bp |
Protein Length | 2105 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637150442 |
Product | hemagglutinin protein HagA |
Protein accession | NP_905932 |
Protein GI | 34541453 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACGAA TTATCTTGGA GGCTCACGAT GTATGGGAAG ACGGCACAGG CTATCAAATG CTCTGGGATG CAGATCACAA TCAGTACGGC GCATCCATTC CCGAAGAATC TTTTTGGTTT GCCAACGGAA CGATCCCGGC CGGTCTTTAC GATCCTTTCG AGTATAAAGT TCCGGTCAAT GCCGATGCAT CTTTTTCTCC CACGAATTTC GTGCTTGATG GAACAGCATC AGCCGATATT CCTGCCGGCA CTTATGACTA TGTAATCATT AACCCCAATC CTGGCATAAT ATATATAGTA GGAGAGGGTG TCTCCAAAGG TAACGATTAT GTGGTAGAGG CCGGTAAGAC TTATCATTTC ACTGTCCAAC GACAAGGCCC CGGCGATGCT GCGTCCGTTG TAGTGACCGG AGAAGGTGGC AATGAATTCG CTCCCGTACA GAATCTCCAA TGGTCTGTAT CCGGGCAGAC AGTGACCCTC ACTTGGCAAG CCCCCGCATC CGACAAACGG ACTTATGTGT TGAACGAAAG CTTCGATACG CAAACGCTTC CTAACGGCTG GACAATGATC GATGCTGATG GTGATGGTCA CAATTGGCTA TCTACAATAA ACGTTTACAA CACTGCTACT CATACAGGTG ACGGTGCTAT GTTTAGCAAA TCATGGACAG CTAGCAGTGG TGCAAAAATT GATTTGAGTC CTGACAACTA TTTGGTAACT CCTAAGTTTA CGGTTCCTGA GAATGGTAAA CTTTCTTATT GGGTTTCATC TCAAGAGCCT TGGACTAATG AGCATTATGG AGTGTTCTTG TCCACAACCG GAAACGAGGC TGCAAACTTT ACGATAAAGC TGCTGGAAGA AACCCTCGGA TCCGGCAAAC CTGCTCCGAT GAACTTGGTG AAGAGTGAAG GAGTAAAGGC TCCGGCACCT TATCAGGAAA GAACCATCGA TCTCTCTGCC TATGCCGGAC AACAGGTGTA CTTGGCATTC CGTCATTTCG GCTGTACAGG TATATTCCGT CTTTATCTTG ATGACGTGGC TGTTTCTGGT GAAGGTTCTT CCAACGACTA CACGTACACG GTATATCGTG ACAATGTTGT TATCGCCCAG AATCTCACGG CAACGACATT CAATCAGGAA AATGTAGCTC CCGGCCAGTA CAACTACTGT GTTGAAGTTA AGTACACAGC CGGCGTATCT CCGAAGGTAT GTAAAGACGT TACGGTAGAA GGATCCAATG AATTTGCTCC TGTACAGAAC CTGACCGGTA GTGCAGTCGG CCAGAAAGTA ACGCTCAAGT GGGATGCACC TAATGGTACC CCGAATCCGA ATCCGGGAAC AACAACACTT TCCGAATCAT TCGAAAATGG TATTCCTGCC TCATGGAAGA CGATCGATGC AGACGGTGAC GGCAACAATT GGACGACGAC CCCTCCTCCC GGAGGCTCCT CTTTTGCAGG TCACAACAGT GCAATCTGTG TCTCTTCGGC TTCTTATATC AACTTTGAAG GCCCTCAGAA CCCTGATAAC TATCTGGTTA CACCGGAGCT TTCTCTTCCT AACGGAGGAA CGCTTACTTT CTGGGTATGT GCACAAGATG CCAATTATGC ATCAGAGCAC TATGCCGTGT ATGCATCTTC TACGGGTAAC GACGCTTCCA ACTTCGCCAA CGCTTTGTTG GAAGAAGTGC TGACGGCCAA GACAGTTGTT ACGGCACCGG AAGCCATTCG TGGCACTCGT GTTCAGGGCA CCTGGTATCA AAAGACGGTA CAGTTGCCTG CGGGTACTAA GTATGTTGCC TTCCGTCACT TCGGCTGTAC GGACTTCTTC TGGATCAACC TCGATGATGT TGAGATCAAG GCCAACGGCA AGCGCGCAGA CTTCACGGAA ACGTTCGAGT CTTCTACTCA TGGAGAGGCA CCAGCGGAAT GGACTACTAT CGATGCCGAT GGCGATGGTC AGGGTTGGCT CTGTCTGTCT TCCGGACAAT TGGGATGGCT GACAGCTCAT GGCGGCACCA ACGTAGTAGC CTCTTTCTCA TGGAATGGAA TGGCTTTGAA TCCTGATAAC TATCTCATCT CAAAGGATGT TACAGGCGCA ACGAAGGTAA AGTACTACTA TGCAGTCAAC GACGGTTTTC CCGGGGATCA CTATGCGGTG ATGATCTCCA AGACGGGCAC GAACGCCGGA GACTTCACGG TCGTTTTCGA AGAAACGCCT AACGGAATAA ATAAGGGCGG AGCAAGATTC GGTCTTTCCA CGGAAGCCAA TGGCGCCAAA CCTCAAAGTG TATGGATCGA GCGTACGGTA GATTTGCCTG CGGGCACGAA GTATGTTGCT TTCCGTCACT ACAATTGCTC GGATTTGAAC TACATTCTTT TGGATGATAT TCAGTTCACC ATGGGTGGCA GCCCCACCCC GACCGATTAT ACCTACACGG TGTATCGTGA TGGTACGAAG ATCAAGGAAG GTTTGACCGA AACGACCTTC GAAGAAGACG GCGTAGCTAC GGGCAATCAT GAGTATTGCG TGGAAGTGAA GTACACAGCC GGCGTATCTC CGAAAGAGTG TGTAAACGTA ACTGTTGATC CTGTGCAGTT TAATCCTGTA CAGAACCTGA CCGGTAGTGC AGTAGGTCAG AAAGTAACGC TTAAGTGGGA TGCACCTAAT GGTACCCCGA ATCCGAATCC CGGAACAACT ACACTTTCCG AATCATTCGA AAATGGTATT CCTGCCTCAT GGAAGACGAT CGATGCAGAC GGTGACGGCA ACAATTGGAC GACGACCCCT CCTCCCGGAG GCACCTCTTT TGCAGGTCAC AACAGTGCGA TCTGTGTCTC TTCGGCTTCT TATATCAACT TTGAAGGCCC TCAGAACCCT GATAACTATC TGGTTACACC GGAGCTATCT CTTCCTAACG GAGGAACGCT TACTTTCTGG GTATGTGCAC AAGATGCCAA TTATGCATCA GAGCACTATG CCGTGTATGC ATCTTCTACG GGTAACGACG CTTCCAACTT CGCCAACGCT TTGTTGGAAG AAGTGCTGAC GGCCAAGACA GTTGTTACGG CACCTGAAGC CATTCGTGGC ACTCGTGTTC AGGGCACCTG GTATCAAAAG ACGGTACAGT TGCCTGCGGG TACTAAGTAT GTTGCCTTCC GTCACTTCGG CTGTACGGAC TTCTTCTGGA TCAACCTCGA TGATGTTGAG ATCAAGGCCA ACGGCAAGCG CGCAGACTTC ACGGAAACGT TCGAGTCTTC TACTCATGGA GAGGCACCAG CGGAATGGAC TACTATCGAT GCCGATGGCG ATGGTCAGGG TTGGCTCTGT CTGTCTTCCG GACAATTGGA CTGGCTGACA GCTCATGGCG GCACCAACGT AGTAGCCTCT TTCTCATGGA ATGGAATGGC TTTGAATCCT GATAACTATC TCATCTCAAA GGATGTTACA GGCGCAACGA AGGTAAAGTA CTACTATGCA GTCAACGACG GTTTTCCCGG GGATCACTAT GCGGTGATGA TCTCCAAGAC GGGCACGAAC GCCGGAGACT TCACGGTTGT TTTCGAAGAA ACGCCTAACG GAATAAATAA GGGCGGAGCA AGATTCGGTC TTTCCACGGA AGCCAATGGC GCCAAACCTC AAAGTGTATG GATCGAGCGT ACGGTAGATT TGCCTGCGGG CACGAAGTAT GTTGCTTTCC GTCACTACAA TTGCTCGGAT TTGAACTACA TTCTTTTGGA TGATATTCAG TTCACCATGG GTGGCAGCCC CACCCCGACC GATTATACCT ACACGGTGTA TCGTGATGGT ACGAAGATCA AGGAAGGTTT GACCGAAACG ACCTTCGAAG AAGACGGCGT AGCTACGGGC AATCATGAGT ATTGCGTGGA AGTGAAGTAC ACAGCCGGCG TATCTCCGAA AGAGTGCGTA AACGTAACTG TTGATCCTGT GCAGTTCAAT CCTGTACAGA ACCTGACCGG TAGTGCAGTC GGCCAGAAAG TAACGCTCAA GTGGGATGCA CCTAATGGTA CCCCGAATCC GAATCCGGGA ACAACAACAC TTTCCGAATC ATTCGAAAAT GGTATTCCTG CCTCATGGAA GACGATCGAT GCAGACGGTG ACGGCAACAA TTGGACGACG ACCCCTCCTC CCGGAGGCAC CTCTTTTGCA GGTCACAACA GTGCGATCTG TGTCTCTTCG GCTTCTTATA TCAACTTTGA AGGCCCTCAG AACCCTGATA ACTATCTGGT TACACCGGAG CTTTCTCTTC CTAACGGAGG AACGCTTACT TTCTGGGTAT GTGCACAAGA TGCCAATTAT GCATCAGAGC ACTATGCCGT GTATGCATCT TCTACGGGTA ACGACGCTTC CAACTTCGCC AACGCTTTGT TGGAAGAAGT GCTGACGGCC AAGACAGTTG TTACGGCACC GGAAGCCATT CGTGGTACTC GTGTTCAGGG CACCTGGTAT CAAAAGACGG TACAGTTGCC TGCGGGTACT AAGTATGTTG CCTTCCGTCA CTTCGGCTGT ACGGACTTCT TCTGGATCAA CCTCGATGAT GTTGAGATCA AGGCCAACGG CAAGCGCGCA GACTTCACGG AAACGTTCGA GTCTTCTACT CATGGAGAGG CACCAGCGGA ATGGACTACT ATCGATGCCG ATGGCGATGG TCAGGGTTGG CTCTGTCTGT CTTCCGGACA ATTGGGATGG CTGACAGCTC ATGGCGGCAC CAACGTAGTA GCCTCTTTCT CATGGAATGG AATGGCTTTG AATCCTGATA ACTATCTCAT CTCAAAGGAT GTTACAGGCG CAACGAAGGT AAAGTACTAC TATGCAGTCA ACGACGGTTT TCCCGGGGAT CACTATGCGG TGATGATCTC CAAGACGGGC ACGAACGCCG GAGACTTCAC GGTCGTTTTC GAAGAAACGC CTAACGGAAT AAATAAGGGC GGAGCAAGAT TCGGTCTTTC CACGGAAGCC AATGGCGCCA AACCTCAAAG TGTATGGATC GAGCGTACGG TAGATTTGCC TGCGGGCACG AAGTATGTTG CTTTCCGTCA CTACAATTGC TCGGATTTGA ACTACATTCT TTTGGATGAT ATTCAGTTCA CCATGGGTGG CAGCCCCACC CCGACCGATT ATACCTACAC GGTGTATCGT GATGGTACGA AGATCAAGGA AGGTTTGACC GAAACGACCT TCGAAGAAGA CGGCGTAGCT ACGGGCAATC ATGAGTATTG CGTGGAAGTG AAGTACACAG CCGGCGTATC TCCGAAAGAG TGCGTAAACG TAACTATTAA TCCGACACAG TTCAATCCTG TACAGAACCT GACGGCAGAA CAAGCTCCTA ACAGCATGGA TGCAATCCTT AAATGGAATG CACCGGCATC TAAGCGTGCG GAAGTTCTGA ACGAAGACTT CGAAAATGGT ATTCCTGCCT CATGGAAGAC GATCGATGCA GACGGTGACG GCAACAATTG GACGACGACC CCTCCTCCCG GAGGCTCCTC TTTTGCAGGT CACAACAGTG CGATCTGTGT CTCTTCGGCT TCTTATATCA ACTTTGAAGG TCCTCAGAAC CCTGATAACT ATCTGGTTAC ACCGGAGCTT TCTCTTCCTG GCGGAGGAAC GCTTACTTTC TGGGTATGTG CACAAGATGC CAATTATGCA TCAGAGCACT ATGCCGTGTA TGCATCTTCT ACGGGTAACG ACGCTTCCAA CTTCGCCAAC GCTTTGTTGG AAGAAGTGCT GACGGCCAAG ACAGTTGTTA CGGCACCGGA AGCCATTCGT GGTACTCGTG TTCAGGGCAC CTGGTATCAA AAGACGGTAC AGTTGCCTGC GGGTACTAAG TATGTTGCCT TCCGTCACTT CGGCTGTACG GACTTCTTCT GGATCAACCT TGATGATGTT GTAATCACTT CAGGGAACGC TCCGTCTTAC ACCTATACGA TCTATCGTAA TAATACACAG ATAGCATCAG GCGTAACGGA GACTACTTAC CGAGATCCGG ACTTGGCTAC CGGTTTTTAC ACGTACGGTG TTAAGGTTGT TTACCCGAAC GGAGAATCAG CTATCGAAAC TGCTACGTTG AATATCACTT CGTTGGCAGA CGTAACGGCT CAGAAGCCTT ACACGCTGAC AGTTGTAGGA AAGACGATCA CGGTAACTTG CCAAGGCGAA GCTATGATCT ACGACATGAA CGGTCGTCGT CTGGCAGCGG GTCGCAACAC GGTTGTTTAC ACGGCTCAGG GCGGCCACTA TGCAGTCATG GTTGTCGTTG ACGGCAAGTC CTACGTAGAG AAACTCGCTG TAAAGTAA
|
Protein sequence | MARIILEAHD VWEDGTGYQM LWDADHNQYG ASIPEESFWF ANGTIPAGLY DPFEYKVPVN ADASFSPTNF VLDGTASADI PAGTYDYVII NPNPGIIYIV GEGVSKGNDY VVEAGKTYHF TVQRQGPGDA ASVVVTGEGG NEFAPVQNLQ WSVSGQTVTL TWQAPASDKR TYVLNESFDT QTLPNGWTMI DADGDGHNWL STINVYNTAT HTGDGAMFSK SWTASSGAKI DLSPDNYLVT PKFTVPENGK LSYWVSSQEP WTNEHYGVFL STTGNEAANF TIKLLEETLG SGKPAPMNLV KSEGVKAPAP YQERTIDLSA YAGQQVYLAF RHFGCTGIFR LYLDDVAVSG EGSSNDYTYT VYRDNVVIAQ NLTATTFNQE NVAPGQYNYC VEVKYTAGVS PKVCKDVTVE GSNEFAPVQN LTGSAVGQKV TLKWDAPNGT PNPNPGTTTL SESFENGIPA SWKTIDADGD GNNWTTTPPP GGSSFAGHNS AICVSSASYI NFEGPQNPDN YLVTPELSLP NGGTLTFWVC AQDANYASEH YAVYASSTGN DASNFANALL EEVLTAKTVV TAPEAIRGTR VQGTWYQKTV QLPAGTKYVA FRHFGCTDFF WINLDDVEIK ANGKRADFTE TFESSTHGEA PAEWTTIDAD GDGQGWLCLS SGQLGWLTAH GGTNVVASFS WNGMALNPDN YLISKDVTGA TKVKYYYAVN DGFPGDHYAV MISKTGTNAG DFTVVFEETP NGINKGGARF GLSTEANGAK PQSVWIERTV DLPAGTKYVA FRHYNCSDLN YILLDDIQFT MGGSPTPTDY TYTVYRDGTK IKEGLTETTF EEDGVATGNH EYCVEVKYTA GVSPKECVNV TVDPVQFNPV QNLTGSAVGQ KVTLKWDAPN GTPNPNPGTT TLSESFENGI PASWKTIDAD GDGNNWTTTP PPGGTSFAGH NSAICVSSAS YINFEGPQNP DNYLVTPELS LPNGGTLTFW VCAQDANYAS EHYAVYASST GNDASNFANA LLEEVLTAKT VVTAPEAIRG TRVQGTWYQK TVQLPAGTKY VAFRHFGCTD FFWINLDDVE IKANGKRADF TETFESSTHG EAPAEWTTID ADGDGQGWLC LSSGQLDWLT AHGGTNVVAS FSWNGMALNP DNYLISKDVT GATKVKYYYA VNDGFPGDHY AVMISKTGTN AGDFTVVFEE TPNGINKGGA RFGLSTEANG AKPQSVWIER TVDLPAGTKY VAFRHYNCSD LNYILLDDIQ FTMGGSPTPT DYTYTVYRDG TKIKEGLTET TFEEDGVATG NHEYCVEVKY TAGVSPKECV NVTVDPVQFN PVQNLTGSAV GQKVTLKWDA PNGTPNPNPG TTTLSESFEN GIPASWKTID ADGDGNNWTT TPPPGGTSFA GHNSAICVSS ASYINFEGPQ NPDNYLVTPE LSLPNGGTLT FWVCAQDANY ASEHYAVYAS STGNDASNFA NALLEEVLTA KTVVTAPEAI RGTRVQGTWY QKTVQLPAGT KYVAFRHFGC TDFFWINLDD VEIKANGKRA DFTETFESST HGEAPAEWTT IDADGDGQGW LCLSSGQLGW LTAHGGTNVV ASFSWNGMAL NPDNYLISKD VTGATKVKYY YAVNDGFPGD HYAVMISKTG TNAGDFTVVF EETPNGINKG GARFGLSTEA NGAKPQSVWI ERTVDLPAGT KYVAFRHYNC SDLNYILLDD IQFTMGGSPT PTDYTYTVYR DGTKIKEGLT ETTFEEDGVA TGNHEYCVEV KYTAGVSPKE CVNVTINPTQ FNPVQNLTAE QAPNSMDAIL KWNAPASKRA EVLNEDFENG IPASWKTIDA DGDGNNWTTT PPPGGSSFAG HNSAICVSSA SYINFEGPQN PDNYLVTPEL SLPGGGTLTF WVCAQDANYA SEHYAVYASS TGNDASNFAN ALLEEVLTAK TVVTAPEAIR GTRVQGTWYQ KTVQLPAGTK YVAFRHFGCT DFFWINLDDV VITSGNAPSY TYTIYRNNTQ IASGVTETTY RDPDLATGFY TYGVKVVYPN GESAIETATL NITSLADVTA QKPYTLTVVG KTITVTCQGE AMIYDMNGRR LAAGRNTVVY TAQGGHYAVM VVVDGKSYVE KLAVK
|
| |