Gene PG2024 details

Gene Information

Locus tag: PG2024
Symbol: hagE
ID: 2552074
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Porphyromonas gingivalis W83
Kingdom: Bacteria
Replicon accession: NC_002950
Strand:
Start bp: 2119173
End bp: 2124293
Gene Length: 5121 bp
Protein Length: 1706 aa
Translation table: 11
GC content: 48%
IMG OID: 637150603
Product: hemagglutinin protein HagE
Protein accession: NP_906092
Protein GI: 34541613
COG category:
COG ID:
TIGRFAM ID:
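
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes that the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2119173
end_bp = 2124293

# Assumption: coordinates are 1-based and inclusive, so the span
# includes both endpoints.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be a stop codon that does not encode an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (5121 bp) and Protein Length (1706 aa) fields.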


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.044838
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAACT TGAACAAGTT TGTTTCGATT GCTCTTTGCT CTTCCTTATT AGGAGGAATG 
GCATTTGCGC AGCAGACAGA GTTGGGACGC AATCCGAATG TGAGATTGCT CGAATCCACT
CAGCAATCGG TGACAAAGGT TCAGTTCCGT ATGGACAACC TCAAGTTCAC CGAAGTTCAA
ACCCCTAAGG GAATGGCACA AGTGCCGACC TATACAGAAG GGGTTAATCT TTCTGAAAAA
GGGATGCCTA CGCTTCCCAT TCTATCACGC TCTTTGGCGG TTTCAGACAC TCGTGAGATG
AAGGTAGAGG TTGTTTCCTC AAAGTTCATC GAAAAGAAAA ATGTCCTGAT TGCACCCTCC
AAGGGCATGA TTATGCGTAA CGAAGATCCG AAAAAGATCC CTTACGTTTA TGGAAAGAGC
TACTCGCAAA ACAAATTCTT CCCGGGAGAG ATCGCCACGC TTGATGATCC TTTTATCCTT
CGTGATGTGC GTGGACAGGT TGTAAACTTT GCGCCTTTGC AGTATAACCC TGTGACAAAG
ACGTTGCGCA TCTATACGGA AATCACTGTG GCAGTGAGCG AAACTTCGGA GCAAGGCAAA
AATATTCTGA ACAAGAAAGG TACATTTGCC GGCTTTGAAG ACACATACAA GCGCATGTTC
ATGAACTACG AGCCAGGGCG TTACACACCG GTAGAGGAAA AACAAAATGG TCGTATGATC
GTCATCGTAG CCAAAAAGTA TGAGGGAGAT ATTAAAGATT TCGTTGATTG GAAAAACCAA
CGCGGTCTCC GTACCGAGGT GAAAGTGGCA GAAGATATTG CTTCTCCCGT TACAGCTAAT
GCTATTCAGC AATTCGTTAA GCAAGAATAC GAGAAAGAAG GTAATGATTT GACCTATGTT
CTTTTGATTG GCGATCACAA AGATATTCCT GCCAAAATTA CTCCGGGGAT CAAATCCGAC
CAGGTATATG GACAAATAGT AGGTAATGAC CACTACAACG AAGTCTTCAT CGGTCGTTTC
TCATGTGAGA GCAAAGAGGA TCTGAAGACA CAAATCGATC GGACTATTCA CTATGAGCGC
AATATAACCA CGGAAGACAA ATGGCTCGGT CAGGCTCTTT GTATTGCTTC GGCTGAAGGA
GGCCCATCCG CAGACAATGG TGAAAGTGAT ATCCAGCATG AGAATGTAAT CGCCAATCTG
CTTACCCAGT ATGGTTATAC CAAGATTATC AAATGTTATG ATCCGGGAGT AACTCCTAAA
AACATTATTG ATGCTTTCAA CGGAGGAATC TCGTTGGCCA ACTATACGGG CCACGGTAGC
GAAACAGCTT GGGGTACGTC TCACTTCGGC ACCACTCATG TGAAGCAGCT TACCAACAGC
AACCAGCTAC CGTTTATTTT CGACGTAGCT TGTGTGAATG GCGATTTCCT ATTCAGCATG
CCTTGTTTCG CAGAAGCATT GATGCGTGCA CAAAAAGATG GTAAGCCGAC AGGTACTGTT
GCTATCATAG CGTCTACGAT CAACCAGTCT TGGGCTTCTC CTATGCGCGG GCAGGATGAG
ATGAACGAAA TTCTGTGCGA AAAACACCCG AACAACATCA AGCGTACTTT CGGTGGTGTC
ACCATGAACG GTATGTTTGC TATGGTGGAA AAGTATAAAA AGGATGGTGA GAAGATGCTC
GACACATGGA CTGTATTCGG CGACCCCTCG CTGCTCGTTC GTACACTTGT CCCGACCAAA
ATGCAGGTTA CGGCTCCGGC TCAGATTAAT TTGACGGATG CTTCAGTCAA CGTATCTTGC
GATTATAATG GTGCTATTGC TACCATTTCA GCCAATGGAA AGATGTTCGG TTCTGCAGTT
GTCGAAAATG GAACAGCTAC AATCAATCTG ACAGGTCTGA CAAATGAAAG CACGCTTACC
CTTACAGTAG TTGGTTACAA CAAAGAGACG GTTATTAAGA CCATCAACAC TAATGGTGAG
CCTAACCCCT ACCAGCCTGT TTCCAACTTG ACTGCTACAA CGCAGGGTCA GAAAGTAACG
CTCAAGTGGG ATGCACCGAG CACGAAAACC AATGCAACCA CTAATACCGC TCGCAGCGTG
GATGGCATAC GAGAACTGGT TCTTCTGTCA GTCAGCGATG CCCCCGAACT TCTTCGCAGC
GGTCAGGCCG AGATTGTTCT TGAAGCTCAC GATGTTTGGA ATGATGGATC CGGTTATCAG
ATTCTTTTGG ATGCAGACCA TGATCAATAT GGACAGGTTA TACCCAGTGA TACCCATACT
CTTTGGCCGA ACTGTAGTGT CCCGGCCAAT CTGTTCGCTC CGTTCGAATA TACGGTTCCG
GAAAATGCAG ATCCTTCTTG TTCCCCTACC AATATGATAA TGGATGGTAC TGCATCCGTT
AATATACCGG CCGGAACTTA TGACTTTGCA ATTGCTGCTC CTCAAGCAAA TGCAAAGATT
TGGATTGCCG GACAAGGACC GACGAAAGAA GATGATTATG TATTTGAAGC CGGTAAAAAA
TACCATTTCC TTATGAAGAA GATGGGTAGC GGTGATGGAA CTGAATTGAC TATAAGCGAA
GGTGGTGGAA GCGATTACAC CTATACTGTC TATCGTGACG GCACGAAGAT CAAGGAAGGT
CTGACGGCTA CGACATTCGA AGAAGACGGT GTAGCTGCAG GCAATCATGA GTATTGCGTG
GAAGTTAAGT ACACAGCCGG CGTATCTCCG AAGGTATGTA AAGACGTTAC GGTAGAAGGA
TCCAATGAAT TTGCTCCTGT ACAGAACCTG ACCGGTAGTG CAGTCGGCCA GAAAGTAACG
CTTAAGTGGG ATGCACCTAA TGGTACCCCG AATCCAAATC CAAATCCGAA TCCAAATCCG
AATCCCGGAA CAACTACACT TTCCGAATCA TTCGAAAATG GTATTCCTGC CTCATGGAAG
ACGATCGATG CAGACGGTGA CGGGCATGGC TGGAAGCCTG GAAATGCTCC CGGAATCGCT
GGCTACAATA GCAATGGTTG TGTATATTCA GAGTCATTCG GTCTTGGTGG TATAGGAGTT
CTTACCCCTG ACAACTATCT GATAACACCG GCATTGGATT TGCCTAACGG AGGTAAGTTG
ACTTTCTGGG TATGCGCACA GGATGCTAAT TATGCATCCG AGCACTATGC GGTGTATGCA
TCTTCGACCG GTAACGATGC ATCCAACTTC ACGAATGCTT TGTTGGAAGA GACGATTACG
GCAAAAGGTG TTCGCTCGCC GGAAGCTATT CGTGGTCGTA TACAGGGTAC TTGGCGCCAG
AAGACGGTAG ACCTTCCCGC AGGTACGAAA TATGTTGCTT TCCGTCACTT CCAAAGCACG
GATATGTTCT ACATCGACCT TGATGAGGTT GAGATCAAGG CCAATGGCAA GCGCGCAGAC
TTCACGGAAA CGTTCGAGTC TTCTACTCAT GGAGAGGCAC CAGCGGAATG GACTACTATC
GATGCCGATG GCGATGGTCA GGGTTGGCTC TGTCTGTCTT CCGGACAATT GGACTGGCTG
ACAGCTCATG GCGGCACCAA CGTAGTAAGC TCTTTCTCAT GGAATGGAAT GGCTTTGAAT
CCTGATAACT ATCTCATCTC AAAGGATGTT ACAGGCGCAA CGAAGGTAAA GTACTACTAT
GCAGTCAACG ACGGTTTTCC CGGGGATCAC TATGCGGTGA TGATCTCCAA GACGGGCACG
AACGCCGGAG ACTTCACGGT TGTTTTCGAA GAAACGCCTA ACGGAATAAA TAAGGGCGGA
GCAAGATTCG GTCTTTCCAC GGAAGCCGAT GGCGCCAAAC CTCAAAGTGT ATGGATCGAG
CGTACGGTAG ATTTGCCTGC GGGCACGAAG TATGTTGCTT TCCGTCACTA CAATTGCTCG
GATTTGAACT ACATTCTTTT GGATGATATT CAGTTCACCA TGGGTGGCAG CCCCACCCCG
ACCGATTATA CCTACACGGT GTATCGTGAT GGTACGAAGA TCAAGGAAGG TTTGACCGAA
ACGACCTTCG AAGAAGACGG CGTAGCTACG GGCAATCATG AGTATTGCGT GGAAGTGAAG
TACACAGCCG GCGTATCTCC GAAGAAATGT GTAAACGTAA CTGTTAATTC GACACAGTTC
AATCCTGTAA AGAACCTGAA GGCACAACCG GATGGCGGCG ACGTGGTTCT CAAGTGGGAA
GCCCCGAGCG CAAAGAAGAC AGAAGGTTCT CGTGAAGTAA AACGGATCGG AGACGGTCTT
TTCGTTACGA TCGAACCTGC AAACGATGTA CGTGCCAACG AAGCCAAGGT TGTGCTCGCA
GCAGACAACG TATGGGGAGA CAATACGGGT TACCAGTTCT TGTTGGATGC CGATCACAAT
ACATTCGGAA GTGTCATTCC GGCAACCGGT CCTCTCTTTA CCGGAACAGC TTCTTCCGAT
CTTTACAGTG CGAACTTCGA GTATTTGATC CCGGCCAATG CCGATCCTGT TGTTACTACA
CAGAATATTA TCGTTACAGG ACAGGGTGAA GTTGTAATCC CCGGTGGTGT TTACGACTAT
TGCATTACGA ACCCGGAACC TGCATCCGGA AAGATGTGGA TCGCAGGAGA TGGAGGCAAC
CAGCCTGCAC GTTATGACGA TTTCACATTC GAAGCAGGCA AGAAGTACAC CTTCACGATG
CGTCGCGCCG GAATGGGAGA TGGAACTGAT ATGGAAGTCG AAGACGATTC ACCTGCAAGC
TATACCTATA CAGTCTATCG TGACGGCACG AAGATCAAGG AAGGTCTGAC CGAAACGACC
TACCGCGATG CAGGAATGAG TGCACAATCT CATGAGTATT GCGTGGAAGT TAAGTACACA
GCCGGCGTAT CTCCGAAGGT TTGTGTGGAT TATATTCCTG ACGGAGTGGC AGACGTAACG
GCTCAGAAGC CTTACACGCT GACAGTTGTA GGAAAGACGA TCACGGTAAC TTGCCAAGGC
GAAGCTATGA TCTACGACAT GAACGGTCGT CGTCTGGCAG CCGGTCGCAA CACGGTTGTT
TACACGGCTC AGGGCGGCTA CTATGCAGTT ATGGTTGTCG TTGACGGCAA GTCTTACGTA
GAGAAACTCG CTATCAAGTA A
 
Protein sequence
MKNLNKFVSI ALCSSLLGGM AFAQQTELGR NPNVRLLEST QQSVTKVQFR MDNLKFTEVQ 
TPKGMAQVPT YTEGVNLSEK GMPTLPILSR SLAVSDTREM KVEVVSSKFI EKKNVLIAPS
KGMIMRNEDP KKIPYVYGKS YSQNKFFPGE IATLDDPFIL RDVRGQVVNF APLQYNPVTK
TLRIYTEITV AVSETSEQGK NILNKKGTFA GFEDTYKRMF MNYEPGRYTP VEEKQNGRMI
VIVAKKYEGD IKDFVDWKNQ RGLRTEVKVA EDIASPVTAN AIQQFVKQEY EKEGNDLTYV
LLIGDHKDIP AKITPGIKSD QVYGQIVGND HYNEVFIGRF SCESKEDLKT QIDRTIHYER
NITTEDKWLG QALCIASAEG GPSADNGESD IQHENVIANL LTQYGYTKII KCYDPGVTPK
NIIDAFNGGI SLANYTGHGS ETAWGTSHFG TTHVKQLTNS NQLPFIFDVA CVNGDFLFSM
PCFAEALMRA QKDGKPTGTV AIIASTINQS WASPMRGQDE MNEILCEKHP NNIKRTFGGV
TMNGMFAMVE KYKKDGEKML DTWTVFGDPS LLVRTLVPTK MQVTAPAQIN LTDASVNVSC
DYNGAIATIS ANGKMFGSAV VENGTATINL TGLTNESTLT LTVVGYNKET VIKTINTNGE
PNPYQPVSNL TATTQGQKVT LKWDAPSTKT NATTNTARSV DGIRELVLLS VSDAPELLRS
GQAEIVLEAH DVWNDGSGYQ ILLDADHDQY GQVIPSDTHT LWPNCSVPAN LFAPFEYTVP
ENADPSCSPT NMIMDGTASV NIPAGTYDFA IAAPQANAKI WIAGQGPTKE DDYVFEAGKK
YHFLMKKMGS GDGTELTISE GGGSDYTYTV YRDGTKIKEG LTATTFEEDG VAAGNHEYCV
EVKYTAGVSP KVCKDVTVEG SNEFAPVQNL TGSAVGQKVT LKWDAPNGTP NPNPNPNPNP
NPGTTTLSES FENGIPASWK TIDADGDGHG WKPGNAPGIA GYNSNGCVYS ESFGLGGIGV
LTPDNYLITP ALDLPNGGKL TFWVCAQDAN YASEHYAVYA SSTGNDASNF TNALLEETIT
AKGVRSPEAI RGRIQGTWRQ KTVDLPAGTK YVAFRHFQST DMFYIDLDEV EIKANGKRAD
FTETFESSTH GEAPAEWTTI DADGDGQGWL CLSSGQLDWL TAHGGTNVVS SFSWNGMALN
PDNYLISKDV TGATKVKYYY AVNDGFPGDH YAVMISKTGT NAGDFTVVFE ETPNGINKGG
ARFGLSTEAD GAKPQSVWIE RTVDLPAGTK YVAFRHYNCS DLNYILLDDI QFTMGGSPTP
TDYTYTVYRD GTKIKEGLTE TTFEEDGVAT GNHEYCVEVK YTAGVSPKKC VNVTVNSTQF
NPVKNLKAQP DGGDVVLKWE APSAKKTEGS REVKRIGDGL FVTIEPANDV RANEAKVVLA
ADNVWGDNTG YQFLLDADHN TFGSVIPATG PLFTGTASSD LYSANFEYLI PANADPVVTT
QNIIVTGQGE VVIPGGVYDY CITNPEPASG KMWIAGDGGN QPARYDDFTF EAGKKYTFTM
RRAGMGDGTD MEVEDDSPAS YTYTVYRDGT KIKEGLTETT YRDAGMSAQS HEYCVEVKYT
AGVSPKVCVD YIPDGVADVT AQKPYTLTVV GKTITVTCQG EAMIYDMNGR RLAAGRNTVV
YTAQGGYYAV MVVVDGKSYV EKLAIK
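
As a spot check of how the two sequences relate, the sketch below translates the first and last lines of the gene sequence and compares them with the ends of the protein sequence. It is an illustration only: the `translate` helper and the variable names are hypothetical, and the codon table is written out in the usual TCAG ordering. Translation table 11 assigns the same amino acids to codons as the standard table and differs in which start codons it allows, so the standard assignments are used here.

```python
# Codon table in TCAG order (first, second, third base).
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))

# First and last lines of the gene sequence as displayed above.
gene_head = "ATGAAAAACT TGAACAAGTT TGTTTCGATT GCTCTTTGCT CTTCCTTATT AGGAGGAATG"
gene_tail = "GAGAAACTCG CTATCAAGTA A"

# Start and end of the protein sequence as displayed above.
protein_head = "MKNLNKFVSI ALCSSLLGGM".replace(" ", "")
protein_tail = "EKLAIK"

print(translate(gene_head) == protein_head)
print(translate(gene_tail) == protein_tail + "*")
```

The last line of the gene sequence begins at base 5101, which falls on a codon boundary, so it can be translated on its own without shifting the reading frame.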