Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PG2024 |
Symbol | hagE |
ID | 2552074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Porphyromonas gingivalis W83 |
Kingdom | Bacteria |
Replicon accession | NC_002950 |
Strand | - |
Start bp | 2119173 |
End bp | 2124293 |
Gene Length | 5121 bp |
Protein Length | 1706 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637150603 |
Product | hemagglutinin protein HagE |
Protein accession | NP_906092 |
Protein GI | 34541613 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.044838 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAACT TGAACAAGTT TGTTTCGATT GCTCTTTGCT CTTCCTTATT AGGAGGAATG GCATTTGCGC AGCAGACAGA GTTGGGACGC AATCCGAATG TGAGATTGCT CGAATCCACT CAGCAATCGG TGACAAAGGT TCAGTTCCGT ATGGACAACC TCAAGTTCAC CGAAGTTCAA ACCCCTAAGG GAATGGCACA AGTGCCGACC TATACAGAAG GGGTTAATCT TTCTGAAAAA GGGATGCCTA CGCTTCCCAT TCTATCACGC TCTTTGGCGG TTTCAGACAC TCGTGAGATG AAGGTAGAGG TTGTTTCCTC AAAGTTCATC GAAAAGAAAA ATGTCCTGAT TGCACCCTCC AAGGGCATGA TTATGCGTAA CGAAGATCCG AAAAAGATCC CTTACGTTTA TGGAAAGAGC TACTCGCAAA ACAAATTCTT CCCGGGAGAG ATCGCCACGC TTGATGATCC TTTTATCCTT CGTGATGTGC GTGGACAGGT TGTAAACTTT GCGCCTTTGC AGTATAACCC TGTGACAAAG ACGTTGCGCA TCTATACGGA AATCACTGTG GCAGTGAGCG AAACTTCGGA GCAAGGCAAA AATATTCTGA ACAAGAAAGG TACATTTGCC GGCTTTGAAG ACACATACAA GCGCATGTTC ATGAACTACG AGCCAGGGCG TTACACACCG GTAGAGGAAA AACAAAATGG TCGTATGATC GTCATCGTAG CCAAAAAGTA TGAGGGAGAT ATTAAAGATT TCGTTGATTG GAAAAACCAA CGCGGTCTCC GTACCGAGGT GAAAGTGGCA GAAGATATTG CTTCTCCCGT TACAGCTAAT GCTATTCAGC AATTCGTTAA GCAAGAATAC GAGAAAGAAG GTAATGATTT GACCTATGTT CTTTTGATTG GCGATCACAA AGATATTCCT GCCAAAATTA CTCCGGGGAT CAAATCCGAC CAGGTATATG GACAAATAGT AGGTAATGAC CACTACAACG AAGTCTTCAT CGGTCGTTTC TCATGTGAGA GCAAAGAGGA TCTGAAGACA CAAATCGATC GGACTATTCA CTATGAGCGC AATATAACCA CGGAAGACAA ATGGCTCGGT CAGGCTCTTT GTATTGCTTC GGCTGAAGGA GGCCCATCCG CAGACAATGG TGAAAGTGAT ATCCAGCATG AGAATGTAAT CGCCAATCTG CTTACCCAGT ATGGTTATAC CAAGATTATC AAATGTTATG ATCCGGGAGT AACTCCTAAA AACATTATTG ATGCTTTCAA CGGAGGAATC TCGTTGGCCA ACTATACGGG CCACGGTAGC GAAACAGCTT GGGGTACGTC TCACTTCGGC ACCACTCATG TGAAGCAGCT TACCAACAGC AACCAGCTAC CGTTTATTTT CGACGTAGCT TGTGTGAATG GCGATTTCCT ATTCAGCATG CCTTGTTTCG CAGAAGCATT GATGCGTGCA CAAAAAGATG GTAAGCCGAC AGGTACTGTT GCTATCATAG CGTCTACGAT CAACCAGTCT TGGGCTTCTC CTATGCGCGG GCAGGATGAG ATGAACGAAA TTCTGTGCGA AAAACACCCG AACAACATCA AGCGTACTTT CGGTGGTGTC ACCATGAACG GTATGTTTGC TATGGTGGAA AAGTATAAAA AGGATGGTGA GAAGATGCTC GACACATGGA CTGTATTCGG CGACCCCTCG CTGCTCGTTC GTACACTTGT CCCGACCAAA ATGCAGGTTA CGGCTCCGGC TCAGATTAAT TTGACGGATG CTTCAGTCAA CGTATCTTGC GATTATAATG GTGCTATTGC TACCATTTCA GCCAATGGAA AGATGTTCGG TTCTGCAGTT GTCGAAAATG GAACAGCTAC AATCAATCTG ACAGGTCTGA CAAATGAAAG CACGCTTACC CTTACAGTAG TTGGTTACAA CAAAGAGACG GTTATTAAGA CCATCAACAC TAATGGTGAG CCTAACCCCT ACCAGCCTGT TTCCAACTTG ACTGCTACAA CGCAGGGTCA GAAAGTAACG CTCAAGTGGG ATGCACCGAG CACGAAAACC AATGCAACCA CTAATACCGC TCGCAGCGTG GATGGCATAC GAGAACTGGT TCTTCTGTCA GTCAGCGATG CCCCCGAACT TCTTCGCAGC GGTCAGGCCG AGATTGTTCT TGAAGCTCAC GATGTTTGGA ATGATGGATC CGGTTATCAG ATTCTTTTGG ATGCAGACCA TGATCAATAT GGACAGGTTA TACCCAGTGA TACCCATACT CTTTGGCCGA ACTGTAGTGT CCCGGCCAAT CTGTTCGCTC CGTTCGAATA TACGGTTCCG GAAAATGCAG ATCCTTCTTG TTCCCCTACC AATATGATAA TGGATGGTAC TGCATCCGTT AATATACCGG CCGGAACTTA TGACTTTGCA ATTGCTGCTC CTCAAGCAAA TGCAAAGATT TGGATTGCCG GACAAGGACC GACGAAAGAA GATGATTATG TATTTGAAGC CGGTAAAAAA TACCATTTCC TTATGAAGAA GATGGGTAGC GGTGATGGAA CTGAATTGAC TATAAGCGAA GGTGGTGGAA GCGATTACAC CTATACTGTC TATCGTGACG GCACGAAGAT CAAGGAAGGT CTGACGGCTA CGACATTCGA AGAAGACGGT GTAGCTGCAG GCAATCATGA GTATTGCGTG GAAGTTAAGT ACACAGCCGG CGTATCTCCG AAGGTATGTA AAGACGTTAC GGTAGAAGGA TCCAATGAAT TTGCTCCTGT ACAGAACCTG ACCGGTAGTG CAGTCGGCCA GAAAGTAACG CTTAAGTGGG ATGCACCTAA TGGTACCCCG AATCCAAATC CAAATCCGAA TCCAAATCCG AATCCCGGAA CAACTACACT TTCCGAATCA TTCGAAAATG GTATTCCTGC CTCATGGAAG ACGATCGATG CAGACGGTGA CGGGCATGGC TGGAAGCCTG GAAATGCTCC CGGAATCGCT GGCTACAATA GCAATGGTTG TGTATATTCA GAGTCATTCG GTCTTGGTGG TATAGGAGTT CTTACCCCTG ACAACTATCT GATAACACCG GCATTGGATT TGCCTAACGG AGGTAAGTTG ACTTTCTGGG TATGCGCACA GGATGCTAAT TATGCATCCG AGCACTATGC GGTGTATGCA TCTTCGACCG GTAACGATGC ATCCAACTTC ACGAATGCTT TGTTGGAAGA GACGATTACG GCAAAAGGTG TTCGCTCGCC GGAAGCTATT CGTGGTCGTA TACAGGGTAC TTGGCGCCAG AAGACGGTAG ACCTTCCCGC AGGTACGAAA TATGTTGCTT TCCGTCACTT CCAAAGCACG GATATGTTCT ACATCGACCT TGATGAGGTT GAGATCAAGG CCAATGGCAA GCGCGCAGAC TTCACGGAAA CGTTCGAGTC TTCTACTCAT GGAGAGGCAC CAGCGGAATG GACTACTATC GATGCCGATG GCGATGGTCA GGGTTGGCTC TGTCTGTCTT CCGGACAATT GGACTGGCTG ACAGCTCATG GCGGCACCAA CGTAGTAAGC TCTTTCTCAT GGAATGGAAT GGCTTTGAAT CCTGATAACT ATCTCATCTC AAAGGATGTT ACAGGCGCAA CGAAGGTAAA GTACTACTAT GCAGTCAACG ACGGTTTTCC CGGGGATCAC TATGCGGTGA TGATCTCCAA GACGGGCACG AACGCCGGAG ACTTCACGGT TGTTTTCGAA GAAACGCCTA ACGGAATAAA TAAGGGCGGA GCAAGATTCG GTCTTTCCAC GGAAGCCGAT GGCGCCAAAC CTCAAAGTGT ATGGATCGAG CGTACGGTAG ATTTGCCTGC GGGCACGAAG TATGTTGCTT TCCGTCACTA CAATTGCTCG GATTTGAACT ACATTCTTTT GGATGATATT CAGTTCACCA TGGGTGGCAG CCCCACCCCG ACCGATTATA CCTACACGGT GTATCGTGAT GGTACGAAGA TCAAGGAAGG TTTGACCGAA ACGACCTTCG AAGAAGACGG CGTAGCTACG GGCAATCATG AGTATTGCGT GGAAGTGAAG TACACAGCCG GCGTATCTCC GAAGAAATGT GTAAACGTAA CTGTTAATTC GACACAGTTC AATCCTGTAA AGAACCTGAA GGCACAACCG GATGGCGGCG ACGTGGTTCT CAAGTGGGAA GCCCCGAGCG CAAAGAAGAC AGAAGGTTCT CGTGAAGTAA AACGGATCGG AGACGGTCTT TTCGTTACGA TCGAACCTGC AAACGATGTA CGTGCCAACG AAGCCAAGGT TGTGCTCGCA GCAGACAACG TATGGGGAGA CAATACGGGT TACCAGTTCT TGTTGGATGC CGATCACAAT ACATTCGGAA GTGTCATTCC GGCAACCGGT CCTCTCTTTA CCGGAACAGC TTCTTCCGAT CTTTACAGTG CGAACTTCGA GTATTTGATC CCGGCCAATG CCGATCCTGT TGTTACTACA CAGAATATTA TCGTTACAGG ACAGGGTGAA GTTGTAATCC CCGGTGGTGT TTACGACTAT TGCATTACGA ACCCGGAACC TGCATCCGGA AAGATGTGGA TCGCAGGAGA TGGAGGCAAC CAGCCTGCAC GTTATGACGA TTTCACATTC GAAGCAGGCA AGAAGTACAC CTTCACGATG CGTCGCGCCG GAATGGGAGA TGGAACTGAT ATGGAAGTCG AAGACGATTC ACCTGCAAGC TATACCTATA CAGTCTATCG TGACGGCACG AAGATCAAGG AAGGTCTGAC CGAAACGACC TACCGCGATG CAGGAATGAG TGCACAATCT CATGAGTATT GCGTGGAAGT TAAGTACACA GCCGGCGTAT CTCCGAAGGT TTGTGTGGAT TATATTCCTG ACGGAGTGGC AGACGTAACG GCTCAGAAGC CTTACACGCT GACAGTTGTA GGAAAGACGA TCACGGTAAC TTGCCAAGGC GAAGCTATGA TCTACGACAT GAACGGTCGT CGTCTGGCAG CCGGTCGCAA CACGGTTGTT TACACGGCTC AGGGCGGCTA CTATGCAGTT ATGGTTGTCG TTGACGGCAA GTCTTACGTA GAGAAACTCG CTATCAAGTA A
|
Protein sequence | MKNLNKFVSI ALCSSLLGGM AFAQQTELGR NPNVRLLEST QQSVTKVQFR MDNLKFTEVQ TPKGMAQVPT YTEGVNLSEK GMPTLPILSR SLAVSDTREM KVEVVSSKFI EKKNVLIAPS KGMIMRNEDP KKIPYVYGKS YSQNKFFPGE IATLDDPFIL RDVRGQVVNF APLQYNPVTK TLRIYTEITV AVSETSEQGK NILNKKGTFA GFEDTYKRMF MNYEPGRYTP VEEKQNGRMI VIVAKKYEGD IKDFVDWKNQ RGLRTEVKVA EDIASPVTAN AIQQFVKQEY EKEGNDLTYV LLIGDHKDIP AKITPGIKSD QVYGQIVGND HYNEVFIGRF SCESKEDLKT QIDRTIHYER NITTEDKWLG QALCIASAEG GPSADNGESD IQHENVIANL LTQYGYTKII KCYDPGVTPK NIIDAFNGGI SLANYTGHGS ETAWGTSHFG TTHVKQLTNS NQLPFIFDVA CVNGDFLFSM PCFAEALMRA QKDGKPTGTV AIIASTINQS WASPMRGQDE MNEILCEKHP NNIKRTFGGV TMNGMFAMVE KYKKDGEKML DTWTVFGDPS LLVRTLVPTK MQVTAPAQIN LTDASVNVSC DYNGAIATIS ANGKMFGSAV VENGTATINL TGLTNESTLT LTVVGYNKET VIKTINTNGE PNPYQPVSNL TATTQGQKVT LKWDAPSTKT NATTNTARSV DGIRELVLLS VSDAPELLRS GQAEIVLEAH DVWNDGSGYQ ILLDADHDQY GQVIPSDTHT LWPNCSVPAN LFAPFEYTVP ENADPSCSPT NMIMDGTASV NIPAGTYDFA IAAPQANAKI WIAGQGPTKE DDYVFEAGKK YHFLMKKMGS GDGTELTISE GGGSDYTYTV YRDGTKIKEG LTATTFEEDG VAAGNHEYCV EVKYTAGVSP KVCKDVTVEG SNEFAPVQNL TGSAVGQKVT LKWDAPNGTP NPNPNPNPNP NPGTTTLSES FENGIPASWK TIDADGDGHG WKPGNAPGIA GYNSNGCVYS ESFGLGGIGV LTPDNYLITP ALDLPNGGKL TFWVCAQDAN YASEHYAVYA SSTGNDASNF TNALLEETIT AKGVRSPEAI RGRIQGTWRQ KTVDLPAGTK YVAFRHFQST DMFYIDLDEV EIKANGKRAD FTETFESSTH GEAPAEWTTI DADGDGQGWL CLSSGQLDWL TAHGGTNVVS SFSWNGMALN PDNYLISKDV TGATKVKYYY AVNDGFPGDH YAVMISKTGT NAGDFTVVFE ETPNGINKGG ARFGLSTEAD GAKPQSVWIE RTVDLPAGTK YVAFRHYNCS DLNYILLDDI QFTMGGSPTP TDYTYTVYRD GTKIKEGLTE TTFEEDGVAT GNHEYCVEVK YTAGVSPKKC VNVTVNSTQF NPVKNLKAQP DGGDVVLKWE APSAKKTEGS REVKRIGDGL FVTIEPANDV RANEAKVVLA ADNVWGDNTG YQFLLDADHN TFGSVIPATG PLFTGTASSD LYSANFEYLI PANADPVVTT QNIIVTGQGE VVIPGGVYDY CITNPEPASG KMWIAGDGGN QPARYDDFTF EAGKKYTFTM RRAGMGDGTD MEVEDDSPAS YTYTVYRDGT KIKEGLTETT YRDAGMSAQS HEYCVEVKYT AGVSPKVCVD YIPDGVADVT AQKPYTLTVV GKTITVTCQG EAMIYDMNGR RLAAGRNTVV YTAQGGYYAV MVVVDGKSYV EKLAIK
|
| |