Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pnuc_1149 |
Symbol | |
ID | 5052469 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 |
Kingdom | Bacteria |
Replicon accession | NC_009379 |
Strand | - |
Start bp | 1197574 |
End bp | 1204227 |
Gene Length | 6654 bp |
Protein Length | 2217 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640471320 |
Product | filamentous haemagglutinin outer membrane protein |
Protein accession | YP_001155929 |
Protein GI | 145589332 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0810] Periplasmic protein TonB, links inner and outer membranes |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATAGCA TCGGAAGGCA ATCTTCCTTT TCTCTTTCTT GCGTTACCAA TTCTTTGTTG AAGCGGACGA AGAGCGTACT GACACATTCT CTTTTTCTGA TAGCGTATGC AGGACTCTCT CACGCAGCCG CTCCCGTGCC AGCACCCACA GCAAAAACCC TCCCCACCGG CGGTCAGGTA GTCGCTGGCA GCGCCACAAT TAGTTCTTCA TCCACAGCGA ATACTGCTGT GATGAATATC AACCAAACCT CACAAAGAGC AGTAGTGAAC TGGGATAGTT TTAACGTTGG TAAAAACGCC ACGGTGAACT TTAATCAACC GAACTCTTCT GCAGTCACTC TCAATCGCGT GACGGGTGGT AATGCTTCAG TCATTAATGG CGCCATCCAC GCTAATGGCC AAGTCGTACT CGTTAATGAA AACGGGGTGG TCTTTGGTAA GGGTGCGCAG GTTAATGCAG CAGCAGTCAC TGCCTCTACT CTGAACATCG CCGATCAAGA GTTCATGGAT GGTAAGAGTA CTTACAAAGA TGATGGCATT GGAGTAGGTT CAAATGCCGG CAAGATCATC AATAAAGGCA AGATCCAAAC CAATAATGAC AATGGTGAAG GTGGCTTTAT TGCTCTATTA GCCCCAGAAG TGCGTAACCA AGGTTATCTC TTAGCCCAAA AAGGCGGCAC GGTTGCTATT GGCTCTGGCT CTCAAATCAC TCTGCATATC CAAGGCCAAA CTTTAGTAGC CATCAAAGTC GATGAGAGTG TCTATAACGG CCTGATCACC AATAAACGCA TTATCGAAGC CCCCGGTGGT TTAGTGGTTT TGGCTACTGG AGCAGCCAAT CAACTGATGG CAGGAGTCAT TAAGAACACA GGCCGCATTG CTGCCAATAG CTTAGAGAGC AACGGCGGGG TGATTGAGTT GGTAGCTAAA AACATCACCC AATCTGGGCA AGTGAGCGCA AATAGCCAAA CCAAAGAGGG TGGTCAAGTC AATCTGGTTG CTAGTGAGAT TACGCTTACT AAGAACTCCA AGACCACAGC CACTGGCGCA GCCGGTGGGG GCCAAGTCAA TATTGGTTTA GCTAGCACCC AAGTATCTGG TGGTGCCCAA GTCAACACCC CAACTCCAGT GGCAATCAAG GCCAATGCCA ACCAAGCAGC CCAAACGAAT CAGCTAGCCA ATACGGTTGC TGTGCAAGAG AGCGCCACAA TTGATACTTC AGCTACCCAA ACAGGTAACG GCGGATCGAT TGCGATCTGG TCCAAGGTAC AAACCACAGT GGCTGGCATC CTTAAATCCA TGGGCGGGGC AATCTTAGGT AACGGCGGCT TTATCGAAAC CAGCTCCCAA GGCCAAGTTG TTCTAGCCCC AACGGCAAGC ATCAATACCA GCGCAAACAA CCCAACTGGC AAAGCCGGCA CTTGGCTCTT AGATCCGATT GACCTGATCA TCAACAGTGA TGCTGCCAAT GTAATCGCCA ACGCCTTAGC AAACACCAAT GTGACCATTG CAGTTACCAA CAGCACCACT GCTTGCCCCA TTGGAAGTTG TACTCAAACT AATGTCAGTG GCGCCAACAG CAGTCTGACT ATTGCTAGTG GCGCCGATAT CCTCAAATTG GGCACCAACT ACACCACGCT CACTCTCTCA TCAGAAGGCA TCTTTAATCT CAATGCCAAT ATCAGTGGTC AGAATCTTGA TGTGATCATC AGCTCTTCGA TTGCCTATCT GAACGTAGGT AGCTCCATTA ATGCCTCCAA AGTAACCGTC CAAGCTCAAA CAATCTATAG CGCTGGCAGC ATCCAAACCA GTAATTACCT CCTAGGCGCA AACCCAGGAT CTTTAGGTAA TGCCATCGCC TTATTAGCCC AAGCCATTTA TGTGTCAGGC AGACTATCAG CTAATGCCAT AGGCAAAGTA GCTGGCAGCA TTACGCTCAC CGCCAACACT ATCAAGCTAT ATCCAAATGC AGTGTTAGAA GCCAATGGTG ATGAAGGTGG CCAAATTACG GTTGCTGCCA ATGACTCCCT CTGGAGTAGC GCTACAGTGC AAGCCAATGG TGGAAACGGT AGGGGAGGAA CACTGTCCTT AACCGCAGCA AACGATCAAT ACTTTGACCA ATCGGCATTA CAAGCAAACG GCACTACCGA TGGTGGCGCC ATCACGATCA CTACCCAAAG TGGTGACATC TCCTTTGCTA ACTCCCTCAT CCAAACCAAC GGCTCGACTG GTCGTGGCGG TAGCATTAGC CTATCGGCTA CTAACATTAC CCAAATTGCT AACAGCAATA TCAGCGCAAA CGGGTACTCC CAAGGTGGCA CAATATTAAT AGGCAATGAC GCTAGTAACG GTTCATTGCC TTTTTCTATG GCGACCACCA TTGATGAAAA GACCGCTATC AATGCAGCCC AGTTAGACCC CAATCCCACC AACCAACACG GTGGTTTTAT TGAAACCTCG GGTCACACTC TAAACCTGCT GGCAACTATT AATGCTGGTA GAGGGGGGAT GTGGTTGTTG GATCCGAATG ATATTTCAAT AGAGGCGCTA CCAGTTTTAG GTGGGACACC CTTTGCATAT GTAAGTGGCT CTTCATACAC CTATACGGCA GGGGCTAGTA GCGTGGTGTA TACCGCCCTG ATTACTTCTG CCCTGGCTAC TGCGGATGTC ATCATTACTA CAGCGTCAGG CAATATTACT GTTAACGGTG CGATTAGCGG AGTTCGCTCA TTAACCCTTC TTGCTAGCTC TGGAAATATT CAATTAAATG CAGGTATTCA GTTAACCGGT GCAGGATCCT CCATTGTTCT AAAGGCAAGT GGTTATATCT CTACTGCTGG TGCAAATACC TATTTGACCA ATGGTGGCGA TGTCATTTTT TGGTCTAACA CTGGAAATGT AACTAGTACT ACCGCAACAA ATGCCAATTT TATTTACCTT GATTCAGGTA CAAAACTTAT GACTGTCGGG GGCGCCATTT ATCTTGCAGG TGGCTTACCT TCTATAGGTA CTACATCTAA TGGAAACAAC TATCCAACTG GGTATGCATT TACTGGCAGT GTAAATTCTG GAGTTCTATT GGGGTCTTAT GCTGGTAACG GCATTCCCAT CATCATCAAG TCTGATGGCG GCAATATAGT GATTGCTGGA CAAACAACTA ATACTAATCT GCCGGGCTTT AGTAGCCAAT CTTCTCTGTT GATTGATTCA GGAATGGGCA CAATTTCGCT AACGGGAACT GCACATAATG GCCATGGTAT TGAGTTGGGC TATGGCTCTG CTTATTCAAA TATTGTGATT ACTTCCGCGT CATCGTCACC CACAGCAATT CAGATCAACG GGACGACAGA TGGCGGTTCG GGCTATAAAG GCTTCTGGGC GATCCATCTT GCTGGCTCCG CTACGCCAGG TGTTTTAATT CAGGCAACAG GATTTGGAGG TGGTGTTTCA CTTACTGGAA TCAATACGCC AAGTGGTATT GGGGTGTATT TAAGCGATAT TGCCATTCTT GCGAACAATG GCCCAATTCA AATCAATTCA ACCGCCCTTG TAACTCAATC GACCTCATCG TTTCTCGGGG CTTGTTCAGG CACTGCGAAT CCTTGTGCTT CTTTCGCCAT AGGTGACAAT ACATACACGC CAATTACTAA CTCTTCTTCT AATATTGCAA TTAATGTGGA TGCTAGGAAT GCAGGCGGGG TCTCTTGGAA CGGGGTATTG ACTGTTGATA CCACCGGCGG TCTAACAATT GCACCCTACA CAACTGACGG CTGGGCCACA ACCCCATTTA CATGGTCTGG AAGCAATGCT CTCTCGGGTG GAATAGCTAC ATTTACTGCA GATGCAAATA GTTTCATTGG CGTAGCTGAT AAGCTAATGA TTAAGAAAAT TACTAGTCTT ACTATTGGTG CATCCGACTC CACCAATACG GTGAATATTA ACCAAGACAC CGGCCCCTTT GGAAATCTCT CAGTTCATGC CGGAAATATC AATATTAATG CGCCAATCCA TTGGGTATCT GATGAAGCTA TCAATTTGAT TTCATCTGGC AATATTTCCG TTAATGCTGA TTTAGTCGGC CCCTCAAGTG GGTTAACTGC TGTCTATGGC GGCACTTATT CTTGCTCTGT TTGTTCCTCT TACTATGTAC TTTTAAATCC GGGTTCCAGC GTTTACGGTA GCGTACCAAA TCTGACTTAC GGAATTTATA CTAGCGCCGC AGGGACAACA TTAGCTTCAA GCGCAATTTC TAGCCTTGTT TCTGGTACGG CTAGTTGGAG TGGAGTCATG CCCGACTCAA CTTCTTCAGT TGGATCCTAT CCTTTAAGTT ACTCATCTGG ACTTTCTATC TCTAGTTTGT CTTATAGTCT GATTGCAGGT AATCTAAACA ATTGGACTAT TACTCCAGCA CCATTGGGTA TTAAAGTTAC TGCAACATAC ACGAGCGGTA GTGTATTTAC TCCCAGCAAT AGCGTTATTG CAGCAACTGG TTTAATGGCA GGGCAGACCA TTACGAGTCT TCAGGCGAAC TCTTCTAATG TAGGCCCTGG AAATTTTGTA ACTACCAGTG GATTGATTGG GGCAGGCGGT TTTCTAGCTA GCAACTATGC GATGTATGCT GGAAGTGAGG TAGGGAATTC TGGGGTGTTG GCTGGCGGTT TATACAGCGG GACAGTATCG GGCGCTACAA ATTCAGGCGG AACAAACACC GTCAATATTA ATAGAGCTAA CCTCGCCATT ACTGCTAATC CTAGCCCCTC TGGAAATGTG TATAACGGCA ACGCATACAA CGGTAGTTAC ATAACCAATG CCTTGGGATC TGACCAGGGA TTGATGACCA TCTCAGGCGT TGCTACTGGA GTTAATGCTG GAACTTATAC CTCTAACTTA GCTGTTAATG GCGCTGTCCT CTCTAACTAC AACACGCCGG TAATTACTAA TGCGAATTTA GTGATCTCAC CAAAACCAAT TACGATTACC GATATTGCAA GTCGTACCAT TTACGATGGT GTAACAAATT ACGCCACCTT GATGGCTAAT GCGGGTTATA CACTGAGCTC AGCATTGGTG GGTAGCGATA CGATTGGTGC AATCAATCAA ACAGCTTCAT CAGTTGGTGG TACTTCATTT TCTGGTATCG CTCAGGCTGG TATTTTCTTT AGCACTCCGA GCGCCGCTGT TTTATCAATT GGTAATGCCA ATAACTACCT TTTCTCTTAT GCAACTGCTA GCAATACTGT TGCCAAGGCT AGCCTCACCA TTGCTGCAAT ACCCAATCTC ACGGGGAACG TTTACAACGG CTCTACTTAT AACGGTACCT ATGTAACAAA TGCTTTGGGC TCTGATCAAG GCTTGATGAC TATCTCAGGT ATTGCTTCTG GAGTTGATGC TGGAATCTAT ACCTCTAATT TAGCAGTTAG CGGTTCAGTG CTATCTAACT ACAACAGTCC AGTGATTAGA AATGCGAATT TAGTGATCAC TCCTGCGCCG TTAGTTATCA AAGCGAATGA CTTAGCAAGA GTGGCAGGGA CTGCTTTTTA TGGCGGTAAT GGTGTGACCT ATACCGGTCT CCTGAATGGG GAGTCACCCT CAGTACTCTC AGGGGTATTA ACGTATTCTG GATCAGCTCA AGGCGCAAGT AATGCGGGGG CTTATGTAAT CCAACCCGGC GGTTTGACTA GTCAAAATTA TTCTATCGGA TTTATTAGCG GTGTTTTAGT GAATACTCCA CAGCCAGTGG TTAATGCGAT TACTCCGCCA CCGCCGCCTC CACCTCCTCC AATGGTTTTT AGTATTGCGC CTCCTCCACC CCCACCGGCA GCAGCTGCGA TGGTGTCCAT AGCCCCGCCA CCACCTGCTG CTGAAGCTGG GTCCGGGGGA CGAGAAGTTG CTCCTCCACC TGTCGCTGCT GCAATGGTTG CAGCGCCTGA TGGTGGAATT GCCTTGGCAC CTGCACCAGC AGCTCCGCCA CCAACTGCTT CTGCACCACC ACCTGCAGCT CCCTCAGGCG CAAATCCATC AAATACTCCT AGTTCATCTA CCAATACGGA TGCATCGAGT AATAAGCCGT CAGAAGGTAA ACCAAATGGC ACGAACTCAA AAGGAGAGCG TATTGGAAAA TATGACGCTC GAAGATTGGC GGAATCAAAA GAAGCCAAAG AGGGTAAGGG TGGATCTGAT AAACCTGGTT CTAAAAAAGC AGAACCTGGA GTGAATGGGG AAACCCCTCC TAAATATGCT GGTAAATATG CTAATGGTTT CCGGGCTGCT GAAAAGGCTG CTGGAAAAGA TGGCGCCAAT AAGTCATTGG CTTCAAATAA GGCTAACCCT CCTAGAGAGG GAAAATACAG CAACCGAGTA AATGCGCTTA ATAATGGTTC AGCGTCAGCT TCAGCTGCAA TGGCAGCGGC CGCGATGTCA CAACATCAAT TCTCCTCAAC TGGCTTACCC CCTTTACCAG GGGCTCCTGC TGAGGCAGCC ACCCCAACAA TATTGCGGGG CGGCGATAGC TTAACCCAGT CTTATGATGA TGTACCATCG ATACGTAATT CAGGGGTTGC TAATGCGGGA CGTTCACGGA ATACGGAAAA TTACCATGAG AGTCTTGAGT CGGTTAATTT AATGTCGACA CTGAATTTAT TTATAGTCCA TTAG
|
Protein sequence | MNSIGRQSSF SLSCVTNSLL KRTKSVLTHS LFLIAYAGLS HAAAPVPAPT AKTLPTGGQV VAGSATISSS STANTAVMNI NQTSQRAVVN WDSFNVGKNA TVNFNQPNSS AVTLNRVTGG NASVINGAIH ANGQVVLVNE NGVVFGKGAQ VNAAAVTAST LNIADQEFMD GKSTYKDDGI GVGSNAGKII NKGKIQTNND NGEGGFIALL APEVRNQGYL LAQKGGTVAI GSGSQITLHI QGQTLVAIKV DESVYNGLIT NKRIIEAPGG LVVLATGAAN QLMAGVIKNT GRIAANSLES NGGVIELVAK NITQSGQVSA NSQTKEGGQV NLVASEITLT KNSKTTATGA AGGGQVNIGL ASTQVSGGAQ VNTPTPVAIK ANANQAAQTN QLANTVAVQE SATIDTSATQ TGNGGSIAIW SKVQTTVAGI LKSMGGAILG NGGFIETSSQ GQVVLAPTAS INTSANNPTG KAGTWLLDPI DLIINSDAAN VIANALANTN VTIAVTNSTT ACPIGSCTQT NVSGANSSLT IASGADILKL GTNYTTLTLS SEGIFNLNAN ISGQNLDVII SSSIAYLNVG SSINASKVTV QAQTIYSAGS IQTSNYLLGA NPGSLGNAIA LLAQAIYVSG RLSANAIGKV AGSITLTANT IKLYPNAVLE ANGDEGGQIT VAANDSLWSS ATVQANGGNG RGGTLSLTAA NDQYFDQSAL QANGTTDGGA ITITTQSGDI SFANSLIQTN GSTGRGGSIS LSATNITQIA NSNISANGYS QGGTILIGND ASNGSLPFSM ATTIDEKTAI NAAQLDPNPT NQHGGFIETS GHTLNLLATI NAGRGGMWLL DPNDISIEAL PVLGGTPFAY VSGSSYTYTA GASSVVYTAL ITSALATADV IITTASGNIT VNGAISGVRS LTLLASSGNI QLNAGIQLTG AGSSIVLKAS GYISTAGANT YLTNGGDVIF WSNTGNVTST TATNANFIYL DSGTKLMTVG GAIYLAGGLP SIGTTSNGNN YPTGYAFTGS VNSGVLLGSY AGNGIPIIIK SDGGNIVIAG QTTNTNLPGF SSQSSLLIDS GMGTISLTGT AHNGHGIELG YGSAYSNIVI TSASSSPTAI QINGTTDGGS GYKGFWAIHL AGSATPGVLI QATGFGGGVS LTGINTPSGI GVYLSDIAIL ANNGPIQINS TALVTQSTSS FLGACSGTAN PCASFAIGDN TYTPITNSSS NIAINVDARN AGGVSWNGVL TVDTTGGLTI APYTTDGWAT TPFTWSGSNA LSGGIATFTA DANSFIGVAD KLMIKKITSL TIGASDSTNT VNINQDTGPF GNLSVHAGNI NINAPIHWVS DEAINLISSG NISVNADLVG PSSGLTAVYG GTYSCSVCSS YYVLLNPGSS VYGSVPNLTY GIYTSAAGTT LASSAISSLV SGTASWSGVM PDSTSSVGSY PLSYSSGLSI SSLSYSLIAG NLNNWTITPA PLGIKVTATY TSGSVFTPSN SVIAATGLMA GQTITSLQAN SSNVGPGNFV TTSGLIGAGG FLASNYAMYA GSEVGNSGVL AGGLYSGTVS GATNSGGTNT VNINRANLAI TANPSPSGNV YNGNAYNGSY ITNALGSDQG LMTISGVATG VNAGTYTSNL AVNGAVLSNY NTPVITNANL VISPKPITIT DIASRTIYDG VTNYATLMAN AGYTLSSALV GSDTIGAINQ TASSVGGTSF SGIAQAGIFF STPSAAVLSI GNANNYLFSY ATASNTVAKA SLTIAAIPNL TGNVYNGSTY NGTYVTNALG SDQGLMTISG IASGVDAGIY TSNLAVSGSV LSNYNSPVIR NANLVITPAP LVIKANDLAR VAGTAFYGGN GVTYTGLLNG ESPSVLSGVL TYSGSAQGAS NAGAYVIQPG GLTSQNYSIG FISGVLVNTP QPVVNAITPP PPPPPPPMVF SIAPPPPPPA AAAMVSIAPP PPAAEAGSGG REVAPPPVAA AMVAAPDGGI ALAPAPAAPP PTASAPPPAA PSGANPSNTP SSSTNTDASS NKPSEGKPNG TNSKGERIGK YDARRLAESK EAKEGKGGSD KPGSKKAEPG VNGETPPKYA GKYANGFRAA EKAAGKDGAN KSLASNKANP PREGKYSNRV NALNNGSASA SAAMAAAAMS QHQFSSTGLP PLPGAPAEAA TPTILRGGDS LTQSYDDVPS IRNSGVANAG RSRNTENYHE SLESVNLMST LNLFIVH
|
| |