Gene Information |
Locus tag | Aboo_0794 |
Symbol | |
ID | 8827744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Aciduliprofundum boonei T469 |
Kingdom | Archaea |
Replicon accession | NC_013926 |
Strand | + |
Start bp | 756867 |
End bp | 761180 |
Gene Length | 4314 bp |
Protein Length | 1437 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003483165 |
Protein GI | 289596469 |
COG category | |
COG ID | |
TIGRFAM ID | |
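
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration; they are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that contributes no amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Aboo_0794.
length_bp = gene_length(756867, 761180)
print(length_bp)                  # 4314
print(protein_length(length_bp))  # 1437
```

Both results match the Gene Length (4314 bp) and Protein Length (1437 aa) fields of the record.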
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0894131 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAACGAAA GAAGTATGAA GAAAATATTG TTAGTGACTT TAGTGTTAGC AGCGATGGTA CTCAGTGGCT TTTCAGTGAT GTCCGGGAAT GCTGCTGCGG CAGCTCCCAA AGTTTCAGCA GCCAATAACG GTAATCTAGT GGTAGCTTTT CAGAATGACA TATCAAACCT GAACATATTT GACCCAACAA CGAACACTGT TTGGAAAGCT GATGCTCTTG GGTGGGCATT TGAGAGTCTG TATTCTTATA CTCCGGACTT AATACCCTAT CCAAACTTGG CATTGAGTGC CACGCCTGAC GCAAGTGGCT TGAATGTAAC TGTTCAGCTA AGGCACAATG TGACTTTCCA AGATGGACAG CCAATGACCG CAGAGGATGT TGTATTTAGT TATCAGACCT TGTATTGGGA TTCTCTGTAT AAGACAACCC TTCAGTGCCT TTACTGGCCC ACTCCTAAGT GGCCTCTTTG GAATGGAAAT GGCACATCCC ACATAGGTGT TGTTGCCACA GGCAAATACA CTGTTGTCTT CCATCTATAT CAACCATATC CTCTGTTCTA TCAGGTTACA CTTGGTAATA CTATCATACC CGAGCACATA TGGGTAAAGC ACTTGGTATC TGCAGGTACA GGTGATAGTG ATGATATGAC AATAGATACA TCTTGGAATG CTCCAGGAGC AACTATTGGT ACTGGTCCAT TTAAGTTTGT AGAATGGAAG CCAGGTAACT ATGTGAAGAT TGAAGTTTAC AAGAATTACT GGGGTAAGGG ACTGTCTACT CCATGGAGAG GTAAAAATTG GCCTTGGTAT CCCGAGTATG TGAGAACAAT AACATTCAGA ATTTACAACA CCTTGGATAC TGCAGTTCTT GCACTGAAGA GAGGTGAGGT TCAGTTTATT GACTGGTCAT TGCCTCCAGG CTATTATAAC CAGTTGAAGA CTGATCCAAA TATAGGTGCT ACAATTGTTA ATGACCAGGG GTTCTTCTAT TTGGCATTCA ATATGAGAAA AGAGCCCATG AGCGATTTGG CCTTCAGGAC CGCTGTTGCC CACTGTATTA ATAAGGACTA CATAGTGACT ACCCTTATGC AGGGTTATGG TACGAAGGGT ACCGTGCCTA TATCAATTAC ATCTGGTGCA TATGTCAATA CATCTGCTAT ACCCCCAGGT TTCGATTTGA ATGCTGCTCA GCAGGTTCTT GACCAAGCGG GCTATAAAGT TGGCTCTGAT GGTTGGAGAC ACGCCCCAGA TGGCTCACCA ATTAAGGAGA CAATTCTCAC GCCGCCAAAG GATTACGATC CTATAAGGGC TGAAGCTGGC ATAATGATAC AGGCAAACCT TCAGAAAGTA AAATTGAATA TTCAGAGTAT TCCTACAAAC TTTGATACGA TAGTTTCAAA GGCATTCGTG CAGGTGCAGT TCGATATGTA CATCCTTGGA TGGGGTGTCG GTGCATTCCC TGAGACATAC CTCGAGGACT TCTTTGCAAG CTGGAATGCA GCACCTGTTG GTGATAACAC ACCTGGTTAC CACAATCCTA AGGTGGATAA ATTGCTCATT GAGATTAGAA CAGATATGAA CACTCAGGAT AGAATTAACA AGATTAAGGA GATTGAAGGC ATATTGGTGC ATGATTTGCC TTACGATGTG CTTTATTACA GAAAGAACAT AATGGCTTAC AGGCAGGATA TGTGGCAGGG CTGGGTTGCA GCCTATGGTA CTATTTGGAA TGGATTCTCA CTTGGCGTAC TCCATCCGCC GACATCTGGA GGCGGCGGTG GAGGCGGTGG CGGTGGCCAC 
CACCCAGGAC CTATAACTAA TGTTACAGGG AGTGGATTAG TTGGTGCAGT TGGACATCTT TATACCCCTA ACATAGCGTA TGCTGGGCAC TCTATAAATG GAGTATTCTA TCTCACTGAT CAGAATGGCA TTCCTATCCC TAATGCAAAG GTTAGCATCT ATGCATCCAA TGGAGTATGG GCAAATGGCA CCACTGGCAG CTCGGGTGGA CTTGCATTTA ATGTGCCTCT GAAGTTCGAG GATTATGGTA AAGTGAATAT TACATATTCT TATTCGGTTA AAATTGGGGG CCATGTGTAT AATGGCTCCG GTACGAACTC TGTAACCGTT TACCTGCCAA AGAATGTTGC AAAGCTGAAG CTAAGTATTG ATAAGCCAAT ACTTACCCCT GGTGCAAGTG CTACTATTAC TGCCAAAGTT ACTGACATCT ACGCTCATCC ACTTGCGGGA GTGAATGTTA CCATACTCTC AGAAGAGACC TCTGGTTCTA TAACACCAGG TTATGGTATA ACAAATAGTG ATGGTATTGC CACATTCCAT TACACTGCAC CAACGATGGT AACTAACATA AATCCTGTAG ATATAGTAAA GGCCCAGATT AAGGTCAACA ACACAATACT AACAAATCTA CAGACTGCCA CTCTCTTCGT TCCAATACAG AGCCATGGAA GCAGCTGGTA CAAGGTGCAG ATAACTCATG TGAGCAGTTA TGGAATAACT GCTGGTCAGA GTACCCAAGT AACCGTTAAG GTTGTGAATA TAAATGGAAA TCCTGTGGCA AATCATAAGG TTTATATGGC CGCCTACTAC TCAAATGCAA CTGATCCTAA CTGGTATGGC CAGCAGCTCG TACCTGCATG GGGAATAACA ATGGATTCAA ATGAAAAGAC AACAGATAGC AATGGTGAGG CGACTTTCAC CATAACTGCA AATGAGAACG CAAATATTCC TGTGATATTA AAGGCTTACA CACAGGATAC TTATATGGCG TATGACACTG TGCAGATTTA TGTAGGTAAC AATACTGGCT TTGATCCTTA CATTGGTCCT TGGACTGGCT TGTATGCAAT GGATATGCAG CTCAACAAGG TAACTGCTGG TGTTGGACAG CAGGTGCAGG TTACAATGAC TGTATATAAT GCAGCCACAG GTGCACCGCT GCCAAATGCT ACAGTGTTCA TGGCCGTATT CGGTACTGAT TATGGATTCG GTGCTGATTG GGCAAACAAT GTAGGTACAT ATGTATGGTG GGCAGGCCAG AGTGTTATCT GGGGAACAAC TGATGCAAAT GGTCAGTTTA GCTACACATT GAACACAAGT GCCCTTAGGG CAGATCAGCC CATCTATGTA AGCGGTTGGA TGGATGCATA TGGGTATGGA GCCAATGCTA TATGGACTGG TCTTGGATTT GACTTCCCGT ACCTCTTTGG CGTGAAGGAT GGCTTCATAC TTCAGAGAGC ACCAATAATG GGTATATCTA ACATAATGGT TAACGAGTTC TATCTAAGTA ACTCTCTGTG GACTACAATG ATGACCTTGA AGGTTGTTGA TATCAATGGT CCTCTAGCTA ATGTTACAGT GTCTGCTGAT TGGACTCTTG GTAACTATGA AAATAACATA AATGTAACCA CTAATGCTCA GGGTATTGCT CAGTTTAATA TAACAATAGA GCCAACAACG GCAGATAGTA TTGTATCTGT CTCAATTATG CTCTCCTCGC CTGACCATGC AATGAATCTC AACTATGAGT ACTATATACC ATTCTTATCA GGAGCATCAC AGCTTTCAAA GATACAAACA TTTGACTGTG 
TGAAAATATT AACTGAATCC TCGAATGTAG GTCTAGTGCC TTACGAAGGA AAGGCAACTA TTAGATTGCA TCTTATAGGA GGACTTTTGG GAACACCTAT TTCAGGTGAA CAAGTACATC TTTATGTACC AGCAGGTACA TTAGACAATT CTAGCGTGTT TACTGATGAA AATGGTACAG CGGTATTTAC ATACAATGCT CCCGATGTAC TTACACCAAA ACATTATGTT TTCGTTGCAA CTACCCCTGA AGGTGGAATA TACTTCTTCG GAGTGGAAGT TGCTGGCAAA TACAACAATA CCTTAGAATG GGCTAATGAA ATTAACAGCC TTAATCAGAA GATTGCAGAC CAGAGCTCAC AGATACAGGA TCTACAGAAC AAGTACAATA ATCTCAATAC CGAATACAAG AACTTGCAGG ACAAGTATAA TAGCTTGCAG ACAAACTACA CAAATCTAGG CAAGCAGAAA GATAATGCAG TGACAATGCA GTATGTATGG CTCTCTCTGT TCATCATAAT GATAATCATA GCAATAGTGA TGTACTATGC AGGCAAGAAG AGCGGAGCAA AGAGTGCACC TAGTGAAGAA GAGAGTACAG AAGAGGAAAT CGAGGAAGAA GAGGAGCCTA CAGAAGAAAT AGAGGAAGAA ACCGAAGAGG GCTCTGAGGA AGAGACCTCA GAGGAGGAAA ATACAGAGGA GTAA
Protein sequence | MNERSMKKIL LVTLVLAAMV LSGFSVMSGN AAAAAPKVSA ANNGNLVVAF QNDISNLNIF DPTTNTVWKA DALGWAFESL YSYTPDLIPY PNLALSATPD ASGLNVTVQL RHNVTFQDGQ PMTAEDVVFS YQTLYWDSLY KTTLQCLYWP TPKWPLWNGN GTSHIGVVAT GKYTVVFHLY QPYPLFYQVT LGNTIIPEHI WVKHLVSAGT GDSDDMTIDT SWNAPGATIG TGPFKFVEWK PGNYVKIEVY KNYWGKGLST PWRGKNWPWY PEYVRTITFR IYNTLDTAVL ALKRGEVQFI DWSLPPGYYN QLKTDPNIGA TIVNDQGFFY LAFNMRKEPM SDLAFRTAVA HCINKDYIVT TLMQGYGTKG TVPISITSGA YVNTSAIPPG FDLNAAQQVL DQAGYKVGSD GWRHAPDGSP IKETILTPPK DYDPIRAEAG IMIQANLQKV KLNIQSIPTN FDTIVSKAFV QVQFDMYILG WGVGAFPETY LEDFFASWNA APVGDNTPGY HNPKVDKLLI EIRTDMNTQD RINKIKEIEG ILVHDLPYDV LYYRKNIMAY RQDMWQGWVA AYGTIWNGFS LGVLHPPTSG GGGGGGGGGH HPGPITNVTG SGLVGAVGHL YTPNIAYAGH SINGVFYLTD QNGIPIPNAK VSIYASNGVW ANGTTGSSGG LAFNVPLKFE DYGKVNITYS YSVKIGGHVY NGSGTNSVTV YLPKNVAKLK LSIDKPILTP GASATITAKV TDIYAHPLAG VNVTILSEET SGSITPGYGI TNSDGIATFH YTAPTMVTNI NPVDIVKAQI KVNNTILTNL QTATLFVPIQ SHGSSWYKVQ ITHVSSYGIT AGQSTQVTVK VVNINGNPVA NHKVYMAAYY SNATDPNWYG QQLVPAWGIT MDSNEKTTDS NGEATFTITA NENANIPVIL KAYTQDTYMA YDTVQIYVGN NTGFDPYIGP WTGLYAMDMQ LNKVTAGVGQ QVQVTMTVYN AATGAPLPNA TVFMAVFGTD YGFGADWANN VGTYVWWAGQ SVIWGTTDAN GQFSYTLNTS ALRADQPIYV SGWMDAYGYG ANAIWTGLGF DFPYLFGVKD GFILQRAPIM GISNIMVNEF YLSNSLWTTM MTLKVVDING PLANVTVSAD WTLGNYENNI NVTTNAQGIA QFNITIEPTT ADSIVSVSIM LSSPDHAMNL NYEYYIPFLS GASQLSKIQT FDCVKILTES SNVGLVPYEG KATIRLHLIG GLLGTPISGE QVHLYVPAGT LDNSSVFTDE NGTAVFTYNA PDVLTPKHYV FVATTPEGGI YFFGVEVAGK YNNTLEWANE INSLNQKIAD QSSQIQDLQN KYNNLNTEYK NLQDKYNSLQ TNYTNLGKQK DNAVTMQYVW LSLFIIMIII AIVMYYAGKK SGAKSAPSEE ESTEEEIEEE EEPTEEIEEE TEEGSEEETS EEENTEE