Gene Information |
Locus tag | Apre_1608 |
Symbol | |
ID | 8398420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerococcus prevotii DSM 20548 |
Kingdom | Bacteria |
Replicon accession | NC_013171 |
Strand | - |
Start bp | 1749572 |
End bp | 1752853 |
Gene Length | 3282 bp |
Protein Length | 1093 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 644995972 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_003153350 |
Protein GI | 257067094 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
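
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1749572
END_BP = 1752853
GENE_LENGTH_BP = 3282
PROTEIN_LENGTH_AA = 1093


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues, assuming the last codon is an untranslated stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 3282
    print(expected_protein_length(GENE_LENGTH_BP))  # 1093
```

Under these assumptions both values agree with the Gene Length and Protein Length rows of the record.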
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGTA AATATTATAT AGAAAAAATA GATACTAATC AAAGAAGAAT CAATCTTAAT AAGTCCTGGA AATTTGCTTT AGGAGATATT CCAGGATTTT ACAAGAACTT CTTCGACGAT TCTTCCTGGG ATTATATAGA CCTACCCCAC GATTACTCGA TAGCTAGACC TTATTCTAAG GCGGGAGAGG CCCAGTCTGC CTATAAGCTC GGAGGAGTGG GGCTTTACAG GAAAAGTTTT ATTATAGAGG ATAAAAAGAG AGCTAAACTT TCCTTTGATG GAATCTACTG TGATGTAGAA GTTTTCTTAA ATGGTAGGAA ATTGGCTTGC CACCACCATG GATACAGTCC CTTTCTAATC GATATCACTG ATTTCCTTTA TTATGATAGG GAAAATATTT TGGCTATCAA AGTTGATAAT CCGATTCCTA CTAGCCGTTG GTATTCTGGT TCTGGAATTT ATAGAGATGT AGATCTCATC CTTACAAACG ACATAGCCTT TGATGAAGTT AGGATTGATG ATTATGGTTT AGAAGATAAG AAAGGAGAGA TCGACCTAGA AATCAGGTCT TTTATTTCAA ACAAGAGTAA CGTTGATGAA GAAATTATCC TTGAACATAA GCTTATCTAC AAGAATATAT TAGTTGAAAG ATATAAATCA GAAGTCTTTT ACATAAAAGC TATGGGAGAG ATGGAAGTTA AGGATAGATT TCCTATTTCC TACCCAAGGC TTTGGAGTGT GGATAATCCC GAGCTTTACA AGATAGAATC TCTTATAAAA TACAAGGGAA AAATAATTGA TAAAATTACG AGTAACTATG GCTTTAGATA CTTTACTGCT ACATCTTCTA GAGGTTTTTC CTTAAATGGA AAGTCCATAA AGTTAAAGGC AGTCTGCCTC CACCACGATC AGGGGGCTCT TGGAGCGGCC GATTATTATA GGGCAATCCT TAGGCAAGTT CTGCTTATGA AGGATATGGG AGCAAATGCA ATAAGAATCA CCCACAACCC AGGCAGTAAG AAGTTAATCG ATATAGCAAA CAAGGAGGGC ATCCTCCTTA TAGAAGAAAT ATTCGATGGA TGGATTTTGG ATAAGAATAA TAATTACAAG GATTATTCTA GGTATTTTGC CCAAAAAATT GGAGAAAATA AGCTCATAAA TGCGACTTGC GACATGACTT GGGGAGAATT TGACTTAAAG GAGACCTTAA GAAGAGATTA TAATGCCCCA TCAATCATAG CCTACTCCTT GGGCAATGAA ATTTTGTGCG GGACTAACCA GACTAGGAGC AAGGAATATC CTAGTCTTGC CAGAGATTTG ATAAGCTGGG CAAGAGAGCT TGACCAGAAA AGATTTCTAA CTATAGGGGA TAATTCCCTA AGAGATGGCT ACGATAAAAA CTTAGTCGAG ATCGAAGAAG AACTTACAAG ATCTAAGGGC CTTGTCGGTC TTAACTATTG CGATGGGGAT AAATATGATA AAATCCACAA AGACCACCCT AACTGGATTC TCTGGCAGTC AGAATCTGCT TCATCAGTTA ACTCTAGGGC TTGTTATGAT AGGTTGGGAG ATGATCTAAG AGAAGATATG CGTCTTACTT CCTATGATGA ATCCAAAGTC GCCTGGGGAA ATCTTGCAGC AGAAGCCTGG TTTGATGTAA TAAACCGCGA CTTTGTAATG GGAGAGGCTG TGTGGACGGT CTTTGATTAT CTAGGTGAGC CAACTCCCTA CAATGGCATA GAAAGGGGAG CACCCTACGG ATTTCCTGCT CCTAGATCTT CCTTTTTCGG CATAGTTGAT 
ACGGCGGGAT TTCCCAAAGA TTCCTACTAT CTCTATAGGG CCTTGTGGAA TGAAAAGGAT ACAACAACCC ATATTCTTCC TTCCTGGAAT GCTAAAGAGC TAGGAGACTT GGCGGAAAAT GTACCGATTG TTGTTTACAC TAATGCTTAT GCTATTGAGT TGATATTTAC TGATGAAGAT GGGAATATTA AATCTTTGGG AAAGAAATAC ATGGAGAAAG TTAGGACTGA GAGAGGATTT AGTTACAAGA AGGTCAAGGG AGAAGATGGT CCGAGATCTC TTTATATGAC TTGGCATATG CCTTATGAAA AGGGAGAGAT CTCTGCCATA TCCTTTGATG AAAATGGCAA AATCATAAGA AATACGGTAG GAAGATCTAA GGTTAAAACT CCTACTGAGG ATACTTTTAT AAAGTTAGAG GCCTTTTATC CTTCTATGGG AGAAAAACGA GAGGGAATTA ACTTCATTAC TATAGACCTT GTAGATGAAG ATGGGAATAT CAAATCTAGC GCATTAGATG AAATATCAGT AGAAGTATCA GAAAACGCAG AGCTTCTTGC CCTGGATTCA GGACTACAGG CTGACTTCGA ACTCTTTGCG ACTGATAAGA AAAGGGCCTA TGGAGGAAGG CTCCTTGCCA TAGTTAAGGC AAAAGGTCCC GGTCCTCTTA AGCTTAAAGC CTCTGGCCAA AATTTAAGAC AGACAGAGCT TACAATCCCT GTTTGGGGAG AATTTCACAA AAGAAATAGT CTAGTTTATG ATAAGTATCT GATAAATTCT TCGCCCGAAG ATCTTAATCT TGAAGCCAAT AAGAAAATTA CATGGAAGCT AATAGAAAAA GGTAAGTATC ATAGGAGCTA TGTGGGAGAT TGTGAGAGCG GACCGATCAA TACCTACGTC CTAGATATAA AGGGAGAGAT GAAGCTAGTA GATTATGAAA TGGGAATTTT CGAAGAAGAG CTACCAATCT TTCCTAAATC CCTTCCCCTA GTAGATGAGG AAGGACAAAT CTATTATCAT GGCAAGGAGA TAAGCTATGA TGATTTTGAT GAGGAAAAAT TTAGGAGAGA GGGTTTTGTA AGAACTAGGG CAAGGCTTAA ACTCCTAGGG AAAAATTATA AATCTTCTGT TAGTGTAAGA AAGCTAGTAG AAAGATACAG GAAAGATGCC TACATAGAGG ATTTCGCCCT AGAAAAAAGG GTTAGTGAAG ATGAGATATA CTTTGCCTAC GATACCCAGC AGATTTTTGG AGAAATTGAG ATCTATCCGC AAGGATTTTT TTCTGATTTT AGATTTTTCA TAGGAGAAAG TGAGAATGAA GGAAGCTTCA AGGAGATTAG GGCTAATAAG ATAGTAAAAG ATTCTTCTAA GACTACTTTT ACCTTCGAGA AATTTCCTGC AACATTTATC AAAATTATAG GCGATGTAAG AAATATTAGC AAAGTAAGAC TTAGGTCTAT GAAGATTAAA GTGGAAGAAT AG
|
Protein sequence | MKSKYYIEKI DTNQRRINLN KSWKFALGDI PGFYKNFFDD SSWDYIDLPH DYSIARPYSK AGEAQSAYKL GGVGLYRKSF IIEDKKRAKL SFDGIYCDVE VFLNGRKLAC HHHGYSPFLI DITDFLYYDR ENILAIKVDN PIPTSRWYSG SGIYRDVDLI LTNDIAFDEV RIDDYGLEDK KGEIDLEIRS FISNKSNVDE EIILEHKLIY KNILVERYKS EVFYIKAMGE MEVKDRFPIS YPRLWSVDNP ELYKIESLIK YKGKIIDKIT SNYGFRYFTA TSSRGFSLNG KSIKLKAVCL HHDQGALGAA DYYRAILRQV LLMKDMGANA IRITHNPGSK KLIDIANKEG ILLIEEIFDG WILDKNNNYK DYSRYFAQKI GENKLINATC DMTWGEFDLK ETLRRDYNAP SIIAYSLGNE ILCGTNQTRS KEYPSLARDL ISWARELDQK RFLTIGDNSL RDGYDKNLVE IEEELTRSKG LVGLNYCDGD KYDKIHKDHP NWILWQSESA SSVNSRACYD RLGDDLREDM RLTSYDESKV AWGNLAAEAW FDVINRDFVM GEAVWTVFDY LGEPTPYNGI ERGAPYGFPA PRSSFFGIVD TAGFPKDSYY LYRALWNEKD TTTHILPSWN AKELGDLAEN VPIVVYTNAY AIELIFTDED GNIKSLGKKY MEKVRTERGF SYKKVKGEDG PRSLYMTWHM PYEKGEISAI SFDENGKIIR NTVGRSKVKT PTEDTFIKLE AFYPSMGEKR EGINFITIDL VDEDGNIKSS ALDEISVEVS ENAELLALDS GLQADFELFA TDKKRAYGGR LLAIVKAKGP GPLKLKASGQ NLRQTELTIP VWGEFHKRNS LVYDKYLINS SPEDLNLEAN KKITWKLIEK GKYHRSYVGD CESGPINTYV LDIKGEMKLV DYEMGIFEEE LPIFPKSLPL VDEEGQIYYH GKEISYDDFD EEKFRREGFV RTRARLKLLG KNYKSSVSVR KLVERYRKDA YIEDFALEKR VSEDEIYFAY DTQQIFGEIE IYPQGFFSDF RFFIGESENE GSFKEIRANK IVKDSSKTTF TFEKFPATFI KIIGDVRNIS KVRLRSMKIK VEE
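
The sequences above are printed in space-separated blocks of 10 characters. The following sketch, which is not part of the original record, shows one way to strip that formatting, translate the gene sequence, and compute GC content. It assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand). It uses the codon assignments of the standard genetic code, which translation table 11 shares for elongation; the alternative start codons of table 11 are not handled. All function names are hypothetical.

```python
# Codon table built from the usual TCAG ordering of the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks between 10-character blocks."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGAAAAGTA AATATTATAT AGAAAAAATA"))  # MKSKYYIEKI
```

Applied to the first three blocks of the gene sequence, the sketch reproduces the first block of the protein sequence. For routine work, an established library such as Biopython is likely a better choice than a hand-written table.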