Gene Apre_1608 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1608 
Symbol 
ID8398420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1749572 
End bp1752853 
Gene Length3282 bp 
Protein Length1093 aa 
Translation table11 
GC content38% 
IMG OID644995972 
Productglycoside hydrolase family 2 sugar binding 
Protein accessionYP_003153350 
Protein GI257067094 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGTA AATATTATAT AGAAAAAATA GATACTAATC AAAGAAGAAT CAATCTTAAT 
AAGTCCTGGA AATTTGCTTT AGGAGATATT CCAGGATTTT ACAAGAACTT CTTCGACGAT
TCTTCCTGGG ATTATATAGA CCTACCCCAC GATTACTCGA TAGCTAGACC TTATTCTAAG
GCGGGAGAGG CCCAGTCTGC CTATAAGCTC GGAGGAGTGG GGCTTTACAG GAAAAGTTTT
ATTATAGAGG ATAAAAAGAG AGCTAAACTT TCCTTTGATG GAATCTACTG TGATGTAGAA
GTTTTCTTAA ATGGTAGGAA ATTGGCTTGC CACCACCATG GATACAGTCC CTTTCTAATC
GATATCACTG ATTTCCTTTA TTATGATAGG GAAAATATTT TGGCTATCAA AGTTGATAAT
CCGATTCCTA CTAGCCGTTG GTATTCTGGT TCTGGAATTT ATAGAGATGT AGATCTCATC
CTTACAAACG ACATAGCCTT TGATGAAGTT AGGATTGATG ATTATGGTTT AGAAGATAAG
AAAGGAGAGA TCGACCTAGA AATCAGGTCT TTTATTTCAA ACAAGAGTAA CGTTGATGAA
GAAATTATCC TTGAACATAA GCTTATCTAC AAGAATATAT TAGTTGAAAG ATATAAATCA
GAAGTCTTTT ACATAAAAGC TATGGGAGAG ATGGAAGTTA AGGATAGATT TCCTATTTCC
TACCCAAGGC TTTGGAGTGT GGATAATCCC GAGCTTTACA AGATAGAATC TCTTATAAAA
TACAAGGGAA AAATAATTGA TAAAATTACG AGTAACTATG GCTTTAGATA CTTTACTGCT
ACATCTTCTA GAGGTTTTTC CTTAAATGGA AAGTCCATAA AGTTAAAGGC AGTCTGCCTC
CACCACGATC AGGGGGCTCT TGGAGCGGCC GATTATTATA GGGCAATCCT TAGGCAAGTT
CTGCTTATGA AGGATATGGG AGCAAATGCA ATAAGAATCA CCCACAACCC AGGCAGTAAG
AAGTTAATCG ATATAGCAAA CAAGGAGGGC ATCCTCCTTA TAGAAGAAAT ATTCGATGGA
TGGATTTTGG ATAAGAATAA TAATTACAAG GATTATTCTA GGTATTTTGC CCAAAAAATT
GGAGAAAATA AGCTCATAAA TGCGACTTGC GACATGACTT GGGGAGAATT TGACTTAAAG
GAGACCTTAA GAAGAGATTA TAATGCCCCA TCAATCATAG CCTACTCCTT GGGCAATGAA
ATTTTGTGCG GGACTAACCA GACTAGGAGC AAGGAATATC CTAGTCTTGC CAGAGATTTG
ATAAGCTGGG CAAGAGAGCT TGACCAGAAA AGATTTCTAA CTATAGGGGA TAATTCCCTA
AGAGATGGCT ACGATAAAAA CTTAGTCGAG ATCGAAGAAG AACTTACAAG ATCTAAGGGC
CTTGTCGGTC TTAACTATTG CGATGGGGAT AAATATGATA AAATCCACAA AGACCACCCT
AACTGGATTC TCTGGCAGTC AGAATCTGCT TCATCAGTTA ACTCTAGGGC TTGTTATGAT
AGGTTGGGAG ATGATCTAAG AGAAGATATG CGTCTTACTT CCTATGATGA ATCCAAAGTC
GCCTGGGGAA ATCTTGCAGC AGAAGCCTGG TTTGATGTAA TAAACCGCGA CTTTGTAATG
GGAGAGGCTG TGTGGACGGT CTTTGATTAT CTAGGTGAGC CAACTCCCTA CAATGGCATA
GAAAGGGGAG CACCCTACGG ATTTCCTGCT CCTAGATCTT CCTTTTTCGG CATAGTTGAT
ACGGCGGGAT TTCCCAAAGA TTCCTACTAT CTCTATAGGG CCTTGTGGAA TGAAAAGGAT
ACAACAACCC ATATTCTTCC TTCCTGGAAT GCTAAAGAGC TAGGAGACTT GGCGGAAAAT
GTACCGATTG TTGTTTACAC TAATGCTTAT GCTATTGAGT TGATATTTAC TGATGAAGAT
GGGAATATTA AATCTTTGGG AAAGAAATAC ATGGAGAAAG TTAGGACTGA GAGAGGATTT
AGTTACAAGA AGGTCAAGGG AGAAGATGGT CCGAGATCTC TTTATATGAC TTGGCATATG
CCTTATGAAA AGGGAGAGAT CTCTGCCATA TCCTTTGATG AAAATGGCAA AATCATAAGA
AATACGGTAG GAAGATCTAA GGTTAAAACT CCTACTGAGG ATACTTTTAT AAAGTTAGAG
GCCTTTTATC CTTCTATGGG AGAAAAACGA GAGGGAATTA ACTTCATTAC TATAGACCTT
GTAGATGAAG ATGGGAATAT CAAATCTAGC GCATTAGATG AAATATCAGT AGAAGTATCA
GAAAACGCAG AGCTTCTTGC CCTGGATTCA GGACTACAGG CTGACTTCGA ACTCTTTGCG
ACTGATAAGA AAAGGGCCTA TGGAGGAAGG CTCCTTGCCA TAGTTAAGGC AAAAGGTCCC
GGTCCTCTTA AGCTTAAAGC CTCTGGCCAA AATTTAAGAC AGACAGAGCT TACAATCCCT
GTTTGGGGAG AATTTCACAA AAGAAATAGT CTAGTTTATG ATAAGTATCT GATAAATTCT
TCGCCCGAAG ATCTTAATCT TGAAGCCAAT AAGAAAATTA CATGGAAGCT AATAGAAAAA
GGTAAGTATC ATAGGAGCTA TGTGGGAGAT TGTGAGAGCG GACCGATCAA TACCTACGTC
CTAGATATAA AGGGAGAGAT GAAGCTAGTA GATTATGAAA TGGGAATTTT CGAAGAAGAG
CTACCAATCT TTCCTAAATC CCTTCCCCTA GTAGATGAGG AAGGACAAAT CTATTATCAT
GGCAAGGAGA TAAGCTATGA TGATTTTGAT GAGGAAAAAT TTAGGAGAGA GGGTTTTGTA
AGAACTAGGG CAAGGCTTAA ACTCCTAGGG AAAAATTATA AATCTTCTGT TAGTGTAAGA
AAGCTAGTAG AAAGATACAG GAAAGATGCC TACATAGAGG ATTTCGCCCT AGAAAAAAGG
GTTAGTGAAG ATGAGATATA CTTTGCCTAC GATACCCAGC AGATTTTTGG AGAAATTGAG
ATCTATCCGC AAGGATTTTT TTCTGATTTT AGATTTTTCA TAGGAGAAAG TGAGAATGAA
GGAAGCTTCA AGGAGATTAG GGCTAATAAG ATAGTAAAAG ATTCTTCTAA GACTACTTTT
ACCTTCGAGA AATTTCCTGC AACATTTATC AAAATTATAG GCGATGTAAG AAATATTAGC
AAAGTAAGAC TTAGGTCTAT GAAGATTAAA GTGGAAGAAT AG
 
Protein sequence
MKSKYYIEKI DTNQRRINLN KSWKFALGDI PGFYKNFFDD SSWDYIDLPH DYSIARPYSK 
AGEAQSAYKL GGVGLYRKSF IIEDKKRAKL SFDGIYCDVE VFLNGRKLAC HHHGYSPFLI
DITDFLYYDR ENILAIKVDN PIPTSRWYSG SGIYRDVDLI LTNDIAFDEV RIDDYGLEDK
KGEIDLEIRS FISNKSNVDE EIILEHKLIY KNILVERYKS EVFYIKAMGE MEVKDRFPIS
YPRLWSVDNP ELYKIESLIK YKGKIIDKIT SNYGFRYFTA TSSRGFSLNG KSIKLKAVCL
HHDQGALGAA DYYRAILRQV LLMKDMGANA IRITHNPGSK KLIDIANKEG ILLIEEIFDG
WILDKNNNYK DYSRYFAQKI GENKLINATC DMTWGEFDLK ETLRRDYNAP SIIAYSLGNE
ILCGTNQTRS KEYPSLARDL ISWARELDQK RFLTIGDNSL RDGYDKNLVE IEEELTRSKG
LVGLNYCDGD KYDKIHKDHP NWILWQSESA SSVNSRACYD RLGDDLREDM RLTSYDESKV
AWGNLAAEAW FDVINRDFVM GEAVWTVFDY LGEPTPYNGI ERGAPYGFPA PRSSFFGIVD
TAGFPKDSYY LYRALWNEKD TTTHILPSWN AKELGDLAEN VPIVVYTNAY AIELIFTDED
GNIKSLGKKY MEKVRTERGF SYKKVKGEDG PRSLYMTWHM PYEKGEISAI SFDENGKIIR
NTVGRSKVKT PTEDTFIKLE AFYPSMGEKR EGINFITIDL VDEDGNIKSS ALDEISVEVS
ENAELLALDS GLQADFELFA TDKKRAYGGR LLAIVKAKGP GPLKLKASGQ NLRQTELTIP
VWGEFHKRNS LVYDKYLINS SPEDLNLEAN KKITWKLIEK GKYHRSYVGD CESGPINTYV
LDIKGEMKLV DYEMGIFEEE LPIFPKSLPL VDEEGQIYYH GKEISYDDFD EEKFRREGFV
RTRARLKLLG KNYKSSVSVR KLVERYRKDA YIEDFALEKR VSEDEIYFAY DTQQIFGEIE
IYPQGFFSDF RFFIGESENE GSFKEIRANK IVKDSSKTTF TFEKFPATFI KIIGDVRNIS
KVRLRSMKIK VEE