Gene Information |
Locus tag | Apre_0015 |
Symbol | |
ID | 8396762 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerococcus prevotii DSM 20548 |
Kingdom | Bacteria |
Replicon accession | NC_013171 |
Strand | + |
Start bp | 17737 |
End bp | 19923 |
Gene Length | 2187 bp |
Protein Length | 728 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 644994352 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003151791 |
Protein GI | 257065535 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
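The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information table for Apre_0015.
START_BP = 17737
END_BP = 19923
GENE_LENGTH_BP = 2187
PROTEIN_LENGTH_AA = 728


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the table if the assumptions hold.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is internally consistent: 19923 - 17737 + 1 = 2187 bp, and 2187 / 3 - 1 = 728 aa.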
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000973096 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATGA AAAGAGTTTT TTCTTCATTA ATGGCACTTG GACTAGCTAG TACTCTAGTT GCTTGTGGCG GTGGAGAAAA TAAAGACAAC AAAGCTGCTA ATAATGACGC AGACACAAAA GCTGAAGATA AAAACAAAGA CGATAAAGAT GCAAAAGAAA GTGAAGGAAC CGGCGAAGAA GGTGCGGAAA ACTTCGAATC ACAAACATCT GACGATACTC TAGTAGTAGG TCTTGGTGAG CTTAACGGTG ACTTTCTACA AGGATGGACC AACAACTCTG GAGATGTTAA AGTTAGAAAA TATCTTGGTA TCGAAGGAAA CAACGGCTAC CAAACAGTTG TTCAAGATGA ATCAGGAGCT TGGGTAAACA ACACTGCAGT TCTAGATGGA GAGCCAGTAT CTCAAGATAA CGAAGATGGT TCAAAGACTG TTACATTCAA GATCAAGAAA GACCTTAAAT GGTCTGATGG CGAACCAATC AAGGCAGATG ATTACTTATT CATGTCTCTT CTACACACTC ACCCAGAATA TACCAAACTA ACAGGCTCTA CAAGCCCAGG ATCTGACTCA GTTAAGGGAT ATGAAGCCTA TAAGAAGGGC GATAGCGATG TATTTGAAGG TCTAGAGAAG GTAGATGATT ACACATTTAA GATTACAATA GATGCATCAT TCCTTCCATA CTTCGAAGAA GCGTCTCTAC TAGCTATTCA ACCACACCCA ATGCACTATC TAAACGAGAA CCTAGCTCTT TCTAAAGAGG GTAACAAGCT TGTTGCTAAA GAAGGCTATA AAGTTAGTGA CGAAGAGAAA GAAAACTATG TAAAGAACCT GGATGAACAA ATCAAAAAAC AAAATGAAGA CTTTGAGGAA AACAATCCAG CTCCAGCAGA TGATGCAGCA GAAGAAGATA AGAAGGCTTA CGAAGAAGCT AAAAAAGAAC ACGAAGATGC AATCAAAGAT CTAGAAGAAC GTAAAGCAGG AGATGTAGAT CCTACTCAAC AACTTATCGA TGAAGCTATG CTTAAGGAAG TTAATGAATA CAGACTTAAT CCAGCAGTTG TATCAGGACC TTACAAGTTT GAATCATTTG AAAACAACAT GGTTAAACTA AGCCTAAACG AAAACTATGT TGGAAACTTC AAGGGTGATA AGGCTACAAT TCCTAACGTA ATCCTTCAAA CTGTAAACAA AAATATTGCT GTAGACTTAC TTGAAAATGG AGATATCGAC CTTTGGGAAG AAGAATCTGA AGGTGGTCCA ATCGACAGAA TGAGAGAAGC TGCTGATAGT GGAAAGATCG GTGGATATAA CACATTCGAA AGAAACGGTT ATGGTAACGT AACATTCCTA ACAGACAGAG GAAGTACAAA ATACAAAGAA GTTAGACAAG CTATAGCTCA CCTAATGGAT AGAAACAGCT TCGTACAATC CTTCGCGGGT GGATACGGTG TTGTAACTAA CGGTATGTAC GGTAGCAGCC AATGGATGTA TAAGGAAAGA GGAGCAGATC TTGAAGGTAA GTTAATCAAC TACCAAATGA ACTTAGATCA AGCTAATGCC CTTCTCGACA AAACACCTTA CAAGTTCGAG TCTGATGGAA CTACACCTTG GGATAAGACT AAGGCTGATG AAGCTTTCGC ATCTAACCCA GATGGATTTG ACTATTATAG ATACGATGAA AATGGTAAGA AGCTTGTAGT TAACCAATAC GGTTCTGATG AATCACCAAT TACAACATTA ATCTCTAACC AAGTACCAAA CAATGCTAAG CAAGTTGGTA TGGAATACAA TGTTACAGCT 
GGTTCATTCG CAACTCTTCT AAACTACTAC TACTATCCAG AAGAAGACCC AGAATATACA GTATTCAATA TGGGTACAAA CTTCGGTACA CCATTTGACC CATGGTACGC TTATAACTCT GAAGGACCTT ACAACTATAC TAAGACTAAT GATCCAAAGG CTGATGAGTT GACAGTTAAA CTACGTAAGA CACCTGCTGA TAAGAAGGAT GAATACCTAG ATAACTGGGA AGAATTCCAA ATCTGGTACA ATGATTACCT ACCAGAAATC CCACTTTACG CTAACCAATA TCACACAGGT TACACAAAGA GAGTTAAGGG ATTCGATGTT AATACACCAG TATGGCAATC AGAAGATCAA ATAAACGCTC TTAGTCTAGA AAACTAA
|
Protein sequence | MKMKRVFSSL MALGLASTLV ACGGGENKDN KAANNDADTK AEDKNKDDKD AKESEGTGEE GAENFESQTS DDTLVVGLGE LNGDFLQGWT NNSGDVKVRK YLGIEGNNGY QTVVQDESGA WVNNTAVLDG EPVSQDNEDG SKTVTFKIKK DLKWSDGEPI KADDYLFMSL LHTHPEYTKL TGSTSPGSDS VKGYEAYKKG DSDVFEGLEK VDDYTFKITI DASFLPYFEE ASLLAIQPHP MHYLNENLAL SKEGNKLVAK EGYKVSDEEK ENYVKNLDEQ IKKQNEDFEE NNPAPADDAA EEDKKAYEEA KKEHEDAIKD LEERKAGDVD PTQQLIDEAM LKEVNEYRLN PAVVSGPYKF ESFENNMVKL SLNENYVGNF KGDKATIPNV ILQTVNKNIA VDLLENGDID LWEEESEGGP IDRMREAADS GKIGGYNTFE RNGYGNVTFL TDRGSTKYKE VRQAIAHLMD RNSFVQSFAG GYGVVTNGMY GSSQWMYKER GADLEGKLIN YQMNLDQANA LLDKTPYKFE SDGTTPWDKT KADEAFASNP DGFDYYRYDE NGKKLVVNQY GSDESPITTL ISNQVPNNAK QVGMEYNVTA GSFATLLNYY YYPEEDPEYT VFNMGTNFGT PFDPWYAYNS EGPYNYTKTN DPKADELTVK LRKTPADKKD EYLDNWEEFQ IWYNDYLPEI PLYANQYHTG YTKRVKGFDV NTPVWQSEDQ INALSLEN
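The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline the database uses: it relies on the fact that translation table 11 shares the standard amino-acid assignments (it differs in permitted start codons), and it translates only the first 30 bases of the gene sequence shown above. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI "TCAG" order; table 11 uses the same
# amino-acid assignments as the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence from the record.
GENE_PREFIX = "ATGAAAATGA AAAGAGTTTT TTCTTCATTA"

if __name__ == "__main__":
    print(translate(GENE_PREFIX))    # MKMKRVFSSL
    print(gc_fraction(GENE_PREFIX))  # GC fraction of this prefix only
```

The translated prefix matches the first ten residues of the protein sequence in the record. Applying `gc_fraction` to the full gene sequence, rather than this short prefix, is how one would compare against the GC content field.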