Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apre_0255 |
Symbol | |
ID | 8397029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerococcus prevotii DSM 20548 |
Kingdom | Bacteria |
Replicon accession | NC_013171 |
Strand | + |
Start bp | 287952 |
End bp | 289271 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 644994616 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003152028 |
Protein GI | 257065772 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA AATTATCGTT GTTAATGGCT CTAGTTTTGT CACTAGGAGT ATTCACAGCA TGCGGATCTA ATGAAAACAC AAGTGAAGAA AAGGCAGACA GCAAGGTAGA AGAAAGTGCT GATCAAGTTA AAGATGACAA GAAGGAAGAA GGAAAAGAAG AAGATAAGGC AGGAGGTCAA ACAGAAATAG TATTCTGGCA CGCTATGGGT GGAGGTCAAG GAGAAGCTCT TGAAAATTTG ACCAAGAAGT TTGAAGAAGA AAATCCAAAC ATCAAGGTTA CTCTTCAAGG TCAAGGAAAA TACGGAGACT TAAACCAAAT CCTCGTTGCA TCAATGCAAT CACCTAAAGA CCTTCCAACA ATAACTCAAG CTTATCCTGA CTGGATGCTT CAATTCAAAG ACGCTAATAT GATAGCAGAT CTTACAGACT ATGTTAAAAA AGACATGGAT GATTATGATG ATATCTTGCC AGGAGTAAGA GATGAGTTAG AAAAAGATGG CAAGATTGAA GCTCTACCAT TTAACAAATC AACAGAAGTA TTCTGGTATA ACAAAAATCT TTATGATGAA CTAGGACTAA AAGAGCCTAC AAGTTTTGAA GAGCTAAAAG AAAATGCTAA GAAAATTTAC GAAGCAAAAG GAATCCCTGG GGCAGGATTT GATTCACTAT CAAACTTCTA CCTAACATAT CTAAAAAACA AGGGAATCGA ATTTGATGAA AATCTAGACC CTGCTTCAGC TGAATCTATC GAAGCTGTTG AATATTATCT AGAAGGTATC AAAGAAGGAT ACTTTAGAAT AGCAGGGACA GACCAACACT TATCTGGTCC GTTTGCAAAC GAACAAGTTG GTTCATTTGT AGGATCAAAC GCTGGTGAAG TATATGTAAA AGAAGCTCTA AACGATAAAT TCGAATATGC TGCAGCTCCT TACCCAGCAA AAGAAGCCTT CCAACAAGGT ACAAATATTT ACATGTTTGA CAAGGCAAGT GATGAAGAAA AACAAGCAGC ATTTAAATAC ATGCAATTTT TAGCTAGCAA GGATTCACAA GTTGAATTTG CTATAGCTAC AGGTTACATG CCAGCAAGAA AATCTGCAGT TGAAGATGAA ACTTACAAAT CATCTGATTC AAAAATCGCA CCAATTCTTG ATAAGGCAAG TGAAAAACTA TTCTCTAGAC CACTTGCTCC AGGCAGCCAA CAAGCCTACA ATGACGTAGC AAGTTTACTT GAAAGCATCC TTTCAAATCC AAATGCAGAC GTTAAGGCAG AGCTTGAAGC ATTCGCGCCA CAATTTAAGG CAGACTTTGA AGCTCAATAA
|
Protein sequence | MKKKLSLLMA LVLSLGVFTA CGSNENTSEE KADSKVEESA DQVKDDKKEE GKEEDKAGGQ TEIVFWHAMG GGQGEALENL TKKFEEENPN IKVTLQGQGK YGDLNQILVA SMQSPKDLPT ITQAYPDWML QFKDANMIAD LTDYVKKDMD DYDDILPGVR DELEKDGKIE ALPFNKSTEV FWYNKNLYDE LGLKEPTSFE ELKENAKKIY EAKGIPGAGF DSLSNFYLTY LKNKGIEFDE NLDPASAESI EAVEYYLEGI KEGYFRIAGT DQHLSGPFAN EQVGSFVGSN AGEVYVKEAL NDKFEYAAAP YPAKEAFQQG TNIYMFDKAS DEEKQAAFKY MQFLASKDSQ VEFAIATGYM PARKSAVEDE TYKSSDSKIA PILDKASEKL FSRPLAPGSQ QAYNDVASLL ESILSNPNAD VKAELEAFAP QFKADFEAQ
|
| |