Gene Information |
Locus tag | Apre_0146 |
Symbol | |
ID | 8396897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerococcus prevotii DSM 20548 |
Kingdom | Bacteria |
Replicon accession | NC_013171 |
Strand | + |
Start bp | 172457 |
End bp | 174013 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 644994484 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003151919 |
Protein GI | 257065663 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
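The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS includes a single terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(172457, 174013)
residues = protein_length(length_bp)
print(length_bp, residues)  # 1557 518
```

Under these assumptions the computed values agree with the listed Gene Length (1557 bp) and Protein Length (518 aa).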
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTAG TTAAAAAATT TGCTTACACA TCAATGGCTC TAATGCTTGC TCTAGGCATT TCAGCTTGCG GAAACAATGA CGGAAATGTA GAAGATGAAA ACAAAACCGA AACTAGTGAA AGCGTAGAAA ACAAAGAAAA CAAAGACGAT AAAAAAGATA AAAATGCTAA AGATGACGAA GTTCCTACCT TAACTTATCT TAATATAGGA ACCCCGCAAG CAGGAACGGA AGAGACAGTA AGTAAGATTA ACGAATACCT TGATGAAAAA GAAGCTGGTT ATCACTTAAA TCTAATCTTC TATGATTGGG GAGATTATGA ACAAAGACTT CAACTTGCTT CATCAACAGG AGAAGATTGG GATCTAGCCT TTACTGCAAG CTGGGCAGGA CCTTACAAGA CTCTAGTATC ACAAAACGCC TTGATGGATC TAACTGACTT AATGGAAGGC AAGGAATTTG TTGATTTAAT CAATCCTGAT ATGCTAAAAG GTGTTAGTGT AGACGGCAAA GTTTACGGAA TCCCTGCAGC ATATCCTGGA GTTGTTGCAG CTAACCAATT TGTATGGACA AAGAGCATGG TTGATAAATA TAATATCGAC TACAAGAATA TCCAAAATAT AGACCAATTA GAGCCTATCT TTGAAGAAGT AAAAGAAAAA GAAGGTATGC AATATCCATT TGGCGTATCT AAGGACTTCT TGTTCGCTAT GCCAGAGCCT GTATATCAAG TTACAGATGG GGTAGCTGTA AGAGAAGAAG ATGGCAAGCT TAAGGCTTAC AACTTATATG CGGATGATCT TTATAAAGAA CAAATCATGA AGATGAAAGA TTATATGGAC AAAGGATACA TCTCACCATC AGCTCCACAA GTTGAGCCAG GAACAGTAAT GCCAGAAAAC GAAGTTCTAC TAACAGAAGG TGAAGGAGAA CCAGGATCTG CCGCTATATG GTCAGTTGCA CCACGTAACG AAGTAGTATC AAATATAATT GGAGATAAGG TTCTTATATC TAACGACAAG GCTACAGGAA AGATGATTTC CATCAACTCA CAAACTGATA AGGCAGAACT TGCAATGGAC TTTATCAACA GAATGTTCAC AGATAAAAAC CTACAAGACA TGCTATCTTA CGGTGTTGAA GGCAAAAACT TCGAGTACAA AGACGGAAAA GTTGTAAAAC ACAATAAGGA TTCTGATGGA ATAGATAACG ATTACGATGT ACCATCATTT ACCTTCCTAT GTGCATTTAA CAGAACTCCA CTAGAGGGAG CTCCTGGAAT TGGCGATGAA GAATTCGATA AGGAAGCAAA AGAATTTGAA GATAAACTAG TAGCATCTCC AGTATTAGGA TTTACCCTAG ACGAATCAAA GATAGCAACA GAAATAGCAA ATGTTCAACA AACACAAAGC GAATACACCA TCAACCTTAA GACAGGTGCC TTTGACGAAA GCTACTATCA AGAATTCTTA GATAAACTAA ATACAGCAGG AATCGAAAAG GTAATAGAAG AAGTACAAGC ACAATTAGAT AATTGGGAAG GCGCTGATAA GCAATAA
|
Protein sequence | MNLVKKFAYT SMALMLALGI SACGNNDGNV EDENKTETSE SVENKENKDD KKDKNAKDDE VPTLTYLNIG TPQAGTEETV SKINEYLDEK EAGYHLNLIF YDWGDYEQRL QLASSTGEDW DLAFTASWAG PYKTLVSQNA LMDLTDLMEG KEFVDLINPD MLKGVSVDGK VYGIPAAYPG VVAANQFVWT KSMVDKYNID YKNIQNIDQL EPIFEEVKEK EGMQYPFGVS KDFLFAMPEP VYQVTDGVAV REEDGKLKAY NLYADDLYKE QIMKMKDYMD KGYISPSAPQ VEPGTVMPEN EVLLTEGEGE PGSAAIWSVA PRNEVVSNII GDKVLISNDK ATGKMISINS QTDKAELAMD FINRMFTDKN LQDMLSYGVE GKNFEYKDGK VVKHNKDSDG IDNDYDVPSF TFLCAFNRTP LEGAPGIGDE EFDKEAKEFE DKLVASPVLG FTLDESKIAT EIANVQQTQS EYTINLKTGA FDESYYQEFL DKLNTAGIEK VIEEVQAQLD NWEGADKQ
|
| |
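The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, assuming the codon assignments of NCBI translation table 11 (which are the same as the standard code for elongation codons) and ignoring alternative start codons; `translate` and `gc_fraction` are hypothetical helper names introduced for illustration. Whitespace is stripped first because the record prints the sequence in blocks of ten.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases and normalize to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last blocks of the gene sequence from the record above.
print(translate("ATGAATTTAG TTAAAAAATT TGCTTACACA"))  # MNLVKKFAYT
print(translate("GGCGCTGATA AGCAATAA"))               # GADKQ
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the listed protein sequence and the GC content field.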