Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apre_0387 |
Symbol | |
ID | 8397161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerococcus prevotii DSM 20548 |
Kingdom | Bacteria |
Replicon accession | NC_013171 |
Strand | + |
Start bp | 438242 |
End bp | 439753 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 644994745 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003152157 |
Protein GI | 257065901 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000219856 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA GAAACATATT CGTAAGCCTT GCCCTAGCAG GTCTACTGCT AACTTCTTGC GGCAATCAGG ATAAGAAGGA AAATACCAAA ACTAGCGATG CGCCACTTAA GGTAGCCATG TATACTGAAA TCGATTCGCT CGATCCTTTC AATGCTACAG CAGGAGATAC CAAGACAATC ATGGATCAAG TATTTGATGG ACTCTTCGAT GTGGATGAAG ATGGAAATCT AGTTCCTGAC CTTTGTGAAT CTTACGAGAT AAGTGAAGAT GGCCTTACTT ATGATTTCAA ATTAAAAGAA GGGGTCAAAT TCCACAATGA TAAGGACTTT ACTGCAGATG ATGTCTACTA CACTTATGAT ATCCTAGCAG GACTAACAAG TGGAGAGCCA AAGTCTTCTA AGTTTGCCCA AATAGAATCA ATGGAAGTAG CCTCTCCTAC AGAAATTAAA ATCAAATTAA AGGAAAAATC TAATTCCTTT ATCTATCTAA ATACTCAACC AATAGTTCAA AAAGACTATG AAGACAATCA GACAAAGCCA ATAGGAACTG GTCCTTTCGA GTTTGTTTCC TACACACCAG GTGAGGGCAT GAAGCTAAAA AGATTTGACG ACTATCACAG AAAAGACCAT ATTGCGAAGT TTGCTGACGT AGAGATTCTA AGAATTGCCG ACAGACAAAC CCTAATCATG GCCCTAAACA ACAAGGACGT TGACCTTGCT ACAGGACTTA CTAATGATGA GCTAAGCCAA ATAGAAGAAA CTTGCGATAT CCATTCCTTC CCACAAAACT TAGTTCAAGT CCTAGGTCTT AATAACGATG TTAAGCCTTT CGATGATATG AAGGTAAGAC AAGCTATTGC TTACGCAATC GATAAGGATG AAATCATAAA CACAGCGGCA GGTGGCAGGG CAAGTAAGCT AGTATCAAAC TTCTCCCCAG CCCTTAAAGA ATATTACAAT GATATGGAAG AAAAATACCC TTACAATCCG GAAAAGGCCA AGGAACTTTT AAAAGAAGCA GGCCTTGAGG ACGGATTTTC TGTAAAACTA ACTGTGCCAA GTGATTACAA ATATCATATG GACACAGCAG AGCTAATCCA AGCCCAACTA GGTAAGGTAG GAATTGATGT AACACTTGAT CCAATCGAGT TTTCTACTTG GCTAAGCAAG GTATACAAGG ATAAGGACTA TGAGGCTACA GTTTCAGGCT TTGTAGGTTA TGTTGATCCA ATCAGAGTAA TCGATAGATA TGTATCAACT AATGACAAAA ACTTCATCAA CTACAAGTCA GAAGGTTATG ACGAGGCTAT AAAGGCAGCC CAAAGTGCAG ATAATAAGGA AGATATAATC CAAAATGTAA AAGATGCCCA AGAATTTATG GCAGAAGATG CAGGATCAGT CTTCCTAACA GACCCTGACA ACAACCAAGC CCTAAACAAG GACCTCACAG GACTTAAGTC CTATCCAGTA CAAAAGATAA ATCTAGAGGA TATAGAAAAG AAAAATGACT AA
|
Protein sequence | MKKRNIFVSL ALAGLLLTSC GNQDKKENTK TSDAPLKVAM YTEIDSLDPF NATAGDTKTI MDQVFDGLFD VDEDGNLVPD LCESYEISED GLTYDFKLKE GVKFHNDKDF TADDVYYTYD ILAGLTSGEP KSSKFAQIES MEVASPTEIK IKLKEKSNSF IYLNTQPIVQ KDYEDNQTKP IGTGPFEFVS YTPGEGMKLK RFDDYHRKDH IAKFADVEIL RIADRQTLIM ALNNKDVDLA TGLTNDELSQ IEETCDIHSF PQNLVQVLGL NNDVKPFDDM KVRQAIAYAI DKDEIINTAA GGRASKLVSN FSPALKEYYN DMEEKYPYNP EKAKELLKEA GLEDGFSVKL TVPSDYKYHM DTAELIQAQL GKVGIDVTLD PIEFSTWLSK VYKDKDYEAT VSGFVGYVDP IRVIDRYVST NDKNFINYKS EGYDEAIKAA QSADNKEDII QNVKDAQEFM AEDAGSVFLT DPDNNQALNK DLTGLKSYPV QKINLEDIEK KND
|
| |