Gene Information |
Locus tag | Apre_0039 |
Symbol | |
ID | 8396786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerococcus prevotii DSM 20548 |
Kingdom | Bacteria |
Replicon accession | NC_013171 |
Strand | + |
Start bp | 47341 |
End bp | 48642 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644994376 |
Product | pyrimidine-nucleoside phosphorylase |
Protein accession | YP_003151815 |
Protein GI | 257065559 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
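The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon (the function names are illustrative and not part of the record):

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 47341
END_BP = 48642


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, "bp;", protein_length(length), "aa")
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields of the record.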
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0183104 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGATTTA TTGATATAAT TGAAAAGAAA AAACTTAAAG AAGAACTAAC AGATGAAGAA ATCCAATTTT TTATAGATGG AGTAACTGAC GATTCAATCG AAGATTATCA AATAGCAGCC CTCCTCATGG CGATCAGGCT TAACGGCATG ACAGAGCATG AGACAGCCAA GCTTGCAGAA GCTATGATGC ACTCTGGAGA TGTTATTGAC CTATCAGAAA TAGAAGGAAT CAAATCTGAC AAGCACTCAA CAGGTGGTGT TGGAGATAAG ACCTCAATGG CACTCGGTGC AATGGTTGCT GCTTGTGGCC TTAAGCTTGC CAAGATGAGT GGTAGGGGCC TAGGACATAC TGGTGGAACC CTTGATAAAC TAGAATCTAT AGAAGGATTT AACATCTCCC TTACCGAAGA AGAATTCAAA AAGCAAGTAA ACGAAATAGG CCTTGCCATA ATTGGTCAAA CAGGAGACTT GGTCCCAGCT GATAAGAAGC TCTACGCCCT AAGGGATGTA ACAGCAACAG TAGATTCGAT TCCGCTAATT GCTTCATCAA TTATGTCTAA GAAACTTGCT TCTGGATCAG ATACCATATT ACTCGATGTA AAATACGGTG AAGGTGCCTT CATGCACACA GTAGAAGATG CTAAGAAACT TGCCGAAGCT ATGATTTCAA TCGGTAAGAA ACTAGGCAAA AATACTATGG CCATGATTAC AGATATGAAC CAACCTTTAG GAAATACTAT AGGTAATGCC CTTGAAGTAA GAGAAGCTAT AGAGACTGTA AGGGGACATG GACCAAAAGA CTTCACAGAA CTTTGTATGT GTGCTGGGGA GATTATGCTC ATGCAAGCAG ACAAGGCAGA GACTAAAGAA GAAGCTAGAA AGATGTTAGA AGAAGCAATC TCATCTGGAA AAGCCTACGA AAAGCTAGAA AAAATGGTAG AATACCAAGG CGGAAATGTA GAACAAATCA GAAACACAGA CCTCCTTCCT CAAGCGAAAT TCAAGACAGA AATGTTATCT AAAGAAGAAG GCTACATTGA AAATATCCAC TCAATGGGAC TTGGTATCCA AGCGATGAAG CTTGGAGCTG GAAGAGCTAA GAAAACTGAC CCTATAAACT ACGCTGTTGG TCTCGAGATG AATGCCAAAA AGGGCGACTA TGTCAAAAAG GGCGACCTTC TCTGTACAGT ATATCACGAC GAAGAATTAA CAGAAGAGTG GAAAAAAGAT TTCTATGATA CCTTTACCTT TACAGACAAG GAAGTAGAGC CAATTCCAAT AGTAGAAGAA ATTTTAAAAT AA
Protein sequence | MRFIDIIEKK KLKEELTDEE IQFFIDGVTD DSIEDYQIAA LLMAIRLNGM TEHETAKLAE AMMHSGDVID LSEIEGIKSD KHSTGGVGDK TSMALGAMVA ACGLKLAKMS GRGLGHTGGT LDKLESIEGF NISLTEEEFK KQVNEIGLAI IGQTGDLVPA DKKLYALRDV TATVDSIPLI ASSIMSKKLA SGSDTILLDV KYGEGAFMHT VEDAKKLAEA MISIGKKLGK NTMAMITDMN QPLGNTIGNA LEVREAIETV RGHGPKDFTE LCMCAGEIML MQADKAETKE EARKMLEEAI SSGKAYEKLE KMVEYQGGNV EQIRNTDLLP QAKFKTEMLS KEEGYIENIH SMGLGIQAMK LGAGRAKKTD PINYAVGLEM NAKKGDYVKK GDLLCTVYHD EELTEEWKKD FYDTFTFTDK EVEPIPIVEE ILK
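Both sequences above are printed in space-separated blocks of ten characters. A small sketch, using hypothetical helper names, of how the blocks could be joined before further analysis and how a GC fraction could be computed; only the first two blocks of the gene sequence are used here for brevity:

```python
def ungroup(seq: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(seq.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    bases = ungroup(seq).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First two display blocks of the gene sequence, copied from the record.
first_blocks = "ATGAGATTTA TTGATATAAT"
print(ungroup(first_blocks))
print(gc_fraction(first_blocks))
```

Applying `gc_fraction` to the complete gene sequence would be the way to compare against the GC content field; the value for the full sequence is not computed here.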