Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0147 |
Symbol | |
ID | 8412993 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 167367 |
End bp | 168692 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 645021717 |
Product | pyrimidine-nucleoside phosphorylase |
Protein accession | YP_003179174 |
Protein GI | 257783957 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATGT ACGATGTAAT TGAGAAGAAG CGCGATGGAG GCGAGCTTAC CGACGCAGAG ATTGATTACT TTGTCTCGGG TTATGTAGCT GGTGATATTC CCGATTACCA GGCTTCCGCA CTTGCTATGG CCATCTTTTA TAAGGGCATG ACCGCGCACG AGACGGCTCA TCTGACTATG GCGATGGCTG AGTCTGGCGA TATGATGGAC CTCTCGGCAA TTCCTGGTAT CAAGGTTGAT AAGCACTCTA CCGGCGGCGT TGGCGATAAA ACCACGCTGG TAGTAGCTCC ACTTGTTGCA TCTCTGGGTG TGAAAGTTGC TAAGATGAGC GGTCGCGGAC TGGGTCACAC AGGCGGTACG CTTGACAAGC TTGAGGCAAT TCCAGGACTT TCTATTGAGA TTTCCGAGCC CGACTTTTTC AAGCAGGTTT CCGAGATTGG TGTTGCTGTT GCAGGTCAGA CGGGCAATCT TGTCCCTGCC GATAAGAAAC TCTATGCGCT GCGCGACGTT ACTGCAACCG TTGACTCGGT GCCTCTGATT GCGTCAAGTA TCATGAGTAA GAAGATTGCT TCTGGCTCTG ATTGCATTTT GCTGGACGTT AAGTGTGGAT CTGGTGCCTT TATGAAGGAT GTTGATTCTG CAATTGAGCT GGCAGACGCC ATGGTTTCTA TTGGTGAACA CGTTAACCGT ACTACTGCTG CGTTAATTAC CGGTATGGAT CGTCCTCTGG GCAAAAACGT TGGTAACTCC CTTGAGGTCA TTGAGGCAGT GGCAACGCTC AAGGGCGAGG GCCCTAAGGA TCTGACCGAC GTCTGCATTG AGCTTGCTGC AAACATGCTT AACCTTGCAG GTAAGGGAAG TGTTGATGAC TGCCGTAAGC TGGCTCGCCA GCAGATTGCC AACGGCGAGG GTCTGGCCAA GCTAGCTCAG ATGGTCAAAG CTCAGGGTGG TACCGACGAG GTTATTTTTG ATACCACCAA GTTTGAGGCT GCTCCATTCC GTCGTGATAT TGTGTCCGAG ACCAGTGGAT ATATCACTTC CATGAATGCT GAGCTGGTTG GTATTTCCTC CGTTGCTCTG GGAGCCGGTC GCGAGAAAAA GGGTGATCCA ATTGACCCAT CCGCCGGTAT TATCCTCGAG CGCAAGACGG GCGATTATGT CGAGAAGGGC GATGTCATCG CAACGCTTCT GACTGGTGAC GAAAGCCGTC TTGATGAGGG CGAGCGCATC TTCCGTGAGG CTCTAGCCTT TGGTGAGAGT GCACCTGAGT TGGAGCCATT GTTCTTTGCA CGCGTCTCCA AGGACGGTGT TGAGCGTTTC GCGTAA
|
Protein sequence | MRMYDVIEKK RDGGELTDAE IDYFVSGYVA GDIPDYQASA LAMAIFYKGM TAHETAHLTM AMAESGDMMD LSAIPGIKVD KHSTGGVGDK TTLVVAPLVA SLGVKVAKMS GRGLGHTGGT LDKLEAIPGL SIEISEPDFF KQVSEIGVAV AGQTGNLVPA DKKLYALRDV TATVDSVPLI ASSIMSKKIA SGSDCILLDV KCGSGAFMKD VDSAIELADA MVSIGEHVNR TTAALITGMD RPLGKNVGNS LEVIEAVATL KGEGPKDLTD VCIELAANML NLAGKGSVDD CRKLARQQIA NGEGLAKLAQ MVKAQGGTDE VIFDTTKFEA APFRRDIVSE TSGYITSMNA ELVGISSVAL GAGREKKGDP IDPSAGIILE RKTGDYVEKG DVIATLLTGD ESRLDEGERI FREALAFGES APELEPLFFA RVSKDGVERF A
|
| |