Gene Apar_0147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0147 
Symbol 
ID8412993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp167367 
End bp168692 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content52% 
IMG OID645021717 
Productpyrimidine-nucleoside phosphorylase 
Protein accessionYP_003179174 
Protein GI257783957 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATGT ACGATGTAAT TGAGAAGAAG CGCGATGGAG GCGAGCTTAC CGACGCAGAG 
ATTGATTACT TTGTCTCGGG TTATGTAGCT GGTGATATTC CCGATTACCA GGCTTCCGCA
CTTGCTATGG CCATCTTTTA TAAGGGCATG ACCGCGCACG AGACGGCTCA TCTGACTATG
GCGATGGCTG AGTCTGGCGA TATGATGGAC CTCTCGGCAA TTCCTGGTAT CAAGGTTGAT
AAGCACTCTA CCGGCGGCGT TGGCGATAAA ACCACGCTGG TAGTAGCTCC ACTTGTTGCA
TCTCTGGGTG TGAAAGTTGC TAAGATGAGC GGTCGCGGAC TGGGTCACAC AGGCGGTACG
CTTGACAAGC TTGAGGCAAT TCCAGGACTT TCTATTGAGA TTTCCGAGCC CGACTTTTTC
AAGCAGGTTT CCGAGATTGG TGTTGCTGTT GCAGGTCAGA CGGGCAATCT TGTCCCTGCC
GATAAGAAAC TCTATGCGCT GCGCGACGTT ACTGCAACCG TTGACTCGGT GCCTCTGATT
GCGTCAAGTA TCATGAGTAA GAAGATTGCT TCTGGCTCTG ATTGCATTTT GCTGGACGTT
AAGTGTGGAT CTGGTGCCTT TATGAAGGAT GTTGATTCTG CAATTGAGCT GGCAGACGCC
ATGGTTTCTA TTGGTGAACA CGTTAACCGT ACTACTGCTG CGTTAATTAC CGGTATGGAT
CGTCCTCTGG GCAAAAACGT TGGTAACTCC CTTGAGGTCA TTGAGGCAGT GGCAACGCTC
AAGGGCGAGG GCCCTAAGGA TCTGACCGAC GTCTGCATTG AGCTTGCTGC AAACATGCTT
AACCTTGCAG GTAAGGGAAG TGTTGATGAC TGCCGTAAGC TGGCTCGCCA GCAGATTGCC
AACGGCGAGG GTCTGGCCAA GCTAGCTCAG ATGGTCAAAG CTCAGGGTGG TACCGACGAG
GTTATTTTTG ATACCACCAA GTTTGAGGCT GCTCCATTCC GTCGTGATAT TGTGTCCGAG
ACCAGTGGAT ATATCACTTC CATGAATGCT GAGCTGGTTG GTATTTCCTC CGTTGCTCTG
GGAGCCGGTC GCGAGAAAAA GGGTGATCCA ATTGACCCAT CCGCCGGTAT TATCCTCGAG
CGCAAGACGG GCGATTATGT CGAGAAGGGC GATGTCATCG CAACGCTTCT GACTGGTGAC
GAAAGCCGTC TTGATGAGGG CGAGCGCATC TTCCGTGAGG CTCTAGCCTT TGGTGAGAGT
GCACCTGAGT TGGAGCCATT GTTCTTTGCA CGCGTCTCCA AGGACGGTGT TGAGCGTTTC
GCGTAA
 
Protein sequence
MRMYDVIEKK RDGGELTDAE IDYFVSGYVA GDIPDYQASA LAMAIFYKGM TAHETAHLTM 
AMAESGDMMD LSAIPGIKVD KHSTGGVGDK TTLVVAPLVA SLGVKVAKMS GRGLGHTGGT
LDKLEAIPGL SIEISEPDFF KQVSEIGVAV AGQTGNLVPA DKKLYALRDV TATVDSVPLI
ASSIMSKKIA SGSDCILLDV KCGSGAFMKD VDSAIELADA MVSIGEHVNR TTAALITGMD
RPLGKNVGNS LEVIEAVATL KGEGPKDLTD VCIELAANML NLAGKGSVDD CRKLARQQIA
NGEGLAKLAQ MVKAQGGTDE VIFDTTKFEA APFRRDIVSE TSGYITSMNA ELVGISSVAL
GAGREKKGDP IDPSAGIILE RKTGDYVEKG DVIATLLTGD ESRLDEGERI FREALAFGES
APELEPLFFA RVSKDGVERF A