Gene CPF_2062 details


Gene Information

Locus tag: CPF_2062
Symbol: pdp
ID: 4203206
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 2301785
End bp: 2303086
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 32%
IMG OID: 638082927
Product: pyrimidine-nucleoside phosphorylase
Protein accession: YP_696491
Protein GI: 110799911
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02644] pyrimidine-nucleoside phosphorylase
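
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the CDS is unspliced and ends in a stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2301785, 2303086)
print(length)                     # 1302
print(protein_length_aa(length))  # 433
```

Both values agree with the Gene Length and Protein Length fields of this record.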


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATGT ATGATTTAAT ATTAAAAAAG AGAAATGGTG GGGAATTAAG CACTGAAGAA 
ATAAACTTTT TTGTTGATAA ATATACAAAT GGTGAAATAC CAGATTATCA AGTAGCAGCC
TTATTAATGG CTATATATTT CCAAAAAATG AATAAGAGAG AAACTTCAGA TTTAACTATG
GCTATGGTTA ATTCAGGAGA TATATTAGAT TTAAGTGAAA TACACGGTAT AAAAGTAGAT
AAGCATTCAA CTGGTGGGGT TGGTGATACT ACTACTTTAG TTTTAGGACC TATGGTTGCA
GCTTTAGGAA TTCCTGTAGC TAAAATGTCA GGAAGAGGTC TTGGACATAC TGGTGGAACA
ATAGATAAAT TAGAAAGTTT TGATGGATTC TCAGTAGAAA TGACTAAGGA TCAATTTATA
AATAATGTTA ATAATATAAA ATTAGCAGTT GGTGGACAAA CAGGAGATTT AGCTCCAGCT
GATAAAAAAC TTTATGCTTT AAGAGATGTT ACTGGAACTG TTGATAATGT ATCATTAATA
GCTTCAAGTA TTATGAGTAA AAAAATTGCT GCAGGCGCTG ATGCCATAGT ATTAGACGTT
AAGGTTGGAG ATGGAGCATT TATGAAAACT CCAGAGGCAG CTAGAGAACT TGCAACTGAA
ATGGTTGGAA TAGGTAAACA TGTTGGAAGA AATACTGTTG CCATAATTTC AGATATGGAT
CAACCTTTAG GATTTGCTAT AGGTAATGCA TTAGAAGTTA AAGAAGCCAT AGAAACTTTA
AGAGGAAATG GTCCTAAGGA TCTTTTAGAG CTTTGCTTAA CACTTGGAAG CAACATGGTT
GTTTTAGCAG GTGCTGCTAA AGATACTGAT GAAGCAAGAA AAATGTTAAT GGAAACAATA
ACTTCAGGAA AAGCAATTGA AAAATTAAAA GAATTTGTAA AAGCTCAAGG TGGAGATGCA
TCAGTTATTG ATGATATATC AAACTTCCAT AATGCTAAAT ATGTAATTCC AGTTAAAGCT
AATAAATCAG GGGTTGTAAG CAAAATACAT GCTGAAAACA TAGGATTAGT TGCTATGGAA
CTTGGAGCAG GAAGAGCTAC TAAAGAAAGT ATTATAGATT TAGCTGTTGG TATTGTTCTT
CAAAAGAAAA GAGGAGACAA AGTTAATGAA GGAGATATAA TAGCATATAT TCATGCTGAT
GATGAAGAAA AAGGTAAGAA AGCAGTAGAT GGAATATTAG CTAATTACGA AATATCAGAT
TCAGTAAAAG ATATTCCATT AATATATGAT ATAGTTAAAT AG
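
The GC content field of a record like this can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C among the unambiguous bases; the database may round or count differently. The helper name gc_content is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Spaces, newlines and any other characters are ignored, so the
    10-base blocks shown in the record can be pasted in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence, as an example input.
print(gc_content("ATGAGAATGT ATGATTTAAT"))
```

Passing the full 1302 bp gene sequence to the same function gives a value that can be compared with the GC content field of this record.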
 
Protein sequence
MRMYDLILKK RNGGELSTEE INFFVDKYTN GEIPDYQVAA LLMAIYFQKM NKRETSDLTM 
AMVNSGDILD LSEIHGIKVD KHSTGGVGDT TTLVLGPMVA ALGIPVAKMS GRGLGHTGGT
IDKLESFDGF SVEMTKDQFI NNVNNIKLAV GGQTGDLAPA DKKLYALRDV TGTVDNVSLI
ASSIMSKKIA AGADAIVLDV KVGDGAFMKT PEAARELATE MVGIGKHVGR NTVAIISDMD
QPLGFAIGNA LEVKEAIETL RGNGPKDLLE LCLTLGSNMV VLAGAAKDTD EARKMLMETI
TSGKAIEKLK EFVKAQGGDA SVIDDISNFH NAKYVIPVKA NKSGVVSKIH AENIGLVAME
LGAGRATKES IIDLAVGIVL QKKRGDKVNE GDIIAYIHAD DEEKGKKAVD GILANYEISD
SVKDIPLIYD IVK
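
The protein sequence follows from the gene sequence through the genetic code named in the Translation table field. The sketch below builds the codon table by hand using only the standard library. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The function name translate_cds is illustrative, and the code assumes the sequence is given in the coding orientation, as it is in this record.

```python
from itertools import product

# Codon order TCAG x TCAG x TCAG, as in the NCBI genetic code listings.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence up to, but not including, the first stop.

    Whitespace is removed first, so the blocked layout of the record is
    accepted as-is. Trailing bases that do not fill a codon are ignored.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGAGAATGT ATGATTTAAT ATTAAAAAAG AGAAATGGTG GGGAATTAAG CACTGAAGAA"
print(translate_cds(first_line))  # MRMYDLILKKRNGGELSTEE
```

The output matches the first 20 residues of the protein sequence listed above. A library such as Biopython offers an equivalent translation routine, and the hand-built table is used here only to keep the example self-contained.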