Gene Pden_2097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPden_2097 
Symbol 
ID4579894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParacoccus denitrificans PD1222 
KingdomBacteria 
Replicon accessionNC_008686 
Strand
Start bp2100310 
End bp2101296 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content71% 
IMG OID639769427 
ProductHhH-GPD family protein 
Protein accessionYP_915885 
Protein GI119384829 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.291413 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCCGG GTCGCGGCAC TGCCGACCCC TATCGCGTCT GGCTGTCCGA GGTCATGCTG 
CAACAGACCA CCGTCGCCGC GGTGAAAGCC TATTTCGAGC GGTTCACCAG CCTTTGGCCC
ACGGTCCACG ACCTTGCCGC CGCCGAGGAT GCGCAGGTGA TGGCGGAATG GGCCGGGCTC
GGCTATTACG CCCGCGCCCG CAACCTGATC GCCTGCGCCC GCGCGGTCTC GGCCATGGGC
GCATTCCCCG ACACGCGCGC GGAACTGGCA GACCTGCCCG GCATCGGCGC CTATACCTCG
GCCGCCATCG CCGCCATCGC CTTCGACCGG CCCGAAACAG TGGTGGACGG CAATGTCGAG
CGCGTCGTCG CGCGGCTTTT CGCGGTCGAG ACGCCCCTGC CCGCCGCCAA GCCGGAACTT
GTCGCACTGG CCGCCGGCCT GACACCGTCG GAGCGGCCGG GCGATTTCGC ACAGGCGATG
ATGGACCTGG GCGCGACCAT CTGCACGCCC AGAAGCCCGG CCTGCGGCAT CTGCCCGGTC
ATCGACCATT GCGCGGCGCG CGCGCAGGGC ATCGCCGCCG ACCTGCCCCG CAAGGCGCCC
AAGAAGGCCA AGCCCCTGCG TCAAGGCATC GTCTGGATCG GTTTTTCCAA GGGGGCGGTC
CTGGTCGAGA CCCGGCCGGA CCGGGGCCTT CTGGGCGGTA CGCTGGCCTT TCCCTCGACC
GGCTGGGACG GCTCGGACCT GCCGCCGCCT GCGCCGGGCG ACTGGCAGGA GATCGGACTC
GTCCGCCATG TCTTCACCCA TTTCGCGCTG GATCTGACGG TAATGACCGC CCGCTTGACC
GCCGCGCCGG AACGCGGCAA TCTGGCGCCG CTCAGCGAGT TCAGACCTGC CGCCCTGCCC
GGGCTGATGC GCAAGGCCTG GGCGCTTGCC CGCCCCGAAA CCGCCACCCT GCCCGCGACA
ACCGGTCGCC GTGGCAAGAC CGGGTAA
 
Protein sequence
MPPGRGTADP YRVWLSEVML QQTTVAAVKA YFERFTSLWP TVHDLAAAED AQVMAEWAGL 
GYYARARNLI ACARAVSAMG AFPDTRAELA DLPGIGAYTS AAIAAIAFDR PETVVDGNVE
RVVARLFAVE TPLPAAKPEL VALAAGLTPS ERPGDFAQAM MDLGATICTP RSPACGICPV
IDHCAARAQG IAADLPRKAP KKAKPLRQGI VWIGFSKGAV LVETRPDRGL LGGTLAFPST
GWDGSDLPPP APGDWQEIGL VRHVFTHFAL DLTVMTARLT AAPERGNLAP LSEFRPAALP
GLMRKAWALA RPETATLPAT TGRRGKTG