Gene Pden_1998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPden_1998 
Symbol 
ID4578667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParacoccus denitrificans PD1222 
KingdomBacteria 
Replicon accessionNC_008686 
Strand
Start bp2004932 
End bp2006002 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content72% 
IMG OID639769327 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_915786 
Protein GI119384730 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.887934 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATGG GACTGACCTT TCTGGGCATC GAAAGCAGCT GCGACGATAC CGCCGCGGCG 
GTGGTGCGCG ACGACCGCAG CATCCTTGCC TCGGTGGTGG CAGGCCAGGC GGCGCTGCAT
GCCGATTTCG GCGGCGTGGT GCCCGAGATC GCCGCCCGCG CCCATGCCGA AAAGCTGGAC
CTCTGCGTCG AGGAGGCGCT GGCCCAGGCC GGGCTGCGCC TGTCGGACCT GGACGGCATC
GCCGTCACCG CGGGGCCGGG GCTGATCGGC GGCGTGCTGT CGGGCGTCAT GCTGGCCAAG
GGGCTTGCGG CGGGCACGGG GCTGCCGCTG GTCGGCGTCA ACCATCTGGC GGGCCACGCG
CTGACGCCGC GCCTGACCGA CGGAACCCCT TATCCCTATC TGATGCTGCT GGTCTCGGGC
GGGCATTGCC AGTTCCTGCG CGTGGACGGC CCCGAGGATT TCACACGCCT CGGTGGCACC
ATCGACGATG CGCCGGGCGA GGCTTTCGAC AAGGTGGCAA AGCTGCTGGG CCTGCCACAA
CCGGGGGGGC CCTCGGTCGA GGCGGCCGCG CGGGCGGGTG ATGCACGCCG CTTCGCCCTG
CCCCGGCCGC TGCTGGACCG GCCGGGCTGC GACCTCAGCT TTTCCGGGCT CAAGACCGCC
GTGCTGCGCC AGCGCGACGA ATTGGTGGCA GCACAAGGCG GCCTGCACGA ACAGGACCGC
GCCGATCTTT GCGCCGGCTT CCAGGCGGCG GTGGCCGAGG TTCTGGCCGA AAAGACCCGC
CGTGCCCTGG CGCTGGCCCC CGCCCCGGTG CTGGCCGCGG CCGGCGGCGT CGCGGCCAAC
CAGACCCTGC GCACGGCCTT GCAAGCAGTC GCGGCCGAGG CGGGCGCAAC CTTCCTCGCC
CCGCCGCTGC GGCTTTGCAC CGACAATGCC GCGATGATCG CCTGGGCCGG AATCGAGGCA
TACGAGGCGG GCCGGCGCGA CGGCATGGAT CTGGCCGCGC GCCCGCGCTG GCCGCTGGAC
CAAAGGGCCG CACCCATGCT GGGCGCCGGA AAAAGGGGGG CCAAGGCATG A
 
Protein sequence
MSMGLTFLGI ESSCDDTAAA VVRDDRSILA SVVAGQAALH ADFGGVVPEI AARAHAEKLD 
LCVEEALAQA GLRLSDLDGI AVTAGPGLIG GVLSGVMLAK GLAAGTGLPL VGVNHLAGHA
LTPRLTDGTP YPYLMLLVSG GHCQFLRVDG PEDFTRLGGT IDDAPGEAFD KVAKLLGLPQ
PGGPSVEAAA RAGDARRFAL PRPLLDRPGC DLSFSGLKTA VLRQRDELVA AQGGLHEQDR
ADLCAGFQAA VAEVLAEKTR RALALAPAPV LAAAGGVAAN QTLRTALQAV AAEAGATFLA
PPLRLCTDNA AMIAWAGIEA YEAGRRDGMD LAARPRWPLD QRAAPMLGAG KRGAKA