Gene Pden_4635 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPden_4635 
Symbol 
ID4583140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParacoccus denitrificans PD1222 
KingdomBacteria 
Replicon accessionNC_008688 
Strand
Start bp122749 
End bp123795 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content66% 
IMG OID639771941 
Productpeptidase M42 family protein 
Protein accessionYP_918394 
Protein GI119387360 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0148815 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCGCG ACAGCGAATT GTTCGACCTG ATCGGCGACC TGATCATGTG CCATTCGCCC 
AGCGGGGTCG AGGCGCGGAT CGACGCCTTC CTTCTGCAGC GGCTCTCGGA ACTGGGTATC
GAGGCGGCGC TGGACGCCGC CGGGAACGTC GTCGCACGCA TTCCCGGCAG CGGGGCGGGC
AAGCTGGCGA TCACCGCCCA CAAGGATGAG ATCGGCGCAT CGGTCAGCAC CGTCGGGGAC
GATGGCCGGC TCAGGCTGCG CGCCTTGGGA TCGTCCTTCC CCTGGGTCTA TGGCGAGGGA
ATCGTGGATA TCCTGGGCGA CAACGAGACG ATTCAGGGCG TCCTCAGCTT CGGATCGCGC
CACATCACCC GCGCCTCGCC GCAATATCCG CAGCAAGAGA CCGCGCCGGT CAAATGGTCG
GATGTCTGGG TCGAGACCAA GTGCAGCTGC GACGACATCG CCAAGGCGGG CGTGCGGCCG
GGCTCTCGCG TGCTGGTCGC GAGGCATCGC AAGGCGCCCT ATCGGTTGGG CGACCATATC
GCCGGCTATA CGCTGGACAA CAAGGCCTCG GTCGCGGTCC TGATCGAGCT GGCCAAGCGG
ATCCGCAACC CGGTCTCGGA GATCTGCCTG GTCTTTTCGT CCATGGAAGA GGTCGGTGCC
TGCGGCGCGC TGTATTTCAC CAGAAACGAG CCCGTGGATG CGATCATCGC GCTGGAGATC
GCCCCCCTGT CCGATGAATA CGACATCGTG GATGGCCCGG ACCCGGTGAT CTATGCGCAG
GATGGCTATG GCCTTTACCA TGAGGGGCTG AACGGCCGGA TCGCCGCCGC GGCCGCGCGG
GCGGGGGTCG GACTGCAACG CTCGGTGGTC CATGATTTCG GCAGCGACGC CTCGATCGTG
ATGCGCAACG GGCATGCGCC GCGCGGGGCC TGCCTGGCCT TCCCGACGCA GAACACCCAC
GGATACGAGA TCGCGCGCCT TGCCGCGATC GGGAACTGCG TCGCCGTCCT CGATGAACTT
TGCAAGGGGG ATCTGTCGCA ATGGTGA
 
Protein sequence
MDRDSELFDL IGDLIMCHSP SGVEARIDAF LLQRLSELGI EAALDAAGNV VARIPGSGAG 
KLAITAHKDE IGASVSTVGD DGRLRLRALG SSFPWVYGEG IVDILGDNET IQGVLSFGSR
HITRASPQYP QQETAPVKWS DVWVETKCSC DDIAKAGVRP GSRVLVARHR KAPYRLGDHI
AGYTLDNKAS VAVLIELAKR IRNPVSEICL VFSSMEEVGA CGALYFTRNE PVDAIIALEI
APLSDEYDIV DGPDPVIYAQ DGYGLYHEGL NGRIAAAAAR AGVGLQRSVV HDFGSDASIV
MRNGHAPRGA CLAFPTQNTH GYEIARLAAI GNCVAVLDEL CKGDLSQW