Gene Pden_3547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPden_3547 
Symbol 
ID4582102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParacoccus denitrificans PD1222 
KingdomBacteria 
Replicon accessionNC_008687 
Strand
Start bp694964 
End bp696325 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content66% 
IMG OID639770860 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_917313 
Protein GI119386258 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.294103 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAGC ACGACCTGCC CCCCGGCATG ACCCGCGCCG CCGTCGCCAC CGGCACGCAT 
CCGGGCTACA TGCCCGGCTT CGGCAACGAT TTCGAGACCG AGGCGCTGCC CGGCGCCCTG
CCGCAGGGCC AGAACAGCCC GCAGAAATGC GAATACGGGC TTTATGCCGA GCAGCTGTCG
GGCACCGCCT TCACCGCGCC GCGCGGCCAG AACGAGCGGA CCTGGTGCTA TCGCATCCGG
CCCTCGGTCC GCCATACCGG CGATTTCGCG GCGATCGATC TGCCGCATTG GAAGACGGCG
CCGAACCTGC GCGACGACAT CGTCAGCCTG GGCCAGTATC GCTGGGACCC GATCCCGGTC
CCGGAGGAGG AGCTGACCTG GATCACCGGC ATGCGCAGCA TGACCACGGC CGGCGACGTG
AACATCCAGG TCGGCATGGC GTCGCATGTC TACCTGGTCA CCCGCTCGAT GCAGGACGAA
TATTTCTTCT CGGCCGACAG CGAGTTGCTG GTGGTCCCGC AAGAGGGCCG GCTGCGCTTC
TGCACCGAGC TGGGGGTGAT CGACCTGGAG CCGCGGGAAA TCGCCATCCT GCCGCGCGGC
CTGGTCTACC GCGTCGAGGT GCTGGAGGGC CCGGCCCGCG GCTTCGTCTG CGAGAATTAC
GGCCAGAAGT TCGACCTGCC GGGCCGCGGC CCGATCGGCG CCAATTGCCT GGCCAATCCG
CGCGACTTCA AATGCCCGGT CGCCGCCTTC GAGGACCGCG AGGCGCGCTC GCGCGTGGTG
ATCAAGTGGT GCGGCCGGTT CCACGAGACC TGGATCGACC ACAGCCCGCT GGACGTGGTG
GCCTGGCACG GGAATTACTG CCCCTACAAA TACGACCTGC GCACCTATTC GCCGGTGGGC
GCGATCCTGT TCGACCATCC CGACCCGTCG ATCTTCACCG TGCTGACCGC GCCCTCGGGC
CAGGAGGGCA CGGCGAATAT CGACTTCGTG CTGTTCCGCG AGCGCTGGAT GGTGGCCGAG
CACAGCTTCC GCCCGCCCTG GTATCACAAG AACATCATGT CCGAGCTGAT GGGCAACATC
TACGGCATCT ACGACGCCAA GCCGCAGGGC TTTGCGCCGG GCGGCATCAG CCTGCACAAT
TGCATGCTGC CGCACGGCCC GGACCGCGAC GCCTTCGAGG GCGCCAGCAA CGCCGATCTG
AAGCCCGAGA AGCTGGAGGA GACCATGAGC TTCATGTTCG AGACCCGCTT TCCCCAGCAC
CTCACCGAAT TCGCTGCGCG CGAGGCCCCG ATGCAGAAGG ACTATATCGA AGTCTGGAAC
CGGCTCGAGA AGAAGTTCGA CGGAACGCCA GGCGTCAAGT GA
 
Protein sequence
MTQHDLPPGM TRAAVATGTH PGYMPGFGND FETEALPGAL PQGQNSPQKC EYGLYAEQLS 
GTAFTAPRGQ NERTWCYRIR PSVRHTGDFA AIDLPHWKTA PNLRDDIVSL GQYRWDPIPV
PEEELTWITG MRSMTTAGDV NIQVGMASHV YLVTRSMQDE YFFSADSELL VVPQEGRLRF
CTELGVIDLE PREIAILPRG LVYRVEVLEG PARGFVCENY GQKFDLPGRG PIGANCLANP
RDFKCPVAAF EDREARSRVV IKWCGRFHET WIDHSPLDVV AWHGNYCPYK YDLRTYSPVG
AILFDHPDPS IFTVLTAPSG QEGTANIDFV LFRERWMVAE HSFRPPWYHK NIMSELMGNI
YGIYDAKPQG FAPGGISLHN CMLPHGPDRD AFEGASNADL KPEKLEETMS FMFETRFPQH
LTEFAAREAP MQKDYIEVWN RLEKKFDGTP GVK