Gene Pmen_1697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPmen_1697 
Symbol 
ID5109671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas mendocina ymp 
KingdomBacteria 
Replicon accessionNC_009439 
Strand
Start bp1866505 
End bp1867644 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content67% 
IMG OID640502926 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_001187193 
Protein GI146306728 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.196029 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGCC AATGGATCCG CTTTCCCCTG CGTGAAGGCG AGTGCTCGCG CCAGGCGCAT 
TGCGACCTGC CGCAGGGCAC CTACGAGCGC GAGATGGGCC GTGAGGGCTT CTTCGGCCCC
ACCGCGCACC TGCACCACAA GCATCCGCCC ACCGGCTGGA TCGACTGGGA AGGCCCGCTG
CGCCCGCACG CGTTCAACTT CAACGACATC CCCAGCGAGC GCGACTGCCC GCTGGCGGCG
CCGCTGACCC TGCACAACGC CGATGTGAAG TTGCGCGTGT GGCGCACCCA CGGCGCCATG
CGCCACCTGG TGCGCAACGC CGACGGCGAC GAGCTGCTGT TCGTCCACGA GGGGGCAGGG
CACCTGTATT GCGATTTCGG CCATCTGGAG TACCGCGACG GCGATTACCT GCTGATCCCC
CGCGGCACCG CCTGGCGCAT CGAGGCCAGC ACGCCGAGCT ACTTCCTGCT GATCGAGAAC
AGCGACGGCG CCTACCAGCT GCCGGACAAG GGCCTGCTGG GCCCACAGGC GATCTTCGAC
CCCGCCGTGC TCGATCATCC GCGGCTCGAC GAGGCCTTCA AGGCGCAGCA GGACGAGAAC
ACCTGGCAGA TCAGGATCAA GCGGCGCAAT CAGATCAGCA CCGTGACCTA CCCGTACAAC
CCGCTGGACG TGGTCGGCTG GCACGGCGAC AACACCGTGG TGCGCCTGAA CTGGCGCGAC
ATTCGTCCGC TGCTCAGTCA CCGCTATCAC CTGCCGCCGT CGGCGCACAC CACCTTCGTC
GCCAACGGCT TCGTGGTCTG CACCTTCACC CCGCGGCCGG TCGAATCCGA CCCCGGCGCG
CTCAAGGTGC CGTTCTATCA CAACAACGAC GACTACGACG AAGTGCTGTT TTACCACCGC
GGCAACTTCT TCAGCCGCGA CAACATCGAG GCCGGGATGG TCACTCTGCA CCCCTGCGGT
TTCCCCCACG GGCCGCACCC CAAGGCGCTG AAAAAGAGCC AGGAGGACCC GGCGACCTTC
ATCGACGAGG TGGCGGTGAT GATCGACACC CGCCGCGCCC TGGAAGTGGC CGATGCCGCC
GACGCGGTGG ACGTGGCCGA GTACGTCAAC TCCTGGCGCG CGCCGGGTAC ACAAGGTTAA
 
Protein sequence
MSRQWIRFPL REGECSRQAH CDLPQGTYER EMGREGFFGP TAHLHHKHPP TGWIDWEGPL 
RPHAFNFNDI PSERDCPLAA PLTLHNADVK LRVWRTHGAM RHLVRNADGD ELLFVHEGAG
HLYCDFGHLE YRDGDYLLIP RGTAWRIEAS TPSYFLLIEN SDGAYQLPDK GLLGPQAIFD
PAVLDHPRLD EAFKAQQDEN TWQIRIKRRN QISTVTYPYN PLDVVGWHGD NTVVRLNWRD
IRPLLSHRYH LPPSAHTTFV ANGFVVCTFT PRPVESDPGA LKVPFYHNND DYDEVLFYHR
GNFFSRDNIE AGMVTLHPCG FPHGPHPKAL KKSQEDPATF IDEVAVMIDT RRALEVADAA
DAVDVAEYVN SWRAPGTQG