Gene Pmen_1696 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPmen_1696 
Symbol 
ID5109670 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas mendocina ymp 
KingdomBacteria 
Replicon accessionNC_009439 
Strand
Start bp1865418 
End bp1866500 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content63% 
IMG OID640502925 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_001187192 
Protein GI146306727 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.141185 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCCG TGGCCAAGAT CGAGCAGCAC AACCCCATCG GAACCGACGG TTTCGAGTTC 
GTCGAATTCA CCGCACCGAC CGCCGAAGGC ATCGAGCAAC TGCGCCAGTT GTTCACCGCC
ATGGGCTTTA CCGAAACCGC CAAGCACCGC TCGAAAGAGG TGTGGCTGTT CCAGCAGAAC
GACATCAACA TCGTGCTCAA CGGCAGCCCC ACCGGGCACG TGCGCGCCTT CGGCGAGAAG
CACGGCCCCA GCGCCTGCGC CATGGCGTTC CGGGTCAAGA ACGCCGCCCA GGCCGCGGCC
TACGTCGAGC AGCAGGGCGC CAAGCTGGTG GGCAGCCACG CCAATTTCGG CGAGCTGAAC
ATTCCCTGCG TCGAAGGCAT CGGCGGCTCG CTGCTGTATC TGGTTGACCG CTATGGCGAC
AAGACCATCT ACGACGTCGA CTTCGAGTAC ATCGAAGGCC GCAGTGCCAA CGACAACGCC
GTCGGCCTGC AGTGCATCGA CCACCTGACC CATAACGTGC GCCGCGGACA GATGGACGTA
TGGTCCGGCT TCTACGAGCG CATCGCCAAC TTCCGCGAGA TCCGCTACTT CGACATCGAA
GGCAAGCTCA CCGGCCTGTT CTCCCGCGCC ATGACCGCGC CGTGCGGCAA GATCCGCATC
CCGATCAATG AGTCGGCCGA TGACAAGTCG CAGATCGAGG AATTCATCCG CGAGTACCAC
GGCGAGGGCA TTCAGCACAT CGCCCTGTCC ACCGACGACA TCTACGCCAC CGTGCGCCAG
CTGCGCGCCA ACGGCGTGGA CTTCATGACC ACCCCGGACA CCTATTATGA AAAGGTCGAC
AGCCGCGTCG CCGGCCATGG CGAGCCGACC GATGTGCTGC GCGAGCTGAA CATCCTGATC
GACGGAGCGC CGGGCGACGA CGGCATCCTG CTGCAGATCT TCACCAACAC GGTGATCGGC
CCGATCTTCT TCGAGATCAT CCAGCGCAAG GGCAATCAGG GTTTCGGCGA GGGCAACTTC
AAGGCGCTGT TCGAGTCGAT CGAGGAAGAC CAGCTGCGTC GCGGTGTGAT CTCCGAGGAG
TGA
 
Protein sequence
MNAVAKIEQH NPIGTDGFEF VEFTAPTAEG IEQLRQLFTA MGFTETAKHR SKEVWLFQQN 
DINIVLNGSP TGHVRAFGEK HGPSACAMAF RVKNAAQAAA YVEQQGAKLV GSHANFGELN
IPCVEGIGGS LLYLVDRYGD KTIYDVDFEY IEGRSANDNA VGLQCIDHLT HNVRRGQMDV
WSGFYERIAN FREIRYFDIE GKLTGLFSRA MTAPCGKIRI PINESADDKS QIEEFIREYH
GEGIQHIALS TDDIYATVRQ LRANGVDFMT TPDTYYEKVD SRVAGHGEPT DVLRELNILI
DGAPGDDGIL LQIFTNTVIG PIFFEIIQRK GNQGFGEGNF KALFESIEED QLRRGVISEE