Gene PP_2554 details

Gene Information

Locus tag: PP_2554
Symbol:
ID: 1045859
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas putida KT2440
Kingdom: Bacteria
Replicon accession: NC_002947
Strand:
Start bp: 2900916
End bp: 2902823
Gene Length: 1908 bp
Protein Length: 635 aa
Translation table: 11
GC content: 64%
IMG OID: 637145977
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: NP_744699
Protein GI: 26989274
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism; [R] General function prediction only
COG ID: [COG1082] Sugar phosphate isomerases/epimerases; [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
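
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record; names are hypothetical.
start_bp = 2900916
end_bp = 2902823

# Assumption: coordinates are inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1908 635
```

Both results agree with the Gene Length and Protein Length fields in the record.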


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00914053
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCAGCGTT CGATCGCTAC CGTGTCCTTG AGCGGCACTC TGCCGGAAAA GCTCGAAGCC 
ATCGCCGCCG CCGGTTTTGA CGGCGTCGAG ATCTTCGAGA ACGATCTGCT CTATTACGCT
GGCAGCCCGC GCCAGGTGCG CCAGATGTGC GCCGACCTGG GCATTGCCAT CACCTTGTTC
CAGCCTTTCC GCGACTTTGA AGGCTGCCGC CGTGACCGCC TGCAGAAAAA CCTCGACCGC
GCCGAACGCA AGTTCGACCT GATGCAGGAG CTGGGTACCG ACCTGGTGCT GGTGTGCAGC
AACGTCCAGG CCGATGCCCT GGGTGACGAG CAACTGTTGG TCGACGACCT GCGCCTGCTG
GGCGAACATG CCGGCAAGCG TGGCCTGCGC ATTGGTTACG AAGCGCTGGC CTGGGGCCGC
CACGTCAACA CTTACCAGCA AGTGTGGAAC CTGGTGCGCC AGGCCGACCA CCCGGCACTC
GGGGTGATCC TCGACAGCTT CCACACCTTG TCGCTCAAAG GTGACCCCAG CGCGATCCGC
GACATCCCCG GCGACAAGAT CTTCTTCGTG CAAATGGCCG ATGCGCCGAT CCTGGCCATG
GATGTGCTGG AGTGGAGCCG CCACTTTCGC TGCTTCCCGG GGCAGGGCGA AATGGACATG
GCCGGTTTCC TGGCGCCGAT CCTCGCCACG GGTTACCGTG GCCCGCTGTC GCTGGAAATC
TTCAACGACG GCTTCCGCGC CGCACCGACC CGGCAGAATG CCGCCGACGG CTTGCGTTCG
CTGCTGTACC TCGAAGAACA GACCCGCTTG CGCCTGGAGC AGGAGAACAC GCCGATCGAA
CCTGGCGTGC TGTTCTCCCC GCCGCCGGCC AGCGCTTATG ACGGCGTGGA GTTCCTGGAG
TTCGCGGTCG ACGAAGCCGT CGGCGCGCGC CTGGGCAACT GGCTGAAGCG CCTGGGCTTT
GCCGAAGCCG GCAAGCACCG CAGCAAAGAA GTGCAACTGC TGCGCCAGGG TGATATCAAC
ATTGTGCTGA ACGCCGAACC GTATTCCTTC GGCCACAACT TCTTCGAGGC CCATGGCCCA
TCGCTGTGCG CCACTGCGCT GCGGGTCAAG GACCAGCAAG CGGCCTTGAA GCGGGCCACC
GCCTTCCGTG GCCAGCCGTT CCGCGGCCTG GTCGGCCCCA ACGAATGCGA AGTGCCGGCG
GTGCGTGCGC CCGATGGCAG CCTGCTGTAT CTGGTGGAGC AGGGCACTGC CGGCCACACC
CTGTACGATA CCGACTTCAG CCTGGACAAC AACGCAACCG CTACCGGCGG CCTGCGCCGC
ATCGACCACA TGGCCCTAGC CTTGCCGGCC GAGTCGCTGG ACAGCTGGGT GCTGTTCTAC
AAGAGCTTGT TCGACTTCGC CGCCGACGAC GAGGTGGTGC TGCCCGACCC GTATGGCCTG
GTCAAGAGCC GCGCCTTGCG CAGCCAGTGC GGCACTTTGC GCCTGCCGCT GAACATCTCG
GAAAACCGCA ACACCGCCAT CGCCCATGCG CTGTCAAGCT ACCGTGGTTC GGGCGTGCAT
CACATCGCTT TCGATTGTGA CGACATCTTC CGCGAAGTGG CGCGGGCCAA GCTGGCAGGG
GTACCGCTGC TGGAAATCCC GCTGAACTAC TACGACGACC TGGCGGCGCG TTTCGATTTC
GACGACGAGT TCCTCAGTGA GCTGGCGTAC TACAACGTGC TGTATGACCG CGACGCTCAA
GGTGGCGAGC TGTTCCACGT CTATACCGAG CCGTTCGAGG AGCGTTTCTT CTTCGAGATC
ATCCAGCGCA AGGCGGGGTA CGCTGGTTAC GGCGCTGCCA ACGTTGCGGT GCGCCTGGCA
GCCATGGCCA AGGCCCGTAG CGGGGCGGCG CGCAAGCCGG TGCTGTAG
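
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_content` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record as sample input.
first_line = (
    "ATGCAGCGTT CGATCGCTAC CGTGTCCTTG AGCGGCACTC TGCCGGAAAA GCTCGAAGCC"
)

seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 2))
```

Passing the full sequence block to `clean_sequence` should yield a string whose length matches the Gene Length field, which gives a quick check that nothing was lost when copying.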
 
Protein sequence
MQRSIATVSL SGTLPEKLEA IAAAGFDGVE IFENDLLYYA GSPRQVRQMC ADLGIAITLF 
QPFRDFEGCR RDRLQKNLDR AERKFDLMQE LGTDLVLVCS NVQADALGDE QLLVDDLRLL
GEHAGKRGLR IGYEALAWGR HVNTYQQVWN LVRQADHPAL GVILDSFHTL SLKGDPSAIR
DIPGDKIFFV QMADAPILAM DVLEWSRHFR CFPGQGEMDM AGFLAPILAT GYRGPLSLEI
FNDGFRAAPT RQNAADGLRS LLYLEEQTRL RLEQENTPIE PGVLFSPPPA SAYDGVEFLE
FAVDEAVGAR LGNWLKRLGF AEAGKHRSKE VQLLRQGDIN IVLNAEPYSF GHNFFEAHGP
SLCATALRVK DQQAALKRAT AFRGQPFRGL VGPNECEVPA VRAPDGSLLY LVEQGTAGHT
LYDTDFSLDN NATATGGLRR IDHMALALPA ESLDSWVLFY KSLFDFAADD EVVLPDPYGL
VKSRALRSQC GTLRLPLNIS ENRNTAIAHA LSSYRGSGVH HIAFDCDDIF REVARAKLAG
VPLLEIPLNY YDDLAARFDF DDEFLSELAY YNVLYDRDAQ GGELFHVYTE PFEERFFFEI
IQRKAGYAGY GAANVAVRLA AMAKARSGAA RKPVL