Gene PP_4038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPP_4038 
Symbol 
ID1042076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas putida KT2440 
KingdomBacteria 
Replicon accessionNC_002947 
Strand
Start bp4550629 
End bp4551903 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content63% 
IMG OID637147446 
Productdihydropyrimidine dehydrogenase 
Protein accessionNP_746165 
Protein GI26990740 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01037] dihydroorotate dehydrogenase (subfamily 1) family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACT TGTCTATCGT ATTTGCCGGC ATCAAGGCAC CCAACCCCTT CTGGCTGGCC 
TCCGCACCGC CAACCGACAA GGCCTACAAC GTGGTCCGCG CCTTCGAGGC CGGCTGGGGC
GGCGTGGTCT GGAAGACCCT GGGCGAAGAC CCTGCGGCGG TCAACGTGTC GTCGCGCTAC
TCGGCGCACT ATGGCCCCAA CCGTCAGGTA CAGGGGATCA ACAATATCGA GTTGATCACC
GACCGCTCTC TGGAAATCAA CCTGCGTGAA ATCACCCAGG TGAAGAAGGA CTGGCCCGAC
CGCGCACTGA TCGTGTCGCT GATGGTGCCG TGCATAGAGG AATCGTGGAA GTTCATCCTG
CCGCTGGTGG AAGCCACGGG GGCCGATGGC ATCGAGCTGA ACTTCGGTTG CCCGCACGGT
ATGCCGGAGC GCGGCATGGG GGCGGCCGTC GGCCAGGTGC CGGAATATGT GGAAATGGTC
ACCCGCTGGT GCAAGACGTA CTGCGCGCTG CCGGTGATCG TGAAGCTTAC GCCGAACATC
ACCGATATCC GCCAGTCGGC CCGCGCCGCG CATCGAGGCG GGGCCGATGC CGTGTCGTTG
ATCAACACCA TCAACTCCAT CACCAGCGTC GACCTGGACC GCATGGTGGC CCACCCGATC
GTCGGAGACC AGAGCACCCA TGGCGGTTAC TGCGGTTCGG CAGTGAAGCC GATTGCCTTG
AACATGGTGG CGGAAATCGC CCGCGACCCG GAAACGCGGG GCTTGCCGAT TTGTGGCATT
GGCGGCATCG GCAACTGGCG TGACGCCGCC GAGTTCATGG CCTTGGGCAG TGGCGCCGTG
CAGGTGTGCA CGGCGGCGAT GCTGCATGGC TTCCGTATCG TCGAGGACAT GCAGGACGGC
CTGGCACGCT GGATGGACCA GCATGGGCAT GCCACTGTCG AAGCGTTCCG CGGGCAGGCG
GTGGGGCATA CCACCGACTG GAAATACCTG GACATCAACT ACAAGTCGGT CGCGCACATT
GACCAGGAGG CGTGCATTGG CTGCGGGCGC TGCCACATTG CGTGCGAGGA CACCTCGCAC
CAGGCGATTG CCAGCACGCT GAAGGCTGAC GGCACGCATG CCTACAGTGT GATCGAAGAG
GAATGCGTAG GCTGTAACCT GTGCCAGATC ACCTGCCCGG TGGAAAACTG CATCGAAATG
GAGGCACAGG ATACCGGCAA GCCGTACCTG AACTGGACGC AGGATCCGCG TAACCCCTAC
CGTGAGGCCA GTTGA
 
Protein sequence
MADLSIVFAG IKAPNPFWLA SAPPTDKAYN VVRAFEAGWG GVVWKTLGED PAAVNVSSRY 
SAHYGPNRQV QGINNIELIT DRSLEINLRE ITQVKKDWPD RALIVSLMVP CIEESWKFIL
PLVEATGADG IELNFGCPHG MPERGMGAAV GQVPEYVEMV TRWCKTYCAL PVIVKLTPNI
TDIRQSARAA HRGGADAVSL INTINSITSV DLDRMVAHPI VGDQSTHGGY CGSAVKPIAL
NMVAEIARDP ETRGLPICGI GGIGNWRDAA EFMALGSGAV QVCTAAMLHG FRIVEDMQDG
LARWMDQHGH ATVEAFRGQA VGHTTDWKYL DINYKSVAHI DQEACIGCGR CHIACEDTSH
QAIASTLKAD GTHAYSVIEE ECVGCNLCQI TCPVENCIEM EAQDTGKPYL NWTQDPRNPY
REAS