Gene PP_4621 details

Gene Information

Locus tag: PP_4621
Symbol: hmgA
ID: 1041590
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas putida KT2440
Kingdom: Bacteria
Replicon accession: NC_002947
Strand:
Start bp: 5243178
End bp: 5244479
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 64%
IMG OID: 637148019
Product: homogentisate 1,2-dioxygenase
Protein accession: NP_746730
Protein GI: 26991305
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR01015] homogentisate 1,2-dioxygenase
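
The coordinates and lengths listed above are mutually consistent, which can be checked with a short calculation. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5243178
end_bp = 5244479

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1302
print(protein_length_aa)  # 433
```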


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.356023
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCGCG ACACGTCGCC CGACCTTCAC TACCTGAGTG GCTTCGGCAA CGAATTCGCC 
AGCGAAGCAT TGCCCGGGGC GCTGCCTGTT GGGCAGAACT CCCCGCAGAA GGCCCCGTAT
GGCCTGTATG CCGAGCTGCT GTCGGGCACG GCGTTCACCA TGGCCCGCAG CGAGCTGCGC
CGTACCTGGC TGTACCGCAT TCGCCCTTCT GCCTTGCACC CACGCTTCGA GCGCCTGGCG
CGCCAGCCGC TCGGCGGGCC ACTGGGTGGC ATCAACCCCA ACCGCCTGCG CTGGAGCCCG
CAGCCGATTC CTGCTGAACC GACCGATTTC ATCGAAGGTT GGCTGCCCAT GGCTGCCAAC
GCCGGAGCGG AAAAACCGGC TGGCGTGAGC ATCTACATCT ACCGCGCCAA CCGGTCCATG
GAACGGGTGT TCTTCAACGC AGACGGTGAG CTGCTACTGG TGCCGGAACA GGGCCGCCTG
CGTATCGCCA CCGAGCTGGG CGTGATGGAG GTCGAACCGT TGGAAATTGC GGTGATCCCA
CGTGGCATGA AGTTCCGCGT CGAACTGCTC GACGGCCAGG CCCGTGGCTA CATCGCGGAA
AACCACGGTG CGCCGCTGCG TCTGCCGGAC CTGGGCCCGA TCGGCAGCAA CGGCCTGGCC
AACCCCCGCG ACTTCCTCAC GCCTGTGGCC CACTACGAAG AAGCCGAAGG CCCGGTGCAA
CTGGTACAGA AGTTCCTGGG TGAGCACTGG GCCTGCGAGC TGCAGCACTC GCCACTGGAC
GTTGTGGCCT GGCATGGCAG CAACGTGCCG TACAAGTATG ACCTGCGCCG CTTCAACACC
ATCGGCACGG TCAGCTTCGA CCACCCGGAC CCCTCGATCT TCACCGTGCT CACCTCGCCA
ACCAGCGTGC ATGGCATGGC CAACATGGAC TTCGTGATTT TCCCGCCACG CTGGATGGTG
GCCGAGAACA CCTTCCGTCC GCCATGGTTC CACCGCAACC TGATGAACGA GTTCATGGGC
CTGATCAATG GCGCCTACGA CGCCAAGGCC GAGGGCTTCC TGCCGGGTGG TGCCTCGTTG
CACGGGGTGA TGAGTGCCCA TGGCCCCGAC GCCGAAACCT GTGAAAAGGC CATTGCCGCT
GACCTGGCGC CACACAAGAT CGACAACACC ATGGCCTTCA TGTTCGAGAC CAGCCAAGTG
TTGCGCCCGA GCCTGCAAGC CCTTGAATGC CCGCAATTGC AGGCCGACTA CGATAGTTGC
TGGGCGACTT TGCCGAGCAC CTTCAACCCG AACCGGAGAT AA
 
Protein sequence
MNRDTSPDLH YLSGFGNEFA SEALPGALPV GQNSPQKAPY GLYAELLSGT AFTMARSELR 
RTWLYRIRPS ALHPRFERLA RQPLGGPLGG INPNRLRWSP QPIPAEPTDF IEGWLPMAAN
AGAEKPAGVS IYIYRANRSM ERVFFNADGE LLLVPEQGRL RIATELGVME VEPLEIAVIP
RGMKFRVELL DGQARGYIAE NHGAPLRLPD LGPIGSNGLA NPRDFLTPVA HYEEAEGPVQ
LVQKFLGEHW ACELQHSPLD VVAWHGSNVP YKYDLRRFNT IGTVSFDHPD PSIFTVLTSP
TSVHGMANMD FVIFPPRWMV AENTFRPPWF HRNLMNEFMG LINGAYDAKA EGFLPGGASL
HGVMSAHGPD AETCEKAIAA DLAPHKIDNT MAFMFETSQV LRPSLQALEC PQLQADYDSC
WATLPSTFNP NRR
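
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a small script. This is a minimal sketch rather than the method used to build the record: the function names (`clean`, `gc_fraction`, `translate`) are hypothetical, and it assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code. Only the first and last displayed lines of the gene sequence are used here, so the printed GC fraction applies to the first 60 bases and not to the whole gene.

```python
# Codon table in NCBI order (bases T, C, A, G). Assumption: translation
# table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq):
    """Remove the display spaces and line breaks, and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last displayed lines of the gene sequence above.
first_line = "ATGAACCGCG ACACGTCGCC CGACCTTCAC TACCTGAGTG GCTTCGGCAA CGAATTCGCC"
last_line = "TGGGCGACTT TGCCGAGCAC CTTCAACCCG AACCGGAGAT AA"

print(translate(first_line))    # MNRDTSPDLHYLSGFGNEFA
print(translate(last_line))     # WATLPSTFNPNRR
print(gc_fraction(first_line))  # 0.6
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow the result to be compared against the Protein sequence and GC content fields.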