Gene PA14_38510 details


Gene Information

Locus tag: PA14_38510
Symbol: hmgA
ID: 4380294
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas aeruginosa UCBPP-PA14
Kingdom: Bacteria
Replicon accession: NC_008463
Strand:
Start bp: 3435469
End bp: 3436767
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 68%
IMG OID: 639325661
Product: homogentisate 1,2-dioxygenase
Protein accession: YP_791230
Protein GI: 116049960
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR01015] homogentisate 1,2-dioxygenase
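
The coordinates and lengths listed above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, and the helper name `span_length` is illustrative only.

```python
def span_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1

# Values copied from the record above.
start_bp, end_bp = 3435469, 3436767

gene_length = span_length(start_bp, end_bp)
# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1299 432
```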


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.00419964
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCTCG ACTCCACTGC CCTCGCCTAT CAATCGGGCT TCGGCAACGA ATTCAGCAGC 
GAAGCGCTCC CCGGCGCCCT GCCGGTCGGC CAGAACTCCC CGCAGAAAGC GCCCTACGGC
CTGTACGCCG AACTGCTCTC CGGCACCGCC TTCACCATGG CTCGCAGCGA GGCCCGGCGC
ACCTGGCTAT ACCGCATCAC GCCGTCGGCC AAGCATCCGC CGTTCCGCCG CCTGGAACGA
CAGATCGCCG GTGCCGAACT GGATGCGCCG ACCCCCAACC GCCTGCGCTG GGACCCGCTG
GCACTGCCCG AGCAGCCCAC CGACTTCCTC GACGGCCTGC TGCGCATGGC CGCCAACGCG
CCCGGCGACA AGCCCGCCGG CGTGAGCATC TACCAGTACC TGGCCAACCG CTCGATGGAG
CGTTGCTTCT ACGACGCCGA CGGCGAACTG CTGCTGGTCC CGCAGTTGGG CCGCCTGCGC
CTGTGCACCG AACTCGGCGC GCTGCAGGTC GAACCGCTGG AGATCGCGGT GATCCCGCGC
GGGATGAAGT TCCGCGTCGA GCTGCTCGAC GGCGAGGCAC GCGGCTATAT CGCCGAGAAC
CACGGCGCGC CGCTGCGCCT GCCCGACCTC GGCCCGATCG GCAGCAATGG CCTGGCCAAT
CCGCGCGACT TCCTGGCCCC GGTGGCGCGC TACGAAGACA GCCGCCAGCC GCTGCAACTG
GTGCAGAAAT ACCTCGGCGA GCTGTGGGCC TGCGAGCTTG ACCACTCGCC GCTGGACGTG
GTCGCCTGGC ACGGCAACAA CGTGCCCTAC AAGTACGACC TGCGCCGCTT CAACACCATC
GGCACGGTCA GCTTCGACCA CCCGGACCCG TCGATCTTCA CCGTGCTGAC CTCCCCCACC
AGCGTCCATG GCCTGGCCAA CATCGACTTC GTGATCTTCC CGCCGCGCTG GATGGTGGCC
GAGAACACCT TCCGTCCGCC ATGGTTCCAC CGCAACCTGA TGAACGAGTT CATGGGCCTG
ATCCAGGGCG CCTATGACGC CAAGGCCGGC GGCTTCGTGC CTGGCGGCGC CTCGCTGCAC
AGTTGCATGA GCGCCCACGG CCCGGACGCG GAAAGCTGCG ACAAGGCCAT CGCCGCCGAC
CTCAAGCCGC ACAGGATCGA CCAGACCATG GCCTTCATGT TCGAGACCAG CCAGGTCCTC
CGGCCGAGCC GTGCCGCCCT CGAGACGCCG GCCCTGCAGA ATGACTACGA TGCCTGCTGG
GCGTCGCTCG TATCCACCTT CAACCCGCAA CGGAGATAA
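
The length and GC content reported for this gene can be recomputed from the sequence block above. The following is a minimal sketch, assuming the sequence is pasted in as a string with its spaces and line breaks intact; the function name `gc_summary` is illustrative and not part of any database tool.

```python
def gc_summary(dna):
    """Return (length, GC fraction) for a DNA string, ignoring whitespace."""
    cleaned = "".join(dna.split()).upper()
    gc_count = sum(1 for base in cleaned if base in "GC")
    return len(cleaned), gc_count / len(cleaned)

# First line of the gene sequence, used here only as a short demonstration.
first_line = "ATGAACCTCG ACTCCACTGC CCTCGCCTAT CAATCGGGCT TCGGCAACGA ATTCAGCAGC"
length, gc_fraction = gc_summary(first_line)
print(length, round(gc_fraction * 100), "% GC")
```

Passing the full 1299 bp sequence instead of the first line gives the gene-wide figure to compare against the GC content in the record.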
 
Protein sequence
MNLDSTALAY QSGFGNEFSS EALPGALPVG QNSPQKAPYG LYAELLSGTA FTMARSEARR 
TWLYRITPSA KHPPFRRLER QIAGAELDAP TPNRLRWDPL ALPEQPTDFL DGLLRMAANA
PGDKPAGVSI YQYLANRSME RCFYDADGEL LLVPQLGRLR LCTELGALQV EPLEIAVIPR
GMKFRVELLD GEARGYIAEN HGAPLRLPDL GPIGSNGLAN PRDFLAPVAR YEDSRQPLQL
VQKYLGELWA CELDHSPLDV VAWHGNNVPY KYDLRRFNTI GTVSFDHPDP SIFTVLTSPT
SVHGLANIDF VIFPPRWMVA ENTFRPPWFH RNLMNEFMGL IQGAYDAKAG GFVPGGASLH
SCMSAHGPDA ESCDKAIAAD LKPHRIDQTM AFMFETSQVL RPSRAALETP ALQNDYDACW
ASLVSTFNPQ RR
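
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a hedged illustration using only the Python standard library: it builds the codon table from the compact amino-acid string used for NCBI translation table 11, whose codon assignments match the standard code. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
import itertools

# Codon order follows the NCBI convention: bases T, C, A, G with the
# third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence in this record.
print(translate("ATGAACCTCG ACTCCACTGC CCTCGCCTAT"))  # MNLDSTALAY
```

The printed residues match the first ten residues of the protein sequence above; the final codons of the gene (CGG AGA TAA) translate to RR followed by a stop, matching its last two residues.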