Gene Information |
Locus tag | PA14_38510 |
Symbol | hmgA |
ID | 4380294 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Pseudomonas aeruginosa UCBPP-PA14 |
Kingdom | Bacteria |
Replicon accession | NC_008463 |
Strand | + |
Start bp | 3435469 |
End bp | 3436767 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639325661 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_791230 |
Protein GI | 116049960 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
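
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3435469
end_bp = 3436767

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1299 bp) and Protein Length (432 aa).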
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.00419964 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCTCG ACTCCACTGC CCTCGCCTAT CAATCGGGCT TCGGCAACGA ATTCAGCAGC GAAGCGCTCC CCGGCGCCCT GCCGGTCGGC CAGAACTCCC CGCAGAAAGC GCCCTACGGC CTGTACGCCG AACTGCTCTC CGGCACCGCC TTCACCATGG CTCGCAGCGA GGCCCGGCGC ACCTGGCTAT ACCGCATCAC GCCGTCGGCC AAGCATCCGC CGTTCCGCCG CCTGGAACGA CAGATCGCCG GTGCCGAACT GGATGCGCCG ACCCCCAACC GCCTGCGCTG GGACCCGCTG GCACTGCCCG AGCAGCCCAC CGACTTCCTC GACGGCCTGC TGCGCATGGC CGCCAACGCG CCCGGCGACA AGCCCGCCGG CGTGAGCATC TACCAGTACC TGGCCAACCG CTCGATGGAG CGTTGCTTCT ACGACGCCGA CGGCGAACTG CTGCTGGTCC CGCAGTTGGG CCGCCTGCGC CTGTGCACCG AACTCGGCGC GCTGCAGGTC GAACCGCTGG AGATCGCGGT GATCCCGCGC GGGATGAAGT TCCGCGTCGA GCTGCTCGAC GGCGAGGCAC GCGGCTATAT CGCCGAGAAC CACGGCGCGC CGCTGCGCCT GCCCGACCTC GGCCCGATCG GCAGCAATGG CCTGGCCAAT CCGCGCGACT TCCTGGCCCC GGTGGCGCGC TACGAAGACA GCCGCCAGCC GCTGCAACTG GTGCAGAAAT ACCTCGGCGA GCTGTGGGCC TGCGAGCTTG ACCACTCGCC GCTGGACGTG GTCGCCTGGC ACGGCAACAA CGTGCCCTAC AAGTACGACC TGCGCCGCTT CAACACCATC GGCACGGTCA GCTTCGACCA CCCGGACCCG TCGATCTTCA CCGTGCTGAC CTCCCCCACC AGCGTCCATG GCCTGGCCAA CATCGACTTC GTGATCTTCC CGCCGCGCTG GATGGTGGCC GAGAACACCT TCCGTCCGCC ATGGTTCCAC CGCAACCTGA TGAACGAGTT CATGGGCCTG ATCCAGGGCG CCTATGACGC CAAGGCCGGC GGCTTCGTGC CTGGCGGCGC CTCGCTGCAC AGTTGCATGA GCGCCCACGG CCCGGACGCG GAAAGCTGCG ACAAGGCCAT CGCCGCCGAC CTCAAGCCGC ACAGGATCGA CCAGACCATG GCCTTCATGT TCGAGACCAG CCAGGTCCTC CGGCCGAGCC GTGCCGCCCT CGAGACGCCG GCCCTGCAGA ATGACTACGA TGCCTGCTGG GCGTCGCTCG TATCCACCTT CAACCCGCAA CGGAGATAA
|
Protein sequence | MNLDSTALAY QSGFGNEFSS EALPGALPVG QNSPQKAPYG LYAELLSGTA FTMARSEARR TWLYRITPSA KHPPFRRLER QIAGAELDAP TPNRLRWDPL ALPEQPTDFL DGLLRMAANA PGDKPAGVSI YQYLANRSME RCFYDADGEL LLVPQLGRLR LCTELGALQV EPLEIAVIPR GMKFRVELLD GEARGYIAEN HGAPLRLPDL GPIGSNGLAN PRDFLAPVAR YEDSRQPLQL VQKYLGELWA CELDHSPLDV VAWHGNNVPY KYDLRRFNTI GTVSFDHPDP SIFTVLTSPT SVHGLANIDF VIFPPRWMVA ENTFRPPWFH RNLMNEFMGL IQGAYDAKAG GFVPGGASLH SCMSAHGPDA ESCDKAIAAD LKPHRIDQTM AFMFETSQVL RPSRAALETP ALQNDYDACW ASLVSTFNPQ RR
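
Both sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of the source record, and only the first two blocks of the gene sequence are used as a short sample.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that separates the ten-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used here as a short sample.
sample = clean_sequence("ATGAACCTCG ACTCCACTGC")
print(len(sample), gc_fraction(sample))
```

Applied to the full gene sequence, the result of `gc_fraction` can be compared with the value listed under GC content, and `len(clean_sequence(...))` with the listed Gene Length.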