Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmet_4374 |
Symbol | hmgA |
ID | 4041232 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007974 |
Strand | + |
Start bp | 972562 |
End bp | 973890 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637979795 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_586508 |
Protein GI | 94313299 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0000328355 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.761675 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAGA CCCAACTCAA TGTGGCCGAC ACCGCCGTGG CGGAGTACAT GTCCGGTTTT GCCAATGAGT TCGCCACGGA GGCGTTGCCC GGCGCGTTGC CGGTAGGACG CAACTCGCCG CAGCGCGCGC CGTATGGCCT GTATGCCGAA CAGCTATCGG GCACCGCGTT CACCGCGCCG CGCGCCCACA ATCGCCGTTC GTGGCTGTAT CGCATTCGCC CGGCAGCCAT GCACAAGCCG TTCACGTTGA TCGAGCAGGA TCGTTGGCTG AGCCGCTTCG ACGAAGTGGC GCCATCGCCC AATCAGATGC GCTGGAGTGC GCCGGCCATG CCGACCGTTC CCACCGACTT TGTCGATGGC ATCGTCACGA TGGCTGGTAA CGGCGGTCCG GAAGCGCTGT CCGGGTGCGG GATCCATCTG TATCTGGCCA ACGCGTCGAT GCGGGACCGC TTCTTCTACA ACGCCGATGG CGAAATGCTG ATCGTGCCGC AGGAGGGCCG TCTGCTGATC GTCACCGAGA TGGGGCGCCT GGCGGTGGAG CCGCAGGAGA TCGTGGTGAT CCCGCGCGGC GTGCGTTTCC GCGTGGAACT GCCCGACGGC GCGGCGCGTG GCTATATCTG CGAAAACTAC GGCGCGATGT TCAAGCTGCC CGACCTCGGC GTAATCGGCT CCAACGGGCT GGCCAACCCG CGTGATTTCC TGACGCCGGT CGCCAGCTAC GAGGATCGCG AAGGCGACTT CGAACTGGTC GCCCGTTTCC AGGGCAATCT GTGGCGTGCC GACATTGGCC ACTCGCCGCT CGATGTGGTG GCGTGGCACG GCAACTATGT GCCGTACAAG TACGATCTGC GCCTCTTCAA CACGATCGGC TCGATCAGCT ACGACCATCC GGACCCGTCG ATTTTCCTGG TGCTGCAATC GCCGTCGGAT ACGCCGGGCG TGGACACGAT CGACTTCGTG ATCTTCGGCC CGCGCTGGCT GGCTGCGGAA GACACGTTCC GTCCGCCCTG GTTCCACCGC AATATCGCCA GCGAATTCAT GGGTTTGATC GCGGGCGAGT ATGACGCCAA GGCCGAAGGC TTTGTGCCGG GCGGGGCGAG CCTGCACAAC TGCATGAGCG GTCACGGCCC CGATGCGGAG ACCTTCGAGC GTGCGTCCGC CTCTGACACC AGCAAGCCGC ACCACATCAC GGACACGATG GCATTCATGT TCGAAACCCC GGGCGTGATC CGTCCGACGC GGCACGCTGC CGAGTCGGCG CTGCTCCAGC ACGATTACTA CACCTGCTGG CAAGGCCTGA AGAAGCATTT CAACCCGAAC GTGCGTTGA
|
Protein sequence | MTQTQLNVAD TAVAEYMSGF ANEFATEALP GALPVGRNSP QRAPYGLYAE QLSGTAFTAP RAHNRRSWLY RIRPAAMHKP FTLIEQDRWL SRFDEVAPSP NQMRWSAPAM PTVPTDFVDG IVTMAGNGGP EALSGCGIHL YLANASMRDR FFYNADGEML IVPQEGRLLI VTEMGRLAVE PQEIVVIPRG VRFRVELPDG AARGYICENY GAMFKLPDLG VIGSNGLANP RDFLTPVASY EDREGDFELV ARFQGNLWRA DIGHSPLDVV AWHGNYVPYK YDLRLFNTIG SISYDHPDPS IFLVLQSPSD TPGVDTIDFV IFGPRWLAAE DTFRPPWFHR NIASEFMGLI AGEYDAKAEG FVPGGASLHN CMSGHGPDAE TFERASASDT SKPHHITDTM AFMFETPGVI RPTRHAAESA LLQHDYYTCW QGLKKHFNPN VR
|
| |