Gene Rmet_4374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmet_4374 
SymbolhmgA 
ID4041232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCupriavidus metallidurans CH34 
KingdomBacteria 
Replicon accessionNC_007974 
Strand
Start bp972562 
End bp973890 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content64% 
IMG OID637979795 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_586508 
Protein GI94313299 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000328355 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.761675 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAGA CCCAACTCAA TGTGGCCGAC ACCGCCGTGG CGGAGTACAT GTCCGGTTTT 
GCCAATGAGT TCGCCACGGA GGCGTTGCCC GGCGCGTTGC CGGTAGGACG CAACTCGCCG
CAGCGCGCGC CGTATGGCCT GTATGCCGAA CAGCTATCGG GCACCGCGTT CACCGCGCCG
CGCGCCCACA ATCGCCGTTC GTGGCTGTAT CGCATTCGCC CGGCAGCCAT GCACAAGCCG
TTCACGTTGA TCGAGCAGGA TCGTTGGCTG AGCCGCTTCG ACGAAGTGGC GCCATCGCCC
AATCAGATGC GCTGGAGTGC GCCGGCCATG CCGACCGTTC CCACCGACTT TGTCGATGGC
ATCGTCACGA TGGCTGGTAA CGGCGGTCCG GAAGCGCTGT CCGGGTGCGG GATCCATCTG
TATCTGGCCA ACGCGTCGAT GCGGGACCGC TTCTTCTACA ACGCCGATGG CGAAATGCTG
ATCGTGCCGC AGGAGGGCCG TCTGCTGATC GTCACCGAGA TGGGGCGCCT GGCGGTGGAG
CCGCAGGAGA TCGTGGTGAT CCCGCGCGGC GTGCGTTTCC GCGTGGAACT GCCCGACGGC
GCGGCGCGTG GCTATATCTG CGAAAACTAC GGCGCGATGT TCAAGCTGCC CGACCTCGGC
GTAATCGGCT CCAACGGGCT GGCCAACCCG CGTGATTTCC TGACGCCGGT CGCCAGCTAC
GAGGATCGCG AAGGCGACTT CGAACTGGTC GCCCGTTTCC AGGGCAATCT GTGGCGTGCC
GACATTGGCC ACTCGCCGCT CGATGTGGTG GCGTGGCACG GCAACTATGT GCCGTACAAG
TACGATCTGC GCCTCTTCAA CACGATCGGC TCGATCAGCT ACGACCATCC GGACCCGTCG
ATTTTCCTGG TGCTGCAATC GCCGTCGGAT ACGCCGGGCG TGGACACGAT CGACTTCGTG
ATCTTCGGCC CGCGCTGGCT GGCTGCGGAA GACACGTTCC GTCCGCCCTG GTTCCACCGC
AATATCGCCA GCGAATTCAT GGGTTTGATC GCGGGCGAGT ATGACGCCAA GGCCGAAGGC
TTTGTGCCGG GCGGGGCGAG CCTGCACAAC TGCATGAGCG GTCACGGCCC CGATGCGGAG
ACCTTCGAGC GTGCGTCCGC CTCTGACACC AGCAAGCCGC ACCACATCAC GGACACGATG
GCATTCATGT TCGAAACCCC GGGCGTGATC CGTCCGACGC GGCACGCTGC CGAGTCGGCG
CTGCTCCAGC ACGATTACTA CACCTGCTGG CAAGGCCTGA AGAAGCATTT CAACCCGAAC
GTGCGTTGA
 
Protein sequence
MTQTQLNVAD TAVAEYMSGF ANEFATEALP GALPVGRNSP QRAPYGLYAE QLSGTAFTAP 
RAHNRRSWLY RIRPAAMHKP FTLIEQDRWL SRFDEVAPSP NQMRWSAPAM PTVPTDFVDG
IVTMAGNGGP EALSGCGIHL YLANASMRDR FFYNADGEML IVPQEGRLLI VTEMGRLAVE
PQEIVVIPRG VRFRVELPDG AARGYICENY GAMFKLPDLG VIGSNGLANP RDFLTPVASY
EDREGDFELV ARFQGNLWRA DIGHSPLDVV AWHGNYVPYK YDLRLFNTIG SISYDHPDPS
IFLVLQSPSD TPGVDTIDFV IFGPRWLAAE DTFRPPWFHR NIASEFMGLI AGEYDAKAEG
FVPGGASLHN CMSGHGPDAE TFERASASDT SKPHHITDTM AFMFETPGVI RPTRHAAESA
LLQHDYYTCW QGLKKHFNPN VR