Gene RSp0691 details

Gene Information

Locus tag: RSp0691
Symbol: hmgA
ID: 1222998
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ralstonia solanacearum GMI1000
Kingdom: Bacteria
Replicon accession: NC_003296
Strand:
Start bp: 878028
End bp: 879374
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 69%
IMG OID: 637240551
Product: homogentisate 1,2-dioxygenase
Protein accession: NP_522252
Protein GI: 17548912
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR01015] homogentisate 1,2-dioxygenase
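
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon that contributes no residue.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(878028, 879374)
print(length, protein_length_aa(length))  # 1347 448
```

Under these assumptions the record is internally consistent: 1347 bp is 449 codons, of which one is the stop codon, leaving 448 residues.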


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.398963
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATGC TTGCTCCCGC GGCCAAGAAC GCCTTCACCC CCGCGTCGCC GGACCGGCCC 
GCCTACCAGA GCGGCTTCGG CAACGAGTTC GCCACCGAGG CGCTGCCGGG CGCACTGCCG
CACGGCCAGA ATTCGCCACA GCAGGCGCCG TATGGCCTGT ATGCCGAGCA ACTGTCCGGC
ACGGCGTTCA CCGCGCCGCG CGCGCACAAT CGCCGCGCGT GGCTGTACCG CATCCGCCCG
GCCGCCGTGC ATCTGCCATT CGAGCCCATC GCGCAAGACC GCTTCCACAG CGACTTCCAC
GCCGTGCCTG CGTCGCCCAA CCAGCTGCGC TGGGACCCGC TGCCCGCGCC GGCCGCCGGC
ACGGACTTCA TCGACGGCAT CGTCACCTTT GCCGGCAACG GCGGGCCGGA TGCGCAGACC
GGCTGCGGCA TCCATCTCTA TGCCGCCAAC GCGTCGATGA CCGGCCGCTT CTTCTACAAC
GCCGACGGCG AACTGCTGAT CGTGCCGCAG CAAGGCCGCC TGCGCCTGCT GACCGAGCTG
GGCGTGCTCG ACGTCGAACC GCTGGAAATC GCCGTGATCC CGCGCGGCGT GCGCTTTCGC
GTGGAACTGC CGGACGGCGA GGCGCGCGGC TACCTCTGCG AGAACTTCGG CGCGATCTTC
CGACTGCCCG ACCTGGGCGT GATCGGCTCG AACGGCCTGG CCAACCCGCG CGACTTCCTC
ACGCCGCACG CCTGGTACGA AGACCGCGAG GGCGATTTCG AGCTCGTCGC CAAGTTCCAC
GGCAACCTGT GGCGCGCGCG GATCGGGCAC TCGCCGCTGG ACGTGGTGGC CTGGCACGGC
AACTACGCGC CGTACAAGTA CGACCTGCGC CTGTTCAACA CCATCGGCTC GATCAGCTAC
GACCACCCGG ACCCGTCGAT CTTCCTGGTG CTGCAGAGCG TGTCCGACAC GCCGGGCGTG
GACGCCATCG ATTTCGTCAT CTTCCCGCCA CGCTGGCTGG CGATGGAGCA CTCGTTCCGC
CCGCCCTGGT TCCACCGCAA CATCGCCAGC GAGTTCATGG GCCTGATCCA GGGCGTGTAC
GACGCCAAGG CCGAGGGCTT CGTGCCGGGC GGCGCGAGCC TGCACAACTG CATGACGGGC
CACGGCCCCG ATGCCGAGAC CTTCGAGAAA GCCAGTCATG CCGACACGAC CCAGCCGCAC
AAGGTGGAGG CGACGATGGC CTTCATGTTC GAGACCCGCG GCGTGATCCG CCCGACACGT
TTCGCGGCCG AATCGGCGCA GCTGCAGGCC AGGTACTTCG AGTGCTGGCA GGGCCTGAAG
AAGCATTTCG ACCCGGCCAA GCGCTGA
 
Protein sequence
MNMLAPAAKN AFTPASPDRP AYQSGFGNEF ATEALPGALP HGQNSPQQAP YGLYAEQLSG 
TAFTAPRAHN RRAWLYRIRP AAVHLPFEPI AQDRFHSDFH AVPASPNQLR WDPLPAPAAG
TDFIDGIVTF AGNGGPDAQT GCGIHLYAAN ASMTGRFFYN ADGELLIVPQ QGRLRLLTEL
GVLDVEPLEI AVIPRGVRFR VELPDGEARG YLCENFGAIF RLPDLGVIGS NGLANPRDFL
TPHAWYEDRE GDFELVAKFH GNLWRARIGH SPLDVVAWHG NYAPYKYDLR LFNTIGSISY
DHPDPSIFLV LQSVSDTPGV DAIDFVIFPP RWLAMEHSFR PPWFHRNIAS EFMGLIQGVY
DAKAEGFVPG GASLHNCMTG HGPDAETFEK ASHADTTQPH KVEATMAFMF ETRGVIRPTR
FAAESAQLQA RYFECWQGLK KHFDPAKR
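
The gene and protein sequences above are related by the genetic code named in the record (translation table 11), and the GC content field is a simple base count over the gene sequence. The following is a minimal sketch of both calculations, not the database's own method. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which does not matter here because this gene begins with ATG.

```python
# Codon table in the conventional T, C, A, G ordering (standard amino acid
# assignments, which translation table 11 shares).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_bases = "ATGAACATGC TTGCTCCCGC GGCCAAGAAC"
print(translate(first_bases))  # MNMLAPAAKN
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow the listed protein sequence and GC content to be checked against the record.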