Gene BBta_0566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_0566 
SymbolhmgA 
ID5153991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp570086 
End bp571435 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content64% 
IMG OID640555577 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_001236750 
Protein GI148252165 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.11982 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.485064 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAACA TCAATACATC GCCTGACCAG CTCAGCCGCG CCTCGGCGCA GGTTACGCCG 
GGCTACATGT CCGGCTTTGG CAATTCGTTC GAGACCGAGG CGCTGCCCGG CGCGCTGCCG
ATCGGCCGTA ACTCGCCGCA GCGCTGCGCC TATGGCCTCT ATGCCGAGCA GCTTTCCGGC
TCGCCCTTCA CCGCGCCGCG CGGCTCCAAC GAGCGCTCCT GGCTGTATCG CATCCGTCCT
TCGGTGAAGC ACTCCGGCCG CTTCGCGAAG GTCGACGCCG GGCTGTGGCG CACGGCGCCG
TGCCACGAGC AGGAGATCAC GGTGCAGCAG CTGCGCTGGG ATCCGACACC GGTGCCGTCC
GGTGAGGTCA CCTTCCTGCA GGGCGTGCAG ACCATGACGA CGGCGGGCGA CGCCGCGACG
CAGTCCGGCA TGGCCGCGCA TGTCTATGTC ATCACCAAGT CGATGGTCGA TCAGCATTTC
TACAATGCTG ACGGCGAACT GATGTTCGTG CTGCAGCAGG GACGCCTGTT GTTCGTCACC
GAGTTCGGCC GCATCGATGC CGCGCCCGGC GAGATCGTCA TGATCCCGCG CGGCGTCAAG
TTCCGTGTCG AGCTTCCGGC AGGGCCGGCG CGGGGCTATC TGTGTGAGAA TTATGGCGGG
GCTTTCACCC TGCCGGAGCG TGGCCCGATC GGCGCCAACT GCCTCGCCAA CGCCCGCGAC
TTCCTGACGC CAGTCGCTTC GTATGAGGAC AAGGACACGC CGACCGAGCT CTACGTGAAG
TGGGGCGGTG CGCTGTTCAA GACCGAGCTG AAGCACTCGC CGATCGACGT CGTCGCCTGG
CATGGCAACT ACGCGCCATA CAAATATGAT CTGCGAACCT TCTCGCCGGT CGGCGCCATC
GGTTTCGATC ATCCCGATCC ATCGATCTTC ACGGTGCTGA CCTCGCCGTC GGAAACGGCC
GGCACGGCCA ATATCGACTT CGTGATCTTC CCCGAGCGCT GGATGGTGGC CGACAACACC
TTCCGTCCGC CATGGTATCA CATGAACATC ATGTCCGAAT TCATGGGCCT GATCTACGGC
GTCTACGACG CCAAGCCGCA GGGCTTCGTT CCCGGCGGCA TGTCGCTGCA CAATTGCATG
CTGCCGCACG GACCGGATCG CGAGGCGTTC GATCACGCCA GCAATGGCGA GTTGAAGCCG
GTGAAGCTCA CGGGCACTAT GGCCTTCATG TTCGAGACCC GCTTCCCGCA GCGGATCACC
AAGCATGCGG CGGAGTCCAG CACACTGCAG CCGGATTACG CGGACTGCTG GAACGGACTG
GAAAAGCGTT TTGATCCGAA CCGGCCTTAG
 
Protein sequence
MMNINTSPDQ LSRASAQVTP GYMSGFGNSF ETEALPGALP IGRNSPQRCA YGLYAEQLSG 
SPFTAPRGSN ERSWLYRIRP SVKHSGRFAK VDAGLWRTAP CHEQEITVQQ LRWDPTPVPS
GEVTFLQGVQ TMTTAGDAAT QSGMAAHVYV ITKSMVDQHF YNADGELMFV LQQGRLLFVT
EFGRIDAAPG EIVMIPRGVK FRVELPAGPA RGYLCENYGG AFTLPERGPI GANCLANARD
FLTPVASYED KDTPTELYVK WGGALFKTEL KHSPIDVVAW HGNYAPYKYD LRTFSPVGAI
GFDHPDPSIF TVLTSPSETA GTANIDFVIF PERWMVADNT FRPPWYHMNI MSEFMGLIYG
VYDAKPQGFV PGGMSLHNCM LPHGPDREAF DHASNGELKP VKLTGTMAFM FETRFPQRIT
KHAAESSTLQ PDYADCWNGL EKRFDPNRP