Gene Information |
Locus tag | BBta_0566 |
Symbol | hmgA |
ID | 5153991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 570086 |
End bp | 571435 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640555577 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_001236750 |
Protein GI | 148252165 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
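The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The record does not state either assumption, but both agree with its numbers. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon (assumed to be counted in the gene length) encodes no residue.
    return codons - 1 if includes_stop else codons


length = gene_length_bp(570086, 571435)
print(length)                     # 1350
print(protein_length_aa(length))  # 449
```

Under these assumptions, 571435 - 570086 + 1 gives the listed 1350 bp, and 1350 / 3 - 1 gives the listed 449 aa.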
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.11982 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.485064 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAACA TCAATACATC GCCTGACCAG CTCAGCCGCG CCTCGGCGCA GGTTACGCCG GGCTACATGT CCGGCTTTGG CAATTCGTTC GAGACCGAGG CGCTGCCCGG CGCGCTGCCG ATCGGCCGTA ACTCGCCGCA GCGCTGCGCC TATGGCCTCT ATGCCGAGCA GCTTTCCGGC TCGCCCTTCA CCGCGCCGCG CGGCTCCAAC GAGCGCTCCT GGCTGTATCG CATCCGTCCT TCGGTGAAGC ACTCCGGCCG CTTCGCGAAG GTCGACGCCG GGCTGTGGCG CACGGCGCCG TGCCACGAGC AGGAGATCAC GGTGCAGCAG CTGCGCTGGG ATCCGACACC GGTGCCGTCC GGTGAGGTCA CCTTCCTGCA GGGCGTGCAG ACCATGACGA CGGCGGGCGA CGCCGCGACG CAGTCCGGCA TGGCCGCGCA TGTCTATGTC ATCACCAAGT CGATGGTCGA TCAGCATTTC TACAATGCTG ACGGCGAACT GATGTTCGTG CTGCAGCAGG GACGCCTGTT GTTCGTCACC GAGTTCGGCC GCATCGATGC CGCGCCCGGC GAGATCGTCA TGATCCCGCG CGGCGTCAAG TTCCGTGTCG AGCTTCCGGC AGGGCCGGCG CGGGGCTATC TGTGTGAGAA TTATGGCGGG GCTTTCACCC TGCCGGAGCG TGGCCCGATC GGCGCCAACT GCCTCGCCAA CGCCCGCGAC TTCCTGACGC CAGTCGCTTC GTATGAGGAC AAGGACACGC CGACCGAGCT CTACGTGAAG TGGGGCGGTG CGCTGTTCAA GACCGAGCTG AAGCACTCGC CGATCGACGT CGTCGCCTGG CATGGCAACT ACGCGCCATA CAAATATGAT CTGCGAACCT TCTCGCCGGT CGGCGCCATC GGTTTCGATC ATCCCGATCC ATCGATCTTC ACGGTGCTGA CCTCGCCGTC GGAAACGGCC GGCACGGCCA ATATCGACTT CGTGATCTTC CCCGAGCGCT GGATGGTGGC CGACAACACC TTCCGTCCGC CATGGTATCA CATGAACATC ATGTCCGAAT TCATGGGCCT GATCTACGGC GTCTACGACG CCAAGCCGCA GGGCTTCGTT CCCGGCGGCA TGTCGCTGCA CAATTGCATG CTGCCGCACG GACCGGATCG CGAGGCGTTC GATCACGCCA GCAATGGCGA GTTGAAGCCG GTGAAGCTCA CGGGCACTAT GGCCTTCATG TTCGAGACCC GCTTCCCGCA GCGGATCACC AAGCATGCGG CGGAGTCCAG CACACTGCAG CCGGATTACG CGGACTGCTG GAACGGACTG GAAAAGCGTT TTGATCCGAA CCGGCCTTAG
Protein sequence | MMNINTSPDQ LSRASAQVTP GYMSGFGNSF ETEALPGALP IGRNSPQRCA YGLYAEQLSG SPFTAPRGSN ERSWLYRIRP SVKHSGRFAK VDAGLWRTAP CHEQEITVQQ LRWDPTPVPS GEVTFLQGVQ TMTTAGDAAT QSGMAAHVYV ITKSMVDQHF YNADGELMFV LQQGRLLFVT EFGRIDAAPG EIVMIPRGVK FRVELPAGPA RGYLCENYGG AFTLPERGPI GANCLANARD FLTPVASYED KDTPTELYVK WGGALFKTEL KHSPIDVVAW HGNYAPYKYD LRTFSPVGAI GFDHPDPSIF TVLTSPSETA GTANIDFVIF PERWMVADNT FRPPWYHMNI MSEFMGLIYG VYDAKPQGFV PGGMSLHNCM LPHGPDREAF DHASNGELKP VKLTGTMAFM FETRFPQRIT KHAAESSTLQ PDYADCWNGL EKRFDPNRP
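The sequences above are printed in space-separated blocks of ten characters, so they must be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned and how fields such as length and GC content could be recomputed. The helper names are hypothetical, and the stop-codon set assumes the standard stop codons used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def clean_sequence(grouped: str) -> str:
    """Join whitespace-separated blocks into one uppercase sequence string."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons ending in a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence above, used as a small sample.
sample = clean_sequence("ATGATGAACA TCAATACATC")
print(len(sample), gc_fraction(sample))  # 20 0.3
```

Applied to the full gene sequence, `len` should match the Gene Length field and `gc_fraction` can be compared with the GC content field. The results for the full sequence are not reproduced here.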