Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_15350 |
Symbol | hmgA |
ID | 7760470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1508074 |
End bp | 1509459 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643804432 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_002798725 |
Protein GI | 226943652 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTTTCG ACAACCCCGC CGATGGGCCG GAACACGGCG CACGATGCTC CGGCGCACGG ATCGATACTG ACAGAGAGGT CGCCATGCAC ACGGCAGAAC CATCCGCCCC GGGCTACCAG ACGGGCTTCG GCAACACCTT CAGCAGCGAG GCCCTGCCGG GCGCCCTGCC CATCGGACAG AACTCGCCGC AACGTCTCCC CTACGGCCTG TATGCCGAGC AACTCTCCGG CACCGCCTTC ACCGTGCCGC GCAGCGAAGC GCGGCACGCC TGGCTGTACC GCATTCGCCC CTCGGCCAAC CACCCGGATT TCAAACGCCT GGGCTCACAG ATCAGCGGCA TCGAGCAGGG GCCGATCACC CCCAATCGGC TGCGCTGGGC GCCCTTCGAA GTACCGGCGG AGCACACCGA CTTTCTCGAC GGCCTGATCC GCCTGGCGGC GACCGCAGCG GCGGAACAGG CCGACGGCGT GAGCCTCTAC GTCTATCGCG CCAACGCCTC GATGGAATCG GTATTCTTCG ACGCCGATGG CGAACTGCTG CTCGTGCCCG AATCTGGGCG CCTGGGCATC GACACGGAAC TGGGACGGCT GGAGATCGGC CCGCTGCAGA TCGCCGTGGT GCCGCGCGGG GTGCGCTTTC GCGTCGAACT GCTGGACGCC ACGGCGCGCG GCTACCTGTG CGAGAACCAC GGCAGCCCGC TGCGCCTGCC CGAACTGGGC CCGATCGGCA GCAACGGCCT GGCCAACCCC CGCGACTTCC TCGTCCCGGT CGCTCGCTAC GAGGACCGCG ACGGGCCGGT GCAACTGGTG CAGAAGTTCC TCGGCGAGCT ATGGGCCTGC GAGTTGAACC ACTCGCCGCT GGATGTGGTC GCCTGGCACG GCAACCACGT GCCCTACAGC TACGACCTGC GCCGCTTCAA CACCTTCGGC ACGGTCAGTT TCGACCATCC CGACCCGTCG ATCTTCACCG TGCTGACCTC ACCCGGCAGT GTGCCCGGCC AAGCCAACGT CGACTTCGTG ATCTTCCCGC CGCGCTGGAT GGTGGCCGAG CACACATTTC GCCCGCCCTG GTTCCACCGC AATCTGATGA ACGAGTTCAT GGGCCTGATC CAGGGCGTCT ACGACGCCAA GGCCGGCGGC TTCCTGCCCG GCGGCGCCTC GCTGCACAAT CGCATGAGCG CCCACGGCCC CGACGCGGCG ACCACCCGCC AGGCCATCGC CGCCGACCTG CAGCCACAGA AGTTCGAGAA CACCATGGCC TTCATGTTCG AGACCGGCCA GGTCCTGCGC CCGAGCCGCC TCGCCATCGA CTGCCCGCAA CGACAGACCG ACTACGATGG CTGCTGGTCG GGGCTGGCCG GGACCTTCGA TCCGAACCGG AAATAG
|
Protein sequence | MPFDNPADGP EHGARCSGAR IDTDREVAMH TAEPSAPGYQ TGFGNTFSSE ALPGALPIGQ NSPQRLPYGL YAEQLSGTAF TVPRSEARHA WLYRIRPSAN HPDFKRLGSQ ISGIEQGPIT PNRLRWAPFE VPAEHTDFLD GLIRLAATAA AEQADGVSLY VYRANASMES VFFDADGELL LVPESGRLGI DTELGRLEIG PLQIAVVPRG VRFRVELLDA TARGYLCENH GSPLRLPELG PIGSNGLANP RDFLVPVARY EDRDGPVQLV QKFLGELWAC ELNHSPLDVV AWHGNHVPYS YDLRRFNTFG TVSFDHPDPS IFTVLTSPGS VPGQANVDFV IFPPRWMVAE HTFRPPWFHR NLMNEFMGLI QGVYDAKAGG FLPGGASLHN RMSAHGPDAA TTRQAIAADL QPQKFENTMA FMFETGQVLR PSRLAIDCPQ RQTDYDGCWS GLAGTFDPNR K
|
| |