Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Anae109_3485 |
Symbol | |
ID | 5374549 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaeromyxobacter sp. Fw109-5 |
Kingdom | Bacteria |
Replicon accession | NC_009675 |
Strand | + |
Start bp | 4091846 |
End bp | 4093003 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640845009 |
Product | homogentisate 12-dioxygenase |
Protein accession | YP_001380652 |
Protein GI | 153006327 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.306053 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGACC GGATCGTGCA GGGCGCGGTC CCGCGGAAGC ACCACATCGC CTTCCGCGAT CCCGACGGGC GCCTCCTGCA CGAGGAGGCC TTCACGCGCG CCGGGTTCGA CGGTGCGTAC ACCCTCGCCT ACCACCGGCA CCGGCCCCAC GCGACGCACG CCGCCGAGGT CCGTCACGGC TGGACGCTGC CGCGGGCGGC GCCGCCGCGC GGTCTCCTGA AGCGCCACTA CCGCACGCAG GAGCTCCCCC TGCCCGCGGG GCCGGCGGTG GACGCGCGCG TCCCGCTCCT CTTCAACGCG GACGTGGTCG TCGGCCTCGC CACGCCGAGC GCGGAGGATC CGGTCTACCT CTCGAACGGC GACGGCGACG ACCTCTTCTT CGTCCTCGAG GGAGGCGGGC TCTTGCGCAC GCCGCTCGGC GACCTGCGCT TCGCGCAGGA CGACTACGTC TACGTGCCGA AGGGGCTCCT CCACCGCTTC GTGCCGGGCG CGGGTCCGCA GCGCTGGCTC TCCCTCGAGT TCCCGGGCGG CCTCCACCTG CCCTCCCAGT GGCGCAACGA GACCGGCCAG CTCCGCATGG ACGCGCCCTA CTCCCACCGC GACTTCCGCC GCGTCGAGTG GACGGGCCCG CTCGACGAGG GGATCCGCGA GCTCCTCGTC AAGCGCGCCG GGGCTTTCCA CGCCTTCCGC TACGACGAGT CGCCGCTCGA CGTGGTGGGC TGGGACGGCG CCGTGTACCC GTTCGCCTTT CCCATCCTGA ACTTCCAGCC GCGCGCCGGG CTCGTGCACC TGCCGCCCAC CTGGCACGGC ACCTTCGCCG CCCGCGGCGC GCTGGTGTGC TCGTTCGTCC CGCGCGTGGT GGACTTCCAC CCCGAGGCGA TCCCCTGCCC CTACCCGCAC GCCTCGCCGG ACGTGGACGA GATCCTGTTC TACGTCCGCG GGGAGTTCAC CTCCCGGCGC GGCGTGGGCC CCGGCTCGAT CTCGCACCAC CCCGCGGGCG TGATGCACGG GCCGCACCCG GGCGCCTACG AGGGCTCGAT CGGCGCCCGC ACCACGAGCG AGCTCGCGGT CATGCTCGAC TGCTACCTCC CGCTCGCCGC GACCCCCGCC GCGCTGGGGA TCGAGGACCC CGGCTACCAG GAGAGCTTCG TGCGCTGA
|
Protein sequence | MLDRIVQGAV PRKHHIAFRD PDGRLLHEEA FTRAGFDGAY TLAYHRHRPH ATHAAEVRHG WTLPRAAPPR GLLKRHYRTQ ELPLPAGPAV DARVPLLFNA DVVVGLATPS AEDPVYLSNG DGDDLFFVLE GGGLLRTPLG DLRFAQDDYV YVPKGLLHRF VPGAGPQRWL SLEFPGGLHL PSQWRNETGQ LRMDAPYSHR DFRRVEWTGP LDEGIRELLV KRAGAFHAFR YDESPLDVVG WDGAVYPFAF PILNFQPRAG LVHLPPTWHG TFAARGALVC SFVPRVVDFH PEAIPCPYPH ASPDVDEILF YVRGEFTSRR GVGPGSISHH PAGVMHGPHP GAYEGSIGAR TTSELAVMLD CYLPLAATPA ALGIEDPGYQ ESFVR
|
| |