Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | AnaeK_3504 |
Symbol | |
ID | 6783809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaeromyxobacter sp. K |
Kingdom | Bacteria |
Replicon accession | NC_011145 |
Strand | + |
Start bp | 3965814 |
End bp | 3966971 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 642764975 |
Product | homogentisate 12-dioxygenase |
Protein accession | YP_002135846 |
Protein GI | 197123895 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.142046 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGACC GGATCGTCCA GGGAGAGGTG CCGCGCAAGC ACCACCTCGC GTTCCGCGAC GCGGAGGGCC GGCTCCTCCA CGAGGAGGCG TACACCCGCG CCGGCTTCGA CGGTCCGTAC GCCCTGCTCT ACCACCGCAA CCGGCCGCAC GCGGTCCACG CCGCCCAGGC CCGGAACGGC TTCCCGCTGC CGCAGCCCGC GCCGGCGCGG GCGCTGCTGA AGCGCCACTA CCGCACCCAG GACCTCGCGG CGCGCGGCGG GCCGCCGGTG GACGCGCGGC GCCCGCTCCT GTTCAACGCC GACGTGGTGG TCGGCCTGGT GACGCCCACC TCCGAGGACC CGGTCTACTT CGCGAACGGC GACGGCGACG ACCTCTACTT CGTCGCGGAG GGCGGCGGCC TGCTGCGCTC GCCCATGGGC GACCTGCGCT TCGGGCAGCA CGACTACGTC TACGTGCCGA GGGGCCTGCT GCACCGGTTC GTGCCCGACG CCGGGCCGCA GCGCTGGCTG TCGCTCGAGT TCCCCGGCGG CATGCACCTG CCGGCGCAGT GGCGGAACGA GACCGGCCAG CTCCGCATGG ACGCCCCCTA CTCCCACCGC GACTTCCGGC GGGTGGAGCT GAAGGGCCCG GTGGACGAGG GCCTCCGCGA GCTGGTGGTG AAGCGGGCCG GGGCGTTCCA CGGGTTCCGC TACGACGCGT CGCCGCTCGA CGTCGTCGGC TGGGACGGCG CGCTCTACCC GTTCGCCTTC CCCATCCTGA ACTTCCAGCC GCGGGCCGGG CTGGTGCACC TGCCGCCGAC GTGGCACGGC ACCTTCGCGG CGCGCGGCGC GCTGGTCTGC TCGTTCGTGC CGCGGATGAC CGACTTCCAC CCGGAGGCCG TGCCCTGCCC GTACCCGCAC GCCTCGGTGG ACGTGGACGA GATCCTGTTC TACGTGCGGG GGGAGTTCAC CAGCCGCCGC GGCGTCGGCC CGGGCTCGAT CTCCCACCAC CCCGCCGGCA CCATGCACGG CCCCCACCCC GGCGCCTACG AGGCGTCGAT CGGCACGCGC AGCACGAACG AGCTGGCGGT GATGCTCGAC TGCTACCTGC CGCTGCAGCC CACGGCGATC GCGCTCGGCA TCGAGGACCC CGGCTACCAG GAGAGCTTCG TCGGCTGA
|
Protein sequence | MLDRIVQGEV PRKHHLAFRD AEGRLLHEEA YTRAGFDGPY ALLYHRNRPH AVHAAQARNG FPLPQPAPAR ALLKRHYRTQ DLAARGGPPV DARRPLLFNA DVVVGLVTPT SEDPVYFANG DGDDLYFVAE GGGLLRSPMG DLRFGQHDYV YVPRGLLHRF VPDAGPQRWL SLEFPGGMHL PAQWRNETGQ LRMDAPYSHR DFRRVELKGP VDEGLRELVV KRAGAFHGFR YDASPLDVVG WDGALYPFAF PILNFQPRAG LVHLPPTWHG TFAARGALVC SFVPRMTDFH PEAVPCPYPH ASVDVDEILF YVRGEFTSRR GVGPGSISHH PAGTMHGPHP GAYEASIGTR STNELAVMLD CYLPLQPTAI ALGIEDPGYQ ESFVG
|
| |