Gene Information |
Locus tag | Adeh_3423 |
Symbol | |
ID | 3889431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaeromyxobacter dehalogenans 2CP-C |
Kingdom | Bacteria |
Replicon accession | NC_007760 |
Strand | + |
Start bp | 3938590 |
End bp | 3939747 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637864978 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_466627 |
Protein GI | 86159842 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | |
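The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that matches the reported 1158 bp) and that the gene length includes one terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(3938590, 3939747)
print(length, protein_length_aa(length))  # 1158 385
```

Both results agree with the Gene Length and Protein Length fields of the record.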
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00233011 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGACC GGATCGTCCA GGGAGAGGTG CCGCGCAAGC ACCACCTCGC GTTCCGCGAC GCGGAGGGCC GGCTGCTCCA CGAGGAGGCG TACACCCGCG CCGGCTTCGA CGGGCCCTAC GCGCTGCTGT ACCACCGCAA CCGGCCGCAC GCGGTCCACG CCGCCCAGGC CCGGAACGGC TTCCCGCTGC CGCAGCCCGC GCCCGCCCGG GCGCTGCTGA AGCGCCACTA CCGGACGCAG GACCTCGCGG CGCGCGGCGG GCCGCCGGTG GACGCCCGGC GCCCGCTCCT GTTCAACGCC GACGTGGTGG TCGGGCTGGT GACGCCCACC TCCGAGGACC CGGTCTACTT CGCGAACGGC GACGGCGACG ACCTCTACTT CGTCGCCGAG GGCGGCGGGC TCCTGCGCTC GCCCATGGGC GACCTGCGCT TCGGGCAGCA CGACTACGTC TACGTGCCGA GGGGCCTGCT GCACCGGATC GTGCCCGACG CCGGGCCGCA GCGCTGGCTG TCGCTCGAGT TCCCGGGCGG CATGCACCTG CCGGCGCAGT GGCGCAACGA GACCGGCCAG CTCCGCATGG ACGCCCCCTA CTCCCACCGC GACTTCCGGC GCGTGGAGCT GAAGGGCCCG GTGGACGAGG GGCTCCGCGA GCTGGTGGTG AAGCGCGCCG GCGCGTTCCA CGGCTTCCGC TACGACGCGT CGCCGCTCGA CGTGGTGGGC TGGGACGGCG CGCTCTACCC GTTCGCGTTC CCCATCCTGA ACTTCCAGCC GCGCGCCGGG CTGGTGCACC TGCCGCCGAC GTGGCACGGC ACCTTCGCGG CGCGCGGCGC GCTGGTCTGC TCGTTCGTGC CGCGGGTGAC CGACTTCCAC CCCGAGGCCG TGCCCTGCCC GTACCCGCAC GCCTCGGTGG ACGTGGACGA GATCCTGTTC TACGTGCGGG GCGAGTTCAC CAGCCGCCGC GGCGTCGGCC CCGGCTCGAT CTCGCACCAC CCCGCCGGCA CGATGCACGG CCCGCACCCC GGCGCGTACG AGGCGTCGAT CGGCACGCGC AGCACGAACG AGCTGGCGGT GATGCTCGAC TGCTACCTGC CGCTGCAGCC CACGGCCATC GCGCTCGGCA TCGAGGACCC CGGCTACCAG GAGAGCTTCG TCGGCTGA
|
Protein sequence | MLDRIVQGEV PRKHHLAFRD AEGRLLHEEA YTRAGFDGPY ALLYHRNRPH AVHAAQARNG FPLPQPAPAR ALLKRHYRTQ DLAARGGPPV DARRPLLFNA DVVVGLVTPT SEDPVYFANG DGDDLYFVAE GGGLLRSPMG DLRFGQHDYV YVPRGLLHRI VPDAGPQRWL SLEFPGGMHL PAQWRNETGQ LRMDAPYSHR DFRRVELKGP VDEGLRELVV KRAGAFHGFR YDASPLDVVG WDGALYPFAF PILNFQPRAG LVHLPPTWHG TFAARGALVC SFVPRVTDFH PEAVPCPYPH ASVDVDEILF YVRGEFTSRR GVGPGSISHH PAGTMHGPHP GAYEASIGTR STNELAVMLD CYLPLQPTAI ALGIEDPGYQ ESFVG