Gene Information |
Locus tag | Cagg_3805 |
Symbol | |
ID | 7266285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 4641402 |
End bp | 4642568 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643568617 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_002465077 |
Protein GI | 219850644 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
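The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon. Neither convention is stated in this record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 4641402
END_BP = 4642568


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1167 388
```

Under those assumptions the result agrees with the reported Gene Length (1167 bp) and Protein Length (388 aa).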
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00200515 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.427722 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTTTT ACCAGCGTCT CGGCCACGTT CCCCACAAGC GCCATACCCA GTTCCGTAAG CCTGATGGCA AGCTCTACCG CGAAGAGGTG ATGGGTCTTG AAGGCTTTCA CGGCATTCAG TCCATCCTCT ACCATCACTT CTTGCCGCCA CGGGTGCTGC GCGCCGAACT GGTTGGTTCC GCTAAGCCGG AATACGTCGA ATTCGGCCCG ATCCGCCACC GTGCGTTTAC CACGGCGAAC GTGCCCGCTG GCGGCGATCC GGTGAGTTCC CGCGTTACCT TGCTTGGTAA TAATGATGTG ACGATCGGCG TGAGCCGGCC CACCGAGAGT ATGACCGGCT TCTATCGCAA TGCACAAGCG TATGAGGTGT GGTTTGCGCA CGAGGGCAGC GGTGAGCTGC TCTCGCAGTT TGGTCGTTTG CCGTTTAGCG CCGGTGATTA CGTCGTTATC CCGTTTGGCG TGACATGGCA GATGCAGCTC GCTGGCCCGG CCCGCTTTTT GGTGATAGAA GCGACCGGCC AGATCGCGCC CCCCAAACGC TACCGCAACC AGTTTGGCCA GTTGCTTGAG CATGCACCCT ATTGCGAGCG GGATATTCGC GGCCCCGGCG AGTTGCTCAC CTTTACCGAT ACCGGCGAGT TTGAGGTGTT GGTGAAGGTA CGCGATCAGC TCACGCGCCA CGTGCTCGAT CACCATCCCT TTGATGTGGT GGGCTGGGAT GGCTATCTCT ACCCGTGGGC CTTTTCGATC CACGACTTCG AGCCGATTAC CGGGCGCATC CATCAGCCGC CGCCGGTGCA TCAAACTTTC GAGGGTCATA ACTTTGTAAT CTGTTCGTTC GTGCCGCGTC TGTTCGATTA TCATCCTGAA GCGATCCCGG CTCCGTACAA TCACTCGAAC GTTAACTCCG ATGAGGTGAT TTATTACTGT GATGGTAACT TCATGTCGCG CCGTGGTATT GAGCGATGTG ACATAACTTT GCATCCTGCC GGTTTGCCGC ACGGTCCGCA GCCGGGTAGC ACCGAAGCTA GTATCGGTGC CAAAGAAACG CGCGAGTTAG CCGTGATGAT CGATACCTTC CATCCGCTGC ATCTGACGAC TGCCGCGCTC GAGTTGGAGA AGGCGGGGTA TATGGATTCG TGGAGCGTGG GTGATCCGGA AGGTTAG
|
Protein sequence | MPFYQRLGHV PHKRHTQFRK PDGKLYREEV MGLEGFHGIQ SILYHHFLPP RVLRAELVGS AKPEYVEFGP IRHRAFTTAN VPAGGDPVSS RVTLLGNNDV TIGVSRPTES MTGFYRNAQA YEVWFAHEGS GELLSQFGRL PFSAGDYVVI PFGVTWQMQL AGPARFLVIE ATGQIAPPKR YRNQFGQLLE HAPYCERDIR GPGELLTFTD TGEFEVLVKV RDQLTRHVLD HHPFDVVGWD GYLYPWAFSI HDFEPITGRI HQPPPVHQTF EGHNFVICSF VPRLFDYHPE AIPAPYNHSN VNSDEVIYYC DGNFMSRRGI ERCDITLHPA GLPHGPQPGS TEASIGAKET RELAVMIDTF HPLHLTTAAL ELEKAGYMDS WSVGDPEG
|
| |
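The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the codon assignments of translation table 11, which for elongation codons match the standard genetic code (the tables differ in which start codons are permitted, and this gene begins with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical and not part of any database tool. The space-separated 10-base groups used in this record are stripped before processing.

```python
# Codon table in the conventional TCAG ordering (standard assignments,
# assumed here to apply to translation table 11 as well).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. 10-base grouping) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCCGTTTT ACCAGCGTCT CGGCCACGTT"))  # MPFYQRLGHV
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_percent` gives a value that can be compared with the GC content field.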