Gene Cagg_3805 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3805 
Symbol 
ID7266285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4641402 
End bp4642568 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content58% 
IMG OID643568617 
Producthomogentisate 12-dioxygenase 
Protein accessionYP_002465077 
Protein GI219850644 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00200515 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.427722 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTTTT ACCAGCGTCT CGGCCACGTT CCCCACAAGC GCCATACCCA GTTCCGTAAG 
CCTGATGGCA AGCTCTACCG CGAAGAGGTG ATGGGTCTTG AAGGCTTTCA CGGCATTCAG
TCCATCCTCT ACCATCACTT CTTGCCGCCA CGGGTGCTGC GCGCCGAACT GGTTGGTTCC
GCTAAGCCGG AATACGTCGA ATTCGGCCCG ATCCGCCACC GTGCGTTTAC CACGGCGAAC
GTGCCCGCTG GCGGCGATCC GGTGAGTTCC CGCGTTACCT TGCTTGGTAA TAATGATGTG
ACGATCGGCG TGAGCCGGCC CACCGAGAGT ATGACCGGCT TCTATCGCAA TGCACAAGCG
TATGAGGTGT GGTTTGCGCA CGAGGGCAGC GGTGAGCTGC TCTCGCAGTT TGGTCGTTTG
CCGTTTAGCG CCGGTGATTA CGTCGTTATC CCGTTTGGCG TGACATGGCA GATGCAGCTC
GCTGGCCCGG CCCGCTTTTT GGTGATAGAA GCGACCGGCC AGATCGCGCC CCCCAAACGC
TACCGCAACC AGTTTGGCCA GTTGCTTGAG CATGCACCCT ATTGCGAGCG GGATATTCGC
GGCCCCGGCG AGTTGCTCAC CTTTACCGAT ACCGGCGAGT TTGAGGTGTT GGTGAAGGTA
CGCGATCAGC TCACGCGCCA CGTGCTCGAT CACCATCCCT TTGATGTGGT GGGCTGGGAT
GGCTATCTCT ACCCGTGGGC CTTTTCGATC CACGACTTCG AGCCGATTAC CGGGCGCATC
CATCAGCCGC CGCCGGTGCA TCAAACTTTC GAGGGTCATA ACTTTGTAAT CTGTTCGTTC
GTGCCGCGTC TGTTCGATTA TCATCCTGAA GCGATCCCGG CTCCGTACAA TCACTCGAAC
GTTAACTCCG ATGAGGTGAT TTATTACTGT GATGGTAACT TCATGTCGCG CCGTGGTATT
GAGCGATGTG ACATAACTTT GCATCCTGCC GGTTTGCCGC ACGGTCCGCA GCCGGGTAGC
ACCGAAGCTA GTATCGGTGC CAAAGAAACG CGCGAGTTAG CCGTGATGAT CGATACCTTC
CATCCGCTGC ATCTGACGAC TGCCGCGCTC GAGTTGGAGA AGGCGGGGTA TATGGATTCG
TGGAGCGTGG GTGATCCGGA AGGTTAG
 
Protein sequence
MPFYQRLGHV PHKRHTQFRK PDGKLYREEV MGLEGFHGIQ SILYHHFLPP RVLRAELVGS 
AKPEYVEFGP IRHRAFTTAN VPAGGDPVSS RVTLLGNNDV TIGVSRPTES MTGFYRNAQA
YEVWFAHEGS GELLSQFGRL PFSAGDYVVI PFGVTWQMQL AGPARFLVIE ATGQIAPPKR
YRNQFGQLLE HAPYCERDIR GPGELLTFTD TGEFEVLVKV RDQLTRHVLD HHPFDVVGWD
GYLYPWAFSI HDFEPITGRI HQPPPVHQTF EGHNFVICSF VPRLFDYHPE AIPAPYNHSN
VNSDEVIYYC DGNFMSRRGI ERCDITLHPA GLPHGPQPGS TEASIGAKET RELAVMIDTF
HPLHLTTAAL ELEKAGYMDS WSVGDPEG