Gene Cagg_3806 details

Gene Information

Locus tag: Cagg_3806
Symbol:
ID: 7266286
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4642680
End bp: 4643927
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 58%
IMG OID: 643568618
Product: fumarylacetoacetase
Protein accession: YP_002465078
Protein GI: 219850645
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway)
TIGRFAM ID: [TIGR01266] fumarylacetoacetase
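
The length fields above can be cross-checked against the start and end coordinates. The following is a minimal Python sketch, not part of the original record. The variable names are illustrative, and it assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon that is counted in the gene length but not in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 4642680
end_bp = 4643927

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under those assumptions the result is 1248 bp and 415 aa, which agrees with the Gene Length and Protein Length fields.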


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.020804
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.595567
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACTGC AAAGCTTTGT TTCTATTACA CCTGACAGTG ATTTTCCGCT CGAAAATCTG 
CCCTACGGTG TCTTCCGTCT GCGTAGTGGC GGTACGGCGC GGGTCGGGGT GGCGATTGGT
GAATACGTGC TCGATCTCGC AGTGCTCGAT GAGGCCGGTT TGTTGGCTTC GACGCCGGTG
GCCGGGCAAG GGTTGTTTAC CCGTGATTCC CTTAACGGAT TTATGGCTGC GGGTCCGGCG
GCGTGGCAGG CAGTGCGCAA CACGCTGCAA CGGCTGCTCG CTGCCGATGA GCCAACGTTA
CGCGATCACC AGCCGCTGCG CGACGCCGCG CTGATCCGGC AAAGCGAGGT TGAGCTGCTG
CTGCCGGTGC AGATCGGCGA TTTCACCGAC TTCTATTCGT CGCTTTACCA TGCCACCAAC
ACCGGCAAGA TGCTGCGTCC CGATAGTCCT CCACTTTACC CGAATTGGCG GCATATGCCG
GTAGCGTACC ATGGTCGGGC TAGTACCGTG GTAGTTAGCG GTACACCGAT TCGCCGTCCC
TGTGGTCAGA TCAAGCCGTC GCGTAGCCCA GAACCGTTCT TTTCACCGTC ACGTGCCCTC
GATTTCGAGG TTGAGTTGGC GATGGTTATC GGTGTGGGTA GCGAGTTAGG GGTGCCGGTA
CCGATTGCGC AGGCTGAAGA GCACATCTTT GGCTTTGTGA TCCTCAATGA CTGGAGCGCG
CGTGATATTC AGGGGTGGGA GTATCAGCCG CTTGGCCCCT TCTTGTCAAA GAATTTTGCG
ACAACGATTA GCCCGTGGGT AGTTCCACTC GCAGCACTCG AACCGTTCCG CTGTAGTGGT
GAGCCGCAAG ACCCGCCACC GTTGTCGTAT CTGCAACCGC CACGACCGGG ACATTTTGAT
GTCACGCTCG AAGTTTGGCT CAACGATACG CGCATCTGCC AGACCAATGC TCGTCATCTG
TACTGGAGCT TTGCCCAGCA GCTTGCACAT CATACGGTGA ATGGTTGTCG GTTGCGGCCC
GGTGACCTTA TGGGTTCGGG AACGATCAGT GGTCCAACGA AGGAGTCGCG GGGTTGTTTG
TTTGAGTTGA CGTGGCGTGG TACCGAGCCG ATCCAACTGG CCGATGGTTC AACGCGGCGT
TGGTTGGAGG ATGGCGATAC GGTAACGATG CGCGCATGGG CGCAGGGTGA TGGGTACCGC
ATCGGGTTTG GCGAGGCGAC GGGGACGATC GTGGCAAATT CACCGTAG
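
The gene sequence is listed in blocks of ten bases, so the spaces and line breaks have to be removed before it can be measured. The sketch below shows one possible way to do that in Python and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first line of the listing, so its GC value is not expected to equal the 58% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used as a small example.
first_line = "ATGCCACTGC AAAGCTTTGT TTCTATTACA CCTGACAGTG ATTTTCCGCT CGAAAATCTG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full listing through `clean_sequence` first should give a string whose length can be compared with the Gene Length field.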
 
Protein sequence
MPLQSFVSIT PDSDFPLENL PYGVFRLRSG GTARVGVAIG EYVLDLAVLD EAGLLASTPV 
AGQGLFTRDS LNGFMAAGPA AWQAVRNTLQ RLLAADEPTL RDHQPLRDAA LIRQSEVELL
LPVQIGDFTD FYSSLYHATN TGKMLRPDSP PLYPNWRHMP VAYHGRASTV VVSGTPIRRP
CGQIKPSRSP EPFFSPSRAL DFEVELAMVI GVGSELGVPV PIAQAEEHIF GFVILNDWSA
RDIQGWEYQP LGPFLSKNFA TTISPWVVPL AALEPFRCSG EPQDPPPLSY LQPPRPGHFD
VTLEVWLNDT RICQTNARHL YWSFAQQLAH HTVNGCRLRP GDLMGSGTIS GPTKESRGCL
FELTWRGTEP IQLADGSTRR WLEDGDTVTM RAWAQGDGYR IGFGEATGTI VANSP
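
The protein sequence corresponds to the gene sequence read codon by codon. As a hedged illustration, the Python sketch below translates a DNA string using the amino acid assignments of translation table 11, which are the same as those of the standard genetic code. The names `CODON_TABLE` and `translate` are hypothetical, and the sketch does not handle the alternative start codons that table 11 permits, since this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, listed in TCAG order (TTT, TTC, TTA, ...).
# Table 11 assigns the same amino acids as the standard code.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCCACTGC AAAGCTTTGT TTCTATTACA"))
```

For the first 30 bases this yields MPLQSFVSIT, the first block of the protein sequence above.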