Gene Cagg_2235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2235 
Symbol 
ID7266808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2733000 
End bp2734010 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content60% 
IMG OID643567066 
Productfumarylacetoacetate (FAA) hydrolase 
Protein accessionYP_002463554 
Protein GI219849121 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0117437 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00077103 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCGACTGG TGAGTTTCAT TCCGCCTGCG GAAACTACGG CCCGCACCGG CGTTTTGCTC 
GGTGAGGCAA TTATCGATCT AGCGGCAGCG GCAGCACTGG TGAGTGAAGA AGTAGCCGCC
GCACCTTGGG ATATGCTCAC CCTCCTACGA GGTGATCATC CTGAGGTTAC GATCGCGACG
GCTGCCGATA TTGTGCAGGC AGTCGTGAAT GTGATGGCCG GTGAAGAGCC GGCAGAAGCA
CCACTGACCG AATTTGCGTG GCAGGCAGGC CTCACCATCG GCGAGACAGC ATTGGTGTTG
CCGGCGACCC AAGTACGCCT GGTTGCCCCA TTACCCCAGG CGCTGTCGCT GCGTGAATTT
GATGCGCTTG TCGATGAACC GACGGCTGCC CTCCGCCAAG CCGCCGGCTA TTGGGTGGGC
GACCGACGCT GGCCGTCCTT CCGCTTCGCT AATCACACCG CGATCTATGG CCCCGACGAC
CCTATCCCCT TACCGATGAG CGGCCCACTT GATTGCGGAA TGGCACTGGG GTGCGTGATC
GGGCAGGTCG GTCGTGATAT TCCTCCCGAC GAAGCTGATG CCTACATCGC CGGGTACGTC
CTCGTCAATG CGTGGACTAT CCGCGACCCG GTGCAAGCTG CTCTACGGCC GCGTGATGTT
GGCACCTCAT TGGGACCGTG GCTGGTAACA CCAGATGAAG TAGAATGCTA CCGCGACGAC
GATGGCCGGT TGATGTTGAC CCTTCGACTG TCGCTGAATG GGCGTGAGAT TGGGCAGTGT
AACACGGCAC TCATGCGGTT TTCATTTGGT GAACTGATTG CGTTTGCGAG CCGCGATACG
ACCCTCTACC CGGGTGAGGT ATTGATCGGT GGTGTCGCGA CCGGTGGTTG TCTCCTCGAT
CACTACGGTG ATGAAGGCCC ATGGCTGCGC ACCGGTGATG AGGTTGTGGT TGAGTGTCAT
GAGCTTGGCC GCCTACGTTC ACCGATCGGG CAGACGGATG AGCTGTTTTG A
 
Protein sequence
MRLVSFIPPA ETTARTGVLL GEAIIDLAAA AALVSEEVAA APWDMLTLLR GDHPEVTIAT 
AADIVQAVVN VMAGEEPAEA PLTEFAWQAG LTIGETALVL PATQVRLVAP LPQALSLREF
DALVDEPTAA LRQAAGYWVG DRRWPSFRFA NHTAIYGPDD PIPLPMSGPL DCGMALGCVI
GQVGRDIPPD EADAYIAGYV LVNAWTIRDP VQAALRPRDV GTSLGPWLVT PDEVECYRDD
DGRLMLTLRL SLNGREIGQC NTALMRFSFG ELIAFASRDT TLYPGEVLIG GVATGGCLLD
HYGDEGPWLR TGDEVVVECH ELGRLRSPIG QTDELF