Gene Cagg_3589 details


Gene Information

Locus tag: Cagg_3589
Symbol:
ID: 7269733
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4363711
End bp: 4364769
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 55%
IMG OID: 643568397
Product: histone deacetylase superfamily
Protein accession: YP_002464863
Protein GI: 219850430
COG category: [B] Chromatin structure and dynamics; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein
TIGRFAM ID:
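
The coordinates and lengths listed above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4363711, 4364769)
print(length_bp)                  # 1059, matching "Gene Length"
print(protein_length(length_bp))  # 352, matching "Protein Length"
```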


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000112978
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGACAACAG CTATCGTGAT CGATCAGCGG TTTGATTTAC ATACATGGCA CGGCCATGTC 
GAACATGCCG GACGGTTGCA AGCGATTCAG CGGGCATTGC AAACATCTGG ACTGCTACCC
AGCCTTATGC AACTCCCGAT TCGCGCCGCA ACCGAAGCTG AACTGCTGGC CGTGCATAGT
TCACATATGC TGCACCGGGT CCGCCAATTG GCCAGCTACG GCGGCGGTCA AATCGATAGT
GATACCTACG TGACTGCCGA TTCATGGGAC GTGGGGCTGT TAGCCGCCGG TGCAACTATT
GCTATGGTCG AAGCCATTGC CGAAGGCCGT TGCCATAATG GCTTTGCATT GGTACGTCCA
CCCGGCCACC ACGCGACCGA TGTGCGCTCG ATGGGGTTCT GTCTCTTCAA CAATATCGCC
ATCGCCGCAC GTGTCTTGCT CGACCGCTAC GACATTCGGC GGCTCGCCAT CGTTGACTTC
GACGTTCACC ATGGCAATGG CACCCAAGAC ATCTTTTATC GCGATGGACG GGTGCTCTTT
TGTTCTACCC ATGCTTCGCC ACTCTATCCC GGCACCGGTG CAGTGTACGA AACCGGTGAT
CCGCATATGG CAAACGGTAC CACACTGAAT GTTCCTCTTC CTTACGGGAC AGGGGATGAG
GGCTACGATC GTGTGTTTCG ACAGGTGATC GGCCCGGCCA TCCATCAGTT CCAACCCGAA
ATATTGCTGG TTTCGGCCGG ATTTGACGCA CACTGGAGCG ATCCGATTGG ACCGATGGCA
CTGTCGGTCC ACGGATTTGC CCGTCTCGTT CAGCATCTTC TAACGTGGGC ACAGACCTTA
TGTAACGGAC GGATCGGGTT CGTGCTTGAG GGTGGTTACA ATATAGCGGC TCTTACTGCA
AGTGTCATTG CCACCCTACG CCTCATGCTA GGGATGGATC CGGGACCAGA CCCACTCGGA
AAGATGAATG CACCCGAACC GAATATTGAT CGGATTATTA CCACGTTACA CACGCAACAT
CCGTTACTCA TACAAGCAGG CTATCAAGGA GTCGCATGA
 
Protein sequence
MTTAIVIDQR FDLHTWHGHV EHAGRLQAIQ RALQTSGLLP SLMQLPIRAA TEAELLAVHS 
SHMLHRVRQL ASYGGGQIDS DTYVTADSWD VGLLAAGATI AMVEAIAEGR CHNGFALVRP
PGHHATDVRS MGFCLFNNIA IAARVLLDRY DIRRLAIVDF DVHHGNGTQD IFYRDGRVLF
CSTHASPLYP GTGAVYETGD PHMANGTTLN VPLPYGTGDE GYDRVFRQVI GPAIHQFQPE
ILLVSAGFDA HWSDPIGPMA LSVHGFARLV QHLLTWAQTL CNGRIGFVLE GGYNIAALTA
SVIATLRLML GMDPGPDPLG KMNAPEPNID RIITTLHTQH PLLIQAGYQG VA
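
The relationship between the two sequence blocks above can be illustrated with a minimal Python sketch: it strips the 10-character grouping, computes a GC fraction, and translates codons using the amino-acid assignments of translation table 11. All helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The sketch assumes an uppercase A/C/G/T sequence on the coding strand and does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
# Amino-acid assignments for NCBI translation table 11, in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_bases = clean("ATGACAACAG CTATCGTGAT CGATCAGCGG")
print(translate(first_bases))  # MTTAIVIDQR, the first ten residues of the protein
```

Passing the full gene sequence block to `clean` and then to `gc_fraction` or `translate` would allow the listed GC content and protein sequence to be compared against the nucleotide sequence.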