Gene Cagg_3456 details


Gene Information

Locus tag: Cagg_3456
Symbol:
ID: 7269681
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4210695
End bp: 4211570
Gene Length: 876 bp
Protein Length: 291 aa
Translation table: 11
GC content: 60%
IMG OID: 643568266
Product: 6-phosphogluconate dehydrogenase NAD-binding
Protein accession: YP_002464734
Protein GI: 219850301
COG category: [I] Lipid transport and metabolism
COG ID: [COG2084] 3-hydroxyisobutyrate dehydrogenase and related beta-hydroxyacid dehydrogenases
TIGRFAM ID: [TIGR01505] 2-hydroxy-3-oxopropionate reductase; [TIGR01692] 3-hydroxyisobutyrate dehydrogenase
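The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with a short calculation. The sketch below assumes that the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither assumption is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 4210695
end_bp = 4211570

# Assumption: coordinates are inclusive, so the length is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not encode a residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the listed 876 bp and 291 aa.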


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.236209
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0188789
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCAAC GGATCGGCTT TATCGGTCTC GGAGTGATGG GCAAACCAAT GGCACGCAAC 
CTGCACCGCG CCGGGTTTAC CGTCACCGTG TGGAATCGCT CGCCCCAGCC AATGAACGAA
CTGGCCGCCG AAGGCTTGAT CCCGGCCAGT TCTCCTGCTG AACTAGCCCG GACAAGTGAC
GTGGTGATCA CGATGTTACC CAATGGTCCC GACGTTGCCA GAGTCGCTCA GGGAGCGGAC
GGCCTATTCG CCCACATGGG CCGGGGAAGC CTATTCATCG ATATGAGCAC GATTGCCCCT
GAAACGGTCC GTCAACTAGC GGCGGCAGCC GCCGACTATG GGATTGCAAT GCTCGATGCA
CCGGTCAGCG GGGGCGACAA GGGGGCCGCC ACCGCAACCC TCTCGATTAT GGTCGGTGGT
CAGCCGGAGG ATTTCGAGCG GGCGCTACCC ATCTTCCAAA CGCTGGGTAA AACGATCACC
TACTGCGGCC TCATTGGAGC CGGGCAGACG GTCAAAGCTT GTAACCAAAT CGCCGTTGCC
ATTACGATGG CCGCCGCCGC CGAAGCGTTA GCATTCGGTA AAGCTGCCGG TATCGCTCCT
GAGATCATCC TGCGCGTGCT CGGTGGTGGG CTGGCTCAGA GTCGGGTGCT CGACATCCGC
GGGCCGACAA TGGCGCGCAA CGAATTTCGT CCTGGCTTTC GGGTACGGCT TCATCAGAAG
GATATGGAGA TTATCCACAC GACCGCCACT GCACTTGGTC TCTCACTCCC ATTCAGCGAT
CTCGTGCGTA CTCACTTCCA ACGCCTGATC GACAGTGGTA ACGGCGATCT GGACCATTCG
GCGCTGGCCC TCACTGTCGG TTATCCCGGC GTGTAA
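The GC content reported in the gene information can be recomputed from the gene sequence. The following is a minimal sketch; the function name `gc_content` is hypothetical, and it assumes the sequence is pasted as shown here, in blocks of ten bases separated by whitespace.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (the spaces and line breaks used in the listing) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first line of the gene sequence only, not the whole gene.
first_line = "ATGACGCAAC GGATCGGCTT TATCGGTCTC GGAGTGATGG GCAAACCAAT GGCACGCAAC"
print(round(gc_content(first_line), 1))
```

Passing the full gene sequence to the function gives a value that can be compared with the rounded figure in the record.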
 
Protein sequence
MTQRIGFIGL GVMGKPMARN LHRAGFTVTV WNRSPQPMNE LAAEGLIPAS SPAELARTSD 
VVITMLPNGP DVARVAQGAD GLFAHMGRGS LFIDMSTIAP ETVRQLAAAA ADYGIAMLDA
PVSGGDKGAA TATLSIMVGG QPEDFERALP IFQTLGKTIT YCGLIGAGQT VKACNQIAVA
ITMAAAAEAL AFGKAAGIAP EIILRVLGGG LAQSRVLDIR GPTMARNEFR PGFRVRLHQK
DMEIIHTTAT ALGLSLPFSD LVRTHFQRLI DSGNGDLDHS ALALTVGYPG V
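The protein sequence can be checked against the gene sequence by translating the gene codon by codon. The sketch below builds the codon table in the standard NCBI base order (T, C, A, G); translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons, which does not matter here because this gene begins with ATG. The function name `translate` is illustrative, and the code assumes the sequence is given 5' to 3' on the coding strand.

```python
from itertools import product

# Codon table in NCBI order: first, second, and third base each cycle T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace in the input is ignored; a trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGACGCAAC GGATCGGCTT TATCGGTCTC"))  # MTQRIGFIGL
```

The output matches the first ten residues of the protein sequence listed above; translating the full gene sequence allows the same comparison for the whole protein.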