Gene Cagg_1232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1232 
Symbol 
ID7266218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1508562 
End bp1509593 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content62% 
IMG OID643566075 
ProductAlcohol dehydrogenase zinc-binding domain protein 
Protein accessionYP_002462577 
Protein GI219848144 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCAGCA ACATAACCAC CAGCACGCGG AGTGTACTCC CTGCCGATGC AATCTGGTTT 
CCTGCGCCGC GTACCGTCAC CGTGTGCCGA GAAACCGCGC CGCCACCCGA TGCCGGTGAA
GTGCAGGTGG CTGCCATCGC CTCACTGATC AGTCACGGCA CCGAACGGCT GGTCTATCGG
GGAGAGGTCG ATCCGACGCT CCCCCTCGAC TTGCCGACCC TACGCGGTAG TTTCGCGTTT
CCGATCAAGT ACGGCTACGC AATTGCCGGA CGGATCATCG ACGTTGGTCC TGGCGTTGAT
GATCTACGCA TCGGTGATGC AGTCGCTGCC TTGCACCCAC ACCAGAGTAT CTTCACCATC
CCGGCGGCGC TTGTCAAACG CCTACCCGCC AACCTCGATC CGGCCTTAGG CGGCTTTTAC
GCCAATGTCG AAACGGCGCT GACCATCTGC CACGACGCTG CACCCCGGCT CGGCGAAACG
GTGATAGTTT TCGGGCAGGG GGTGATTGGG CTGCTCGTTA CGCAACTGCT GCAATTGGCC
GGCGTACACG TGATTGCCGT CGATCCTGAT CCGCAGCGAC GTGAGTTGGC GGCGCGTTTT
GGGGCCACCG CGTTGGCCCA GCCCGATCCG GCAGTCATTG CCGATCTCAC CGACAAACGC
GGGGCTGATA TTGCGATTGA GGTGAGTGGC GCCCCGACAG CATTGCAACA GGCGATTGAG
GCGGTCACGG TGGAAGGGTT GGTCATTGTG GCATCGTGGT ACGGACAAAA GCCGGTAACG
CTCACGCTCG GCGGCCACTT CCACCGCGGT CGGGTTCGGG TGCGCTCATC GCAGGTCGGA
CGGCTGGCGC CGGAAACCCT GCCGCGCTGG GATTACACAC GGCGGACGGC AACCGTAATG
CGCCTGTTGC CTCGTCTGCA CCTCGCTGAG CTGGTCAGCC ATCGCTTTCC GCTGGAACAG
GCGCCAGCCG CATATGCATT ACTCGACGCG GGTACCACCG GTCTGGTGCA GGTGATGATC
AGCTACAGAT AG
 
Protein sequence
MVSNITTSTR SVLPADAIWF PAPRTVTVCR ETAPPPDAGE VQVAAIASLI SHGTERLVYR 
GEVDPTLPLD LPTLRGSFAF PIKYGYAIAG RIIDVGPGVD DLRIGDAVAA LHPHQSIFTI
PAALVKRLPA NLDPALGGFY ANVETALTIC HDAAPRLGET VIVFGQGVIG LLVTQLLQLA
GVHVIAVDPD PQRRELAARF GATALAQPDP AVIADLTDKR GADIAIEVSG APTALQQAIE
AVTVEGLVIV ASWYGQKPVT LTLGGHFHRG RVRVRSSQVG RLAPETLPRW DYTRRTATVM
RLLPRLHLAE LVSHRFPLEQ APAAYALLDA GTTGLVQVMI SYR