Gene Cagg_1718 details


Gene Information

Locus tag: Cagg_1718
Symbol:
ID: 7269424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2099769
End bp: 2100809
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 59%
IMG OID: 643566560
Product: Alcohol dehydrogenase zinc-binding domain protein
Protein accession: YP_002463055
Protein GI: 219848622
COG category: [R] General function prediction only
COG ID: [COG1064] Zn-dependent alcohol dehydrogenases
TIGRFAM ID:
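
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2099769
END_BP = 2100809
GENE_LENGTH_BP = 1041
PROTEIN_LENGTH_AA = 346


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span covers 1041 bp, which is 347 codons, or 346 residues plus one stop codon.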


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.271059
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGCAG TTTACTACGA AGCCTTTGGT CAAATGCCAT GGCTTGCCAA CCTGCCTGAT 
CCTACGCCAA CCCCTGATGG AGTAGTGCTG GCCGTGCGCG CGACCGGCCT CTGCCGTAGC
GATTGGCACG CCTGGATGGG GCATGATCCC GATATTCGGT TACCACATGT GCCGGGTCAT
GAATTGGCCG GTGAAGTAGT GGCCGTCGGT GCGCGGGTAA CTCGCTGGCG GATCGGTGAT
CGGGTGACGG TACCGTTTGT CTGCGCCTGT GGTGTTTGTC CGCAGTGTCA GGCCGGTCAG
CAGCAGGTGT GCGATCACCA GTTTCAGCCG GGCTTTACCC ATTGGGGATC GTTCGCCGAA
TATGTGGCTA TCGACCGTGC CGATCTCAAT CTGGTCCGCC TGCCGGATGA TATGACCTAC
GTGACGGCGG CCAGTCTCGG TTGTCGTTTT GCTACTGCCT TCCGTGCAGT TGTTGATCTG
GCGAAGGTGA GCGCCGGTGA GTGGGTGGCA GTCTATGGCT GCGGTGGAGT TGGCTTGTCG
GCGATCATGC TGGCTCATGC GTTAGGTGGG CAGGTGATAG GGATCGATAT TAATCCCGAA
CGTCTCGCAC TCGCCCGTGA TCTTGGTGCT GTGGCCGTGG TTAATGCTGC TACCGAAGCT
GATGTTGTCG GGGTTGTTCG AGAGTTAAGT CGTGGTGGGG TACATATCGC AATCGATGCT
TTGGGTAGCC CCACTACCTG TGCCAACGCG ATCGCGAGTC TGCGCAAACG AGGACGTCAC
GTGCAGGTAG GTCTGCTGTT GGCCGAGCAA CGTATGCCAC CGCTGCCGAT GGATATTGTG
GTAGCCCGCG AGCTGGTGAT TATGGGTAGT CACGGCATGC AAGCCCATCG GTATGATGCC
ATGCTCGATC TGATCCAATC GGGAAAGGTC CAGCCGCAAC GCTTGATCGG GCGCACGATC
CGGCTGGACG AAGCGCCGGT GGCGCTGGTC GATCTCGATA GCTTTCGTGG GGCGGGGGTG
ACGGTGATTA CCGAATTCTA A
 
Protein sequence
MRAVYYEAFG QMPWLANLPD PTPTPDGVVL AVRATGLCRS DWHAWMGHDP DIRLPHVPGH 
ELAGEVVAVG ARVTRWRIGD RVTVPFVCAC GVCPQCQAGQ QQVCDHQFQP GFTHWGSFAE
YVAIDRADLN LVRLPDDMTY VTAASLGCRF ATAFRAVVDL AKVSAGEWVA VYGCGGVGLS
AIMLAHALGG QVIGIDINPE RLALARDLGA VAVVNAATEA DVVGVVRELS RGGVHIAIDA
LGSPTTCANA IASLRKRGRH VQVGLLLAEQ RMPPLPMDIV VARELVIMGS HGMQAHRYDA
MLDLIQSGKV QPQRLIGRTI RLDEAPVALV DLDSFRGAGV TVITEF
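
The gene and protein sequences above are related by translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code. The following is a minimal sketch of how a fragment of the gene sequence could be translated and its GC percentage computed. It assumes the sequence is given on the coding strand, contains only A, C, G and T, and is written in 10-base blocks separated by spaces as on this page. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence shown above.
FIRST_LINE = "ATGCGTGCAG TTTACTACGA AGCCTTTGGT CAAATGCCAT GGCTTGCCAA CCTGCCTGAT"
LAST_LINE = "ACGGTGATTA CCGAATTCTA A"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # MRAVYYEAFGQMPWLANLPD
    print(translate(LAST_LINE))   # TVITEF
```

The two fragments translate to the first 20 and last 6 residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field of the record.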