Gene Cagg_2038 details


Gene Information

Locus tag: Cagg_2038
Symbol:
ID: 7269197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2498568
End bp: 2500022
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 57%
IMG OID: 643566873
Product: Aldehyde Dehydrogenase
Protein accession: YP_002463362
Protein GI: 219848929
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
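
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates (which the listed values imply) and a single terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2498568, 2500022)
print(length, protein_length(length))  # prints: 1455 484
```

Both results agree with the Gene Length and Protein Length fields of this record.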


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTACC CATTTTTTCA GGTGCAACAC TACATCGACG GCCAGTTCGT CGATGGTGGA 
CCGCGATTTG AGATTATTTA CCCGGCGACC AACCACGTCA TAGGTTCGGC GCCGGAAGCC
GGCATGGCTG AAGTTGATGC AGCGGTGCAA GCCGCGGCAC GCGCCTTTCG GCAGTGGAGT
CGTACCTCGG TCGCCGAGCG TCGGCTCATT CTAAAGCAGT TTGCGCAACT TATCCGCGAG
CATGAAGCTG AACTAGCTGC GATTGAAACG TGGGATGTCG GGCGACCGAT CAGTGAAAAT
CGGGCCGGCT ACATTCCACG GATTGCAGCG AACATCGAAT TTTTCGCCGA TTTTGCGGCG
ACGCACGGGA GCGAAGCCTA TCCGATGGAT AACGGCTATA TCAACTACGT GCTGCGTCAA
CCGGTCGGTG TGGCAGCACT CATTGCACCG TGGAACATTC CGCTTTTGCA GGCCACGTGG
AAGATGGGAC CGGCGCTCGC GTTCGGCAAT ACGGTTGTTC TGAAACCGGC CGAGTTAACC
CCGATTGGAA CGTGGCGGTT GGCTCAACTA GCGCACGAAG CCGGTCTGCC ACCGGGTGTG
GTCAACGTGA TCCACGGTTT TGGCCCCAAC TCAGCCGGCG AATTCTTGAC CAAACACCCT
GATGTGAGGT TGATCTCGTT CACCGGTGAG ACAACCACCG GCAAGGTCAT CATGACTGTT
GCGGCTGCCA CCCTCAAACG AGTCTCCTTT GAATTAGGTG GTAAAGGCGC AAACATCATC
TTTGCCGACG CCGATCTCGA ACGGGCGGTA GCGATCAGCT TACGTTCGTC CTTCTTCAAT
CAAGGTGAAT TTTGTCTGGC CGGACCACGG ATCCTCGTTC AGCGACCAAT CTACGAAACT
TTTCTCGAAC GGTTTCAAGC GGCAGCGCAG CGGCTCAACG TTGGTGATCC GTTCGATCCG
GCGACGCAGG TCGGCGCACT GATCGCCCGC GAGCATCTCG ACCGCGTCAG CGGATACATC
GAAGTGGCCC GCACAAGTGC AGCACGGATC GTCCTTGGCG GAGGGCGTCC GGAACTACCG
GCACCGTTCG ATCAAGGAAA CTTTCTACAG CCGACGATTA TCGTCGATGT CAAGCCACAA
GATCGCGTTT GTCAAGAAGA GATCTTTGGG CCGGTGGTAA CGGTAGCACC GTTTGACGAA
GAAGAGGAAG TGATTGCGAT GGCCAACGGC GTAAGCTATG GTCTATCAGC AGTAGTCCAG
ACCCGCGACG TAGGACGGGC AGTGCGTCTT GGTGCTGCCC TGGAGGCCGG TACGATCTGG
GTCAACGACT TCTTTGTGCG TGATCTGCGT GTACCGTTTG GCGGTATGAA AAACAGCGGC
ATTGGGCGCG AAGGCGGCCA CTACAGCCTG GAGTTCTACA CGGAGGCCAA GACGATATGC
CTGAGCAATC AGTAG
 
Protein sequence
MDYPFFQVQH YIDGQFVDGG PRFEIIYPAT NHVIGSAPEA GMAEVDAAVQ AAARAFRQWS 
RTSVAERRLI LKQFAQLIRE HEAELAAIET WDVGRPISEN RAGYIPRIAA NIEFFADFAA
THGSEAYPMD NGYINYVLRQ PVGVAALIAP WNIPLLQATW KMGPALAFGN TVVLKPAELT
PIGTWRLAQL AHEAGLPPGV VNVIHGFGPN SAGEFLTKHP DVRLISFTGE TTTGKVIMTV
AAATLKRVSF ELGGKGANII FADADLERAV AISLRSSFFN QGEFCLAGPR ILVQRPIYET
FLERFQAAAQ RLNVGDPFDP ATQVGALIAR EHLDRVSGYI EVARTSAARI VLGGGRPELP
APFDQGNFLQ PTIIVDVKPQ DRVCQEEIFG PVVTVAPFDE EEEVIAMANG VSYGLSAVVQ
TRDVGRAVRL GAALEAGTIW VNDFFVRDLR VPFGGMKNSG IGREGGHYSL EFYTEAKTIC
LSNQ
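
The gene and protein sequences above can be checked against each other by translating the DNA. The following is a rough sketch, assuming the codon assignments of translation table 11, which match the standard code for internal codons. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGATTACC CATTTTTTCA GGTGCAACAC"))  # prints: MDYPFFQVQH
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison with the listed protein sequence and the GC content field.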