Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_2038 |
Symbol | |
ID | 7269197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 2498568 |
End bp | 2500022 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643566873 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_002463362 |
Protein GI | 219848929 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTACC CATTTTTTCA GGTGCAACAC TACATCGACG GCCAGTTCGT CGATGGTGGA CCGCGATTTG AGATTATTTA CCCGGCGACC AACCACGTCA TAGGTTCGGC GCCGGAAGCC GGCATGGCTG AAGTTGATGC AGCGGTGCAA GCCGCGGCAC GCGCCTTTCG GCAGTGGAGT CGTACCTCGG TCGCCGAGCG TCGGCTCATT CTAAAGCAGT TTGCGCAACT TATCCGCGAG CATGAAGCTG AACTAGCTGC GATTGAAACG TGGGATGTCG GGCGACCGAT CAGTGAAAAT CGGGCCGGCT ACATTCCACG GATTGCAGCG AACATCGAAT TTTTCGCCGA TTTTGCGGCG ACGCACGGGA GCGAAGCCTA TCCGATGGAT AACGGCTATA TCAACTACGT GCTGCGTCAA CCGGTCGGTG TGGCAGCACT CATTGCACCG TGGAACATTC CGCTTTTGCA GGCCACGTGG AAGATGGGAC CGGCGCTCGC GTTCGGCAAT ACGGTTGTTC TGAAACCGGC CGAGTTAACC CCGATTGGAA CGTGGCGGTT GGCTCAACTA GCGCACGAAG CCGGTCTGCC ACCGGGTGTG GTCAACGTGA TCCACGGTTT TGGCCCCAAC TCAGCCGGCG AATTCTTGAC CAAACACCCT GATGTGAGGT TGATCTCGTT CACCGGTGAG ACAACCACCG GCAAGGTCAT CATGACTGTT GCGGCTGCCA CCCTCAAACG AGTCTCCTTT GAATTAGGTG GTAAAGGCGC AAACATCATC TTTGCCGACG CCGATCTCGA ACGGGCGGTA GCGATCAGCT TACGTTCGTC CTTCTTCAAT CAAGGTGAAT TTTGTCTGGC CGGACCACGG ATCCTCGTTC AGCGACCAAT CTACGAAACT TTTCTCGAAC GGTTTCAAGC GGCAGCGCAG CGGCTCAACG TTGGTGATCC GTTCGATCCG GCGACGCAGG TCGGCGCACT GATCGCCCGC GAGCATCTCG ACCGCGTCAG CGGATACATC GAAGTGGCCC GCACAAGTGC AGCACGGATC GTCCTTGGCG GAGGGCGTCC GGAACTACCG GCACCGTTCG ATCAAGGAAA CTTTCTACAG CCGACGATTA TCGTCGATGT CAAGCCACAA GATCGCGTTT GTCAAGAAGA GATCTTTGGG CCGGTGGTAA CGGTAGCACC GTTTGACGAA GAAGAGGAAG TGATTGCGAT GGCCAACGGC GTAAGCTATG GTCTATCAGC AGTAGTCCAG ACCCGCGACG TAGGACGGGC AGTGCGTCTT GGTGCTGCCC TGGAGGCCGG TACGATCTGG GTCAACGACT TCTTTGTGCG TGATCTGCGT GTACCGTTTG GCGGTATGAA AAACAGCGGC ATTGGGCGCG AAGGCGGCCA CTACAGCCTG GAGTTCTACA CGGAGGCCAA GACGATATGC CTGAGCAATC AGTAG
|
Protein sequence | MDYPFFQVQH YIDGQFVDGG PRFEIIYPAT NHVIGSAPEA GMAEVDAAVQ AAARAFRQWS RTSVAERRLI LKQFAQLIRE HEAELAAIET WDVGRPISEN RAGYIPRIAA NIEFFADFAA THGSEAYPMD NGYINYVLRQ PVGVAALIAP WNIPLLQATW KMGPALAFGN TVVLKPAELT PIGTWRLAQL AHEAGLPPGV VNVIHGFGPN SAGEFLTKHP DVRLISFTGE TTTGKVIMTV AAATLKRVSF ELGGKGANII FADADLERAV AISLRSSFFN QGEFCLAGPR ILVQRPIYET FLERFQAAAQ RLNVGDPFDP ATQVGALIAR EHLDRVSGYI EVARTSAARI VLGGGRPELP APFDQGNFLQ PTIIVDVKPQ DRVCQEEIFG PVVTVAPFDE EEEVIAMANG VSYGLSAVVQ TRDVGRAVRL GAALEAGTIW VNDFFVRDLR VPFGGMKNSG IGREGGHYSL EFYTEAKTIC LSNQ
|
| |