Gene Cagg_1702 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1702 
Symbol 
ID7269408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2078152 
End bp2079648 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content57% 
IMG OID643566544 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002463039 
Protein GI219848606 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCAA CGCGGATCTT TCACAATCTG ATCGGCGGCG AGTTTGTACC GGCCCAAAGT 
GGCAAAACCT TCGAGCGACG CAATCCCGCC GACACCCGCG ATCTGGTCGG TGTATTCGCC
GATAGTGATG AACGCGATGT CCATGCCGCG GTTGAAGCGG CTAAAGCAGC CTTTGCCACA
TGGAGCAAAG TGCCTGCCCC CAAGCGTGGT GAGATTCTGC TCCGCGCTGC CGAGTTACTT
CAAGCCCGCA AAGAGCAGTA TAGTCGTGAT CTCACCCGCG AGATGGGCAA ACCCCTGTTT
GAAGCCGGTG GCGACATCGT TGAGGCTATC GGGATGGCTC AATATGCCGG TAGCGAAGGA
CGCCGCATGC ACGGTGTGAC CACCCCGTCG GAATTGCCCG ACAAATTCCA GATGAGCATT
CGCCAGCCCA TCGGCGTAGT CGGGCTTATC ACCCCCTGGA ACTTCCCGAT GGCCATTGCC
TCGTGGAAGA TGCTACCGGC GATCGTCTGC GGCAACACGG TTGTCATCAA ACCCGGCGAA
GACGCTTCGG TGAGTACCTA CAACCTGGTA CAGTGTCTGA TGGAAGCCGG CCTACCGCCC
GGCGTCGTCA ATATCGTCAC CGGCTATGGG CCAAAAGCCG GCCAACCTCT CGTCGAACAT
CCTGACGTAC CGGTCATCTC TTTCACCGGT AGCACCGAAA CCGGCCGGCT CGTGTATGAG
CTAGGCGCAC GCCGGATCAA GCGTGTCAGC CTTGAGATGG GTGGTAAAAA CCCGTTGATT
GTCATGCCTG ACGCCGATCT CGATCTTGCG CTCGACGGCG TCTTGTGGGC TGCGTTTGGC
ACGACCGGCC AACGTTGCAC TGCCACCAGC CGCCTGATCG TTCACCGTGC CGTCGCGAAG
GAGCTGGTTG ACCGAATTGT TGCTCGCGCC CAATCGCTCC GCATTGGGAA TGGTCTCGAT
CCGAATACCG ATATGGGTCC GCTGGTGAAC GAGAAACAGA TGCAACGAGT GCTTAATTAT
ATCGAGATTG GTAAGCAAGA AGGAGCAACC TTGCTCTGTG GCGGTTACCG CCTCACCGGC
GGCGATTACG ACTACGGCTA TTTTATTGCG CCGACCGTCT TCACCAACGT CACACCCCAG
ATGCGCATCG CTCGCGAAGA AATCTTTGGG CCGGTGCTGA GTGTGATTGA GGTTGATAGT
CTCGATGAAG CGGTGGCAAT TGCCAATGAT GTGGCCTATG GCCTTAGTTC GGCAATTTAT
ACTCGTGATA TTAACGCTGC TTTCCGCGCT ATGCAGGCGC TGCAAGCCGG GATTGTTTAC
ATCAACGCTC CTACCATCGG CGCTGAAATT CACTTGCCGT TCGGTGGTGT GAAGGCGACC
GGTAACGGTC ATCGCGAAGC CGGCCCGACG ATGCTCGATG TCTTCTCGGA GTGGAAGAGC
GTCTACATTG ATTACAGCGG CAGATTGCAG CGCGCACAGA TTGATAACGC CGATTAG
 
Protein sequence
MTATRIFHNL IGGEFVPAQS GKTFERRNPA DTRDLVGVFA DSDERDVHAA VEAAKAAFAT 
WSKVPAPKRG EILLRAAELL QARKEQYSRD LTREMGKPLF EAGGDIVEAI GMAQYAGSEG
RRMHGVTTPS ELPDKFQMSI RQPIGVVGLI TPWNFPMAIA SWKMLPAIVC GNTVVIKPGE
DASVSTYNLV QCLMEAGLPP GVVNIVTGYG PKAGQPLVEH PDVPVISFTG STETGRLVYE
LGARRIKRVS LEMGGKNPLI VMPDADLDLA LDGVLWAAFG TTGQRCTATS RLIVHRAVAK
ELVDRIVARA QSLRIGNGLD PNTDMGPLVN EKQMQRVLNY IEIGKQEGAT LLCGGYRLTG
GDYDYGYFIA PTVFTNVTPQ MRIAREEIFG PVLSVIEVDS LDEAVAIAND VAYGLSSAIY
TRDINAAFRA MQALQAGIVY INAPTIGAEI HLPFGGVKAT GNGHREAGPT MLDVFSEWKS
VYIDYSGRLQ RAQIDNAD