Gene Cagg_3503 details


Gene Information

Locus tag: Cagg_3503
Symbol:
ID: 7266431
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4269146
End bp: 4272451
Gene Length: 3306 bp
Protein Length: 1101 aa
Translation table: 11
GC content: 58%
IMG OID: 643568311
Product: diguanylate cyclase/phosphodiesterase with PAS/PAC sensor(s)
Protein accession: YP_002464778
Protein GI: 219850345
COG category: [T] Signal transduction mechanisms
COG ID: [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain
TIGRFAM ID: [TIGR00229] PAS domain S-box; [TIGR00254] diguanylate cyclase (GGDEF) domain
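
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against one another. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon (both assumptions are consistent with the values listed, but the record does not state them). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(4269146, 4272451)
print(length)                     # 3306
print(protein_length_aa(length))  # 1101
```

Both results agree with the Gene Length (3306 bp) and Protein Length (1101 aa) fields of this record.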


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGACG ATACCGCATC TGTACCCTCT CCTTTTCTGA ACAGTGATGT CGTAGCTGCG 
CCTCTCAACA CGCTGCTAGA ATGTATCGTC GAAATAGTCA TCATTCTCGC ACCTGATCAG
ACCATCCGCT TTGCCAGCCC AGCCCTGACC AACCTGCTCG GCCATCCGCA AGCCCTTGTG
GTCAACCGAT CCCTCTATGA TTTCGTTCAC GTCGACGACC ACGACCGGTT GCAGCGCCAT
CTCGCTCAAT TAGCCGATAG CCAATGCCGG CGCCAACCAA TCCGGGTACG TCTACAACAC
GCCGAGGGCG GCTGGTACAT TTTCGAGCTG CGGTGCTGTA ACCGCCAGCA CCAACCCGAA
GTCGGCGGGA TCATTCTCAG CGGCTATGAT GTCACGGCGT TGCTTGCCGC CGATTATGCG
CTGCGCAACA ACGAATTTCG CTACCGCCAA CTGCTCGAAT TGGCCCCGCT GCTGATCATC
ATCAGCACCC TTCCAGAGGG AATTATCCGC TACTGCAACC CAGCCGGCGC ACGCATCTTT
GGCGTGGACG ATCCAGCCGA GCCGGTCAAC CAACCGCTGA CCGAATTTCT CTCTGGCGAA
GATCACGACC TGATGCGCAA GCGCCGTCAG GCACTGCGCA ACGGCCAACT CCTCCAACCG
GCCCGGTATC ACATTTATGC CCGCGATCAG ATTGTGCGCG TGATAGAGGT GCAAGGCACG
TTGATCGAAG GCCCTGGCGA AATACAGGTC TTGACGATCG GTTACGATGT CACCGACCGG
GTGGAAGCGG AACGGCAATT AAACTTGCGG GCGCGGGTGC TGTCGCAAAT TAACGAGGCG
GTGATCGTGA CTGACCGCAG CGGCGCGATT ATCTATCTCA ACGAGATGGC GGCGTACCTC
TACGCTGTTC AACCGGAAGC GATCATCGGC CAGCCGCTGA GCCGTCTGTT TACTGCTGAA
CCGCTGCCGT ACCAGCCGGT GCAACCTTCG GTGCAGAGCT TCAGCACAGG TTGGCGCGGT
GAATTAATCC ATCGCCTGCC AGACGGGCGG GCCATGATGG TGGAGGTTCG TATCGATCCG
CTCTACCACG ACGAGTTAGC CAACGGCGGC GCGCTGATCG TGGTGCGCGA TATCACTGCG
CGCAAACAGG CTGAAGAACG GTTGCGCCTG CTCGAATCGG TGGTGGTGCA TACCAACGAC
GCCGTGATCA TCCTCGACGC TGAACCGCTG CATGCACCCG GCCCATTTAT TCGCTATGCC
AACGCGGCCT GTAGCCAACT AACCGGCTAT GCCACCAATG AGCTGATTGG CCGTAGCTTG
CGCATCTTGC ACGGCCCGGC TACCGATTCG GTGACACTCA ACCGGCTGTG GGAAGCGATG
GAGCGACAGC AGCCAGCTCG CTTTGCATTG CTCTACTACA CGCGCGCTGG CAAACCGATT
TGGGTCGATC TCAGCTTGTC ACCAGTTACT GACGAATATG GCAAAGCAAC GCATTTTATT
GCCTTGCACC GCGACATCAG CGCGCAGAAG CTGGCGGCAT TCCTCGAACA CGACCGCTCG
GCGATTTTGG CCTTGTTGCT CCAACACGCA CCAGTCACCG CCATGCTGAA CCGGCTGGTA
ACGCTCATCG AACGGCAACG ACCCAACTGG CTGGTGTGGG CTGAATTGGA CGGGCACCCC
GCTGCCGCTA GCAAACCATT GGCCGGTGAA GCCGACCTGA TTGGGCCACG CTTGCGCCAG
ATGATCGCGC GGCATAACCA AAAAGCGCCC ATCGTGCTGG ATCTGCATAC TCATTGGGAT
CAATTGCGTC ATTTGAGCGT CAGCAGCGGC TGGATTTGGC CGCTGGCCAA AGCCCATGGC
GCGCTCGCCA TCTTCCACCA ACAAACCGAT TGGGTAAGCG AGAGCGATCA AATGCTATTG
CGCGTCGCAA CCGAACTTTT CAACTTGATC GCTGATCACG CTAACCTCAA TGCGCAGCTC
AACTATCAAG TGCGCTATGA CACGCTGACC GGCTTGCCAA ACCGCAATCT GTTGCAAGAG
CGGATCGATC TCGCCCTGCG CGATGCGCGC GAGCGGCGCC ATATTGTGGC ACTACTCTTT
ATCGATCTCG ATGGCTTTAA GCAGGTGAAT GACTCGCTTG GGCATCCCAT CGGGGACCGT
TTTCTCAAAC ACGTTAGCCA GGCCTTCGCT GCATGCGCCC GCCCCCAAGA TACGCTGGGC
CGTATGGGAG GCGATGAGTT TCTGTTGCTG ATGCCTGATC TGCCTGATGC ACGATTAGCC
GATATTGCCG CTCAACGCTT GCTTGATGCG TTGCAAACGC CGTTCCTTCA CGATGGGCAT
GAATTGCGCT TAACCGCCAG CATCGGGATC AGCCTCTTTC CGCGCGATGG CGTCGATGTG
GTGACCTTGC TCAAGAATGC GGACAGTGCC ATGCACCGCG CCAAGGAACT CAAACGCAGC
GGTTTTCTGC ATTACCGGCC CGAACATAGC CGACGCGCTC ATACTCGCCT CGCGCTGGAA
GCCCAACTGC GCCGCGCGCT TGAACGGCAT GAACTAGCGG TGTACTACCA ACCGCAATAC
GATTTAGTCA GCGAAGGTAT CACCGGCGTC GAGGCGCTGG TGCGCTGGCA CCATCCGCAA
CGCGGCCCGA TTGCTCCCGG CGAGTTTGTG CCGATTGCCG AAGAGAGTGA TCTGATTATC
GAGATTGGTT CGTGGGTGCT GCGCGAAGCC TGCCGGCAAG CAATGGCATG GCAACAGGCT
GGCTATCCGC TCTTGCGCAT CAGCGTCAAC GTCTCGGCCC GCCAACTATT GCGGCCTGAG
TTTGTGGCCG AAGTGGCAGC GGTCTTGCAC GCCACCGGTC TGCCGGCTCA GTATCTCGAA
CTAGAGATTA CCGAAGGCGT GATGCTTGAT GACCCGATCT ACGCGGCGCG CCAGATCGAT
CAGTTGCGCC ACATGGGGGT GCGGATTGCG CTCGACGATT TCGGCACCGG GTATTCGTCG
CTCGCCTATT TGCGCCAGCT CCGCCTCGAT GCGCTCAAGA TTGACCAAGC GTTTGTGCGC
GCCATTGACG AACAACAGAA TGTTGTCACG AACAGCCGTG CTCTGTTGCG CGCGATTATC
AATTTGGCGC ATAGCCTTGG ACTGGCCGTC GTCGCTGAAG GGGTTGAAAC CGATGAGCAG
CGGGCCGCGT TGCTCAGTAT GGGGTGCGAC GTGTTGCAGG GTTACCTGAT CTCCCAGCCA
CTACCCGCCG ACGAGGTCTG GCCGATGCTT GTACGGCTGC ATAACCAGCA ACGACTGATG
GAGTGA
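
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any analysis. The sketch below is illustrative only, with hypothetical helper names: it joins the blocks, computes a GC fraction, and translates codons using the amino-acid assignments of translation table 11 (which, for sense and stop codons, are the same as the standard code). It is applied here to the first line of the sequence only; running it on the full sequence would be the way to compare against the GC content and protein sequence given in this record.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above, as printed.
first_line = "ATGATTGACG ATACCGCATC TGTACCCTCT CCTTTTCTGA ACAGTGATGT CGTAGCTGCG"
dna = clean_sequence(first_line)
print(len(dna))        # 60
print(translate(dna))  # MIDDTASVPSPFLNSDVVAA
```

The translated fragment matches the first 20 residues of the protein sequence listed for this gene.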
 
Protein sequence
MIDDTASVPS PFLNSDVVAA PLNTLLECIV EIVIILAPDQ TIRFASPALT NLLGHPQALV 
VNRSLYDFVH VDDHDRLQRH LAQLADSQCR RQPIRVRLQH AEGGWYIFEL RCCNRQHQPE
VGGIILSGYD VTALLAADYA LRNNEFRYRQ LLELAPLLII ISTLPEGIIR YCNPAGARIF
GVDDPAEPVN QPLTEFLSGE DHDLMRKRRQ ALRNGQLLQP ARYHIYARDQ IVRVIEVQGT
LIEGPGEIQV LTIGYDVTDR VEAERQLNLR ARVLSQINEA VIVTDRSGAI IYLNEMAAYL
YAVQPEAIIG QPLSRLFTAE PLPYQPVQPS VQSFSTGWRG ELIHRLPDGR AMMVEVRIDP
LYHDELANGG ALIVVRDITA RKQAEERLRL LESVVVHTND AVIILDAEPL HAPGPFIRYA
NAACSQLTGY ATNELIGRSL RILHGPATDS VTLNRLWEAM ERQQPARFAL LYYTRAGKPI
WVDLSLSPVT DEYGKATHFI ALHRDISAQK LAAFLEHDRS AILALLLQHA PVTAMLNRLV
TLIERQRPNW LVWAELDGHP AAASKPLAGE ADLIGPRLRQ MIARHNQKAP IVLDLHTHWD
QLRHLSVSSG WIWPLAKAHG ALAIFHQQTD WVSESDQMLL RVATELFNLI ADHANLNAQL
NYQVRYDTLT GLPNRNLLQE RIDLALRDAR ERRHIVALLF IDLDGFKQVN DSLGHPIGDR
FLKHVSQAFA ACARPQDTLG RMGGDEFLLL MPDLPDARLA DIAAQRLLDA LQTPFLHDGH
ELRLTASIGI SLFPRDGVDV VTLLKNADSA MHRAKELKRS GFLHYRPEHS RRAHTRLALE
AQLRRALERH ELAVYYQPQY DLVSEGITGV EALVRWHHPQ RGPIAPGEFV PIAEESDLII
EIGSWVLREA CRQAMAWQQA GYPLLRISVN VSARQLLRPE FVAEVAAVLH ATGLPAQYLE
LEITEGVMLD DPIYAARQID QLRHMGVRIA LDDFGTGYSS LAYLRQLRLD ALKIDQAFVR
AIDEQQNVVT NSRALLRAII NLAHSLGLAV VAEGVETDEQ RAALLSMGCD VLQGYLISQP
LPADEVWPML VRLHNQQRLM E