Gene Cagg_1597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1597 
Symbol 
ID7268163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1951212 
End bp1954157 
Gene Length2946 bp 
Protein Length981 aa 
Translation table11 
GC content54% 
IMG OID643566438 
Productadenylate/guanylate cyclase with GAF sensor(s) 
Protein accessionYP_002462934 
Protein GI219848501 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain
[COG2114] Adenylate cyclase, family 3 (some proteins contain HAMP domain) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000711164 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAC CGGCACATAT TTTGGTCGTT GAAGACGACC CCGATATTCG GCGAATCTTT 
CAAGCGGTAC TCTCCCGTGA CGGCTTTCGC GTCTCAATTG TCAATTCAGG CGAAGAGGCG
TTAAGTTTTC TGCAACTGAT TACGCCGGAT TTAATCTTAC TTGATTTAGC TTTGCCCGGT
ATTGGTGGGG ATGAAGTAAC CAGGCGAATT AAGTCGGATC GCTCGAAGCC TTTCATTCCG
GTGATTATCG TTTCGGCCCA CGCCGATCTC GATACAAGCG TTAGTAATCT CGATGCCGGC
GCCGATGATG TACTGGTCAA GCCGGTTGAT TTCAACCTTT TGCTGGCACG GGTACGCGCT
TTGCTTCGTC TGCAACGGGC ACAACGTAGT TTGCAGCAAG AGCAGCGCAA AACGGAATTA
TTGCTCCACC TCTCTCGCGA TCTAGGATCG AGCATCGATC TCGATGTATT ACTCACCGGT
TTTCTCAACC ATTTGGCCGA TGCTGTTGGC GCAGTACGGG CGAGCATTAT TCTGACCGGT
TTTGTCGAAG ATAAAATCTT GCTCTACTCA AGTAGTCGCT ATCCGGCAGT TGCCGAGTTG
CAAGATATTG TGCGTTACGG AGTGGCCGGT TTGGCCTTGC GTGAACGCAG CCCGATTTTG
ATTGCCGATA CGCGGCTCGA TCAGCGCTGG TTGGTCACCT CGCCGGTCCA TAACGATGTG
CGGTCGGTCG TGGCGATGCC GATTATCCGT GACAATCGGG TATGGGGAGT GATCACCCTC
GTTCACCATA CACCGGGCTA TTTTACCTCA GAACATTTAG AATTGTTAGC TTCGGTGGCA
GCCCAAAGTG CAGTAGCATT AGAGAGTGCG CGTCTGTACC GGTTGAGTGA GCAACAGAAA
GAGCTGCTCG CGCGACGGGC CGAAGAATTA CGTCAGATCA ACGAGATCAA TCAACTCTTG
TCGGAGTTGA TGCAGATCGA TCAACTGGGG CGCTTGTTGG TGCAGATGCT CCACCACCAA
TTCGGATACC CGTTGGTGGC GCTGGTGCTG CGTCAAGCCG AACAACTTGT GGTGCAGTCC
GTTGCCGGCA CTCAGCAGTT TGAGACGGGC AAGGTCGTGA TCGGCGCCGA GAGTGGTATC
AGTGGGTGGG TGTGTCGTAA CCGCAAACCG TTGCGGATCG ACGATGTGAC CCAAGATGCC
CGCTTTATCC CTTGCTTGCC CGATGAAGCG CGGATGCGGT CGGCGTTGAG TGTGCCGGTG
CTGTTGCCGC GCGAAGGGGT GGTAGGCACC ATTGAGCTGC GTAGTCCGCA GCTTGCTGCG
TTTAGTCCGA ACGATGAAGC TATTGTATCG GCGATTGCCA ATCAGTTGGC GATTGCGATT
TACAATGCCC GCTTATTTGC AAATGAGCAA CGGCGGATTG CCCAACTTAG CGAGATCAAC
CGATTGTCGC TGGCCCTGAC CGCACAGTTC ACGGCACCTG ATAATCTCCA ACGCACGGTC
GAAGCGGTTG CCCATATCTT TCAGGTAGAC CAGGCAGCAA TGGTGTTATT TGGTATTCAG
CCTAGTGAGA CGGTTGTCGT AATGAGTGGG CCGGCCTCAC CACACGATAA CGATCTCATC
AATTTTGTCA GCAACCATGC CTTGCTGGCT ACCTATGTCG CCAAATTGGC GCAGCCACAA
CAATATACCG ATTTGGACGC GCACGATATA TTAGCTCCGC TGCGAGCGTT CTTTGCCGCC
CGTGGGATTC GGTCAATCGT GATCGTTCCA CTCATTGTCA CCGGTCAAGC GCAGGGTATT
CTGGCGTTGG ATATTACTCG ACGGGGGGTG CTGGAGAAGA CGGAATTAGA GATGGCCTCG
ACCGTCGCCA GCCTGATCGT GCAGATTTTG GAAAATGCTC GGCTGTACCG AGTGGTGAAT
GATGAGCGTT CAACCCTCGA TGCAGTCTTG CGGAGCGCAA CCGACCCCAT TTTACTGATA
GATACTGATG CGCGGTTGCT CTTGTCGAAT CCGGCAGCAC ACGAGCGCTT GCAGATCGAT
CCGGTCGTTC ACCGTGATCA GCCGGTTGAT CAATTTCCGG CGTTGCGTGC TGTGTTGCCG
TTCCTTGATC GGGATAGTCC GACGACGGTT GAGATCGAGC CGAAGCCGAA CGTCATTTTT
AGCGTCAGCA TTGCGCCGGT GCGAGGTGTG GATAACAAAG AGATTGGCCG CGTTGTCGTC
TTTCGCGACA TCAGTGCGAT CAAACAACTC GAACGGCAAG AGCGTGAACG GGTGCGCAGT
GTCTTCCGTC GCTACGTTTC ACCGCAGGTC GCCGAACGGT TGTTGAGTGC CGGGAGTGAT
TTTGGAGCAC CAACCGAGCG TACTGTGGCG GTGCTGTTCG CCGATATGCG CGGTTTTACC
ACCCTGACCG AGCAAATCGA TGCACGGGTG TTGGTCGAGC GGATTCTCAA TCGCTATTTC
ACCGCTATGA CCGAAGTGCT TTACGCTTAC GATGGCACGA TTGACAAGTT TCTCGGCGAT
GGGCTGATCG GTGTGTTTGG TTCACCTATT GCCCATCCTG ATGATCCACA ACGAGTGATT
CGAGCGGCGG TGTCGATGCA GCGAGCCTTT GCCAAGCTGG CTAAACAGTG GCAGGAGGAG
CTGAATCTGC GGATCGGGAT GGGGATTGGG ATCGGCTATG GTACGGCCGT TGTTGGTAAT
GTCGGTTCGG CGCAGCGCCA AGATTATACG CTGATCGGCG ATGTGGTCAA TACAGCCTCA
CGTTTGTGTG GGATTGCGCA GGCCGGTCAG ATAATTGTCT CATCACAGTT GGCGAGTGTC
CTCGGTGAAC AATCGCCTTA CCCGCTGCGA TTACTGGGTG TGACCCGCTT GAAGGGAAAA
CAGGAAGAAC ACATGATTTA CGAGGTGGTG CTTGATCAGA TGGTGCGTAC CTCGCTGGCC
CGTTGA
 
Protein sequence
MTEPAHILVV EDDPDIRRIF QAVLSRDGFR VSIVNSGEEA LSFLQLITPD LILLDLALPG 
IGGDEVTRRI KSDRSKPFIP VIIVSAHADL DTSVSNLDAG ADDVLVKPVD FNLLLARVRA
LLRLQRAQRS LQQEQRKTEL LLHLSRDLGS SIDLDVLLTG FLNHLADAVG AVRASIILTG
FVEDKILLYS SSRYPAVAEL QDIVRYGVAG LALRERSPIL IADTRLDQRW LVTSPVHNDV
RSVVAMPIIR DNRVWGVITL VHHTPGYFTS EHLELLASVA AQSAVALESA RLYRLSEQQK
ELLARRAEEL RQINEINQLL SELMQIDQLG RLLVQMLHHQ FGYPLVALVL RQAEQLVVQS
VAGTQQFETG KVVIGAESGI SGWVCRNRKP LRIDDVTQDA RFIPCLPDEA RMRSALSVPV
LLPREGVVGT IELRSPQLAA FSPNDEAIVS AIANQLAIAI YNARLFANEQ RRIAQLSEIN
RLSLALTAQF TAPDNLQRTV EAVAHIFQVD QAAMVLFGIQ PSETVVVMSG PASPHDNDLI
NFVSNHALLA TYVAKLAQPQ QYTDLDAHDI LAPLRAFFAA RGIRSIVIVP LIVTGQAQGI
LALDITRRGV LEKTELEMAS TVASLIVQIL ENARLYRVVN DERSTLDAVL RSATDPILLI
DTDARLLLSN PAAHERLQID PVVHRDQPVD QFPALRAVLP FLDRDSPTTV EIEPKPNVIF
SVSIAPVRGV DNKEIGRVVV FRDISAIKQL ERQERERVRS VFRRYVSPQV AERLLSAGSD
FGAPTERTVA VLFADMRGFT TLTEQIDARV LVERILNRYF TAMTEVLYAY DGTIDKFLGD
GLIGVFGSPI AHPDDPQRVI RAAVSMQRAF AKLAKQWQEE LNLRIGMGIG IGYGTAVVGN
VGSAQRQDYT LIGDVVNTAS RLCGIAQAGQ IIVSSQLASV LGEQSPYPLR LLGVTRLKGK
QEEHMIYEVV LDQMVRTSLA R