Gene Cagg_2019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2019 
Symbol 
ID7269177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2478036 
End bp2481086 
Gene Length3051 bp 
Protein Length1016 aa 
Translation table11 
GC content54% 
IMG OID643566853 
ProductGAF sensor hybrid histidine kinase 
Protein accessionYP_002463343 
Protein GI219848910 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
[COG2205] Osmosensitive K+ channel histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.25506 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTATTGG TAATACCGCG AGTGTATAAT AAAATTATGA CAAACCAAGA CACACCATCT 
GACCAATCTG CCGTCACCTC ACCGATCGCC ACCATTCTTG CACTCAGTAA GGCAGTTCTT
TCATCGCTCG AACTGCCGGA TGTGTTACAG CGTGTATTAG TCGCCACTCG TGATCTAAGC
GGAGCCGATG TTGTGACTAT CTGGCTGCTC GACGAGCAAG GCGAGTGGTT GACAAGCGCT
GCTATCCTTG GGCTTGAAGA CCGTCCTGAG CGTGAACGTA TCTTCCGTTT GCGGGTCGGT
GAAGGTATTG CCGGTTGGTC AGTTGCCCAT CGTCAAATCC TCCAAATTAC CAACCCACTT
CACGATCCAC GCTACGTACC CAAACTCGAC CGCCATCCGG CTATTATACT CTCTATCCCT
CTGATCGTTC GCGCGCAGTG CGTCGGTGCG CTTAGCCTAT CTCGCTACAC TGTCGCCACA
CCGTTTAGCC AAGAAGTTAT CGAAACGATT TCTATTTTCG CCGATCAGGC TGCCATCGCT
ATCGACAACG CTACAAAGGC TCAGATCCTG CGTTACGCCA CAGCACGCGA ACGAATCTTG
GCCCATAGCA CCGAATCGAC CCAAGCCGAA ACGTTGATCT TGGCCGAATT AGCCGTCGTC
CTTGGGGTCG AACCTCTCCT TATTAGCACC ACGTATGATG GGAACCTTCT CGATCACGCC
GGTCACAACG TTCACTACGA TGACCTGAAC CGCTGGCGTA ATGAAGGCGC GCTGATCACC
GATCTACGTT TCGATGGATT ACCGGCATGG CTAGTAGTCA ACCAACCGGG CCGCTATTGG
AGCAAAGAGG ATACTAACCT GCTTGCTTTT GCGGCCAATC AATTGATGCG TGCCCGCCAG
CGCGCTTTCG AGCAACGCGC ACACGCTCGC GCCGAAGCGC TGCACCGTCT CGTGGCACTG
ACCAACGCAC GCATCGATCA GGCCAGCGTG CTTGATCAGA TCCTGGCCGA ATTGCAACGA
TTTATCCCGT TCGACTCGGC TTGTGTCTTC GTTGTGCATG ACGATGAGTA TGTCCGCCTC
ATTGCCCAGC GTGGTCTGCG TACTCCGGTC GATCAAGTAA CTCTCTTCGC CGGACCGGGT
TCAACGATTT ATGATCTGCG TCAAGTCGGT ACGGCAAGGT ATCATCCCGA TGTACAGCAA
TTACCCGGTT GGCAGAAGGT ACCCGATAGC GAGATCATCC GTTCCTGGAT CGGTGTACCA
CTGCGGGTTG ATCAGACCAC AATTGGCTTT TTAACGATTG ATAAATGGAT CCCCAATGCC
TTCACTGCCG AAGATGTCGG TACGGCACAG ATGTTTGGCG AACAGGTTGC CGCCGTCATC
AATAACGTTC GTCTTCTACG TGAGGCACAA GAACGGGCCA GCCAATTTCA GGTTCTGCAA
CAGTTCACCG TCCGTATTGG CGCCGTCCGT GATATCGATC AATTGCTCGA TGAAGCTACC
CAATTGCTTC ACCGTACCTT CGGGTACTAT CAAGTACTCA TTAGTGTCAT TGAGCATGAT
CAGCTCATCG TGCGTGCTGC CTATGGTCGC CTGATGCACT GCCAATCGAG CGAACAGGTC
TTTCCCCCCT TGCCGTGTGA TGTCGGTATC AGCGGTTGGG TCATTCGCCA TGCCCAACCG
GCAATCGTCA ACGATGTCCT CCGCGACGAA CGCTATGTCT GCCATCCCTT TCTGCCGGCT
ACTGCGGCTG AAATGATCGT CCCGATTTTG GTTGACGAGC GTGTATTCGG CGTCATCACC
ATCGAAAGCG ATGTTCGCGG GGTGTTTGCT CAAAGTGACC TCGACTTGGT GACGGCAATG
GCTCATCTGA TCGGTGTCAC TATCGCTAAC CTTCAACACG ACGCCGAACT CCAGCGCGCC
CGTGAACAGT TGATCGAACG TGACCGATTA CGGGCACTCG GTGAGTTATC TAGCGGTGTG
GCTCACGATT TTAACAACTT ACTGGCGAGC ATTCTTGGTC ACGTTCAGTT ATTGCTCAAC
GAGTACCACG ATCCACGATT GCAAGAGGGG TTACGGGCAA TTGAGTTAGC TGCGATCGAT
GGCGCAGCTA CAATTAAACG TCTGCAAGGA TTTGCCCAAA CCAGCCAATC GACTCCACAG
GGTGCTGTTG ATCTTAACCA AGTGGTCGAA GAGAGTCTGG CGTTGACTCG TCCCCGTTGG
CGCGACGAAG CACAGAGTCG TGGGATCGTG ATCGAGGTGC GTACCGACCT AGAGCCGTTG
CCGATAATTA CCGGTGATGC TCCTGCGTTA CGCGAATTGA TCATCAATCT CGTGTTAAAC
GCCATCGACG CACTACCCAA TGGTGGGACG ATAACGATCC GTACTACTCC TGCACCTCCT
GAGGTGTTCG GCGCAGCCGG TGTGCTGTTG GTGATCCAAG ATAATGGTGT CGGGATCGAT
CCCGCCCTCC ACGAACGGAT CTTTGCACCC TTCTTTTCGA CCAAAGGTGT GCGCGGTACC
GGAATGGGGT TGGCGATTGT TCGTGGTATC GTGCAACAGC ATGGTGGACG GATTACGCTC
GAAAGTGAAC CCGGTAAAGG TACGATGTTC CAGATTTGGC TGCCGGTAGG GCAACCACCG
GAACTACCCA CCTCATCATC ACAACCGTCA CCTATGGTGC CGCTGCACAT TTTAGTGGTT
GATGATGAAA CGGCGGTGCG GCAGGTGCTG ACCCGCATCC TTGAGCGTCA GGGTCATCGG
GTCGTCGAAG CTACTTCGGG CGAAGAAGCA CTTGCCCAGT ATCGGCCCGG TCGGTACGCG
ATTATCTGTA CCGATCTCGG CATGCCCGGT ATGTCAGGCT GGGAATTGGC CTCACGGGTG
CGACAGATCG ATACAACGGT TCGGATCGTT CTGGTGACCG GCTGGAGCGA ACAAGTTGAT
CCGGCTGATA TGCAGCGGTA CGGTGTCAAC GCGATACTCG CCAAACCGTT TACAATCCAA
TCGGTGCAGA ATCTCATTGC TAGCTTAGTT GACGCAGCCG AACAAGTATG A
 
Protein sequence
MLLVIPRVYN KIMTNQDTPS DQSAVTSPIA TILALSKAVL SSLELPDVLQ RVLVATRDLS 
GADVVTIWLL DEQGEWLTSA AILGLEDRPE RERIFRLRVG EGIAGWSVAH RQILQITNPL
HDPRYVPKLD RHPAIILSIP LIVRAQCVGA LSLSRYTVAT PFSQEVIETI SIFADQAAIA
IDNATKAQIL RYATARERIL AHSTESTQAE TLILAELAVV LGVEPLLIST TYDGNLLDHA
GHNVHYDDLN RWRNEGALIT DLRFDGLPAW LVVNQPGRYW SKEDTNLLAF AANQLMRARQ
RAFEQRAHAR AEALHRLVAL TNARIDQASV LDQILAELQR FIPFDSACVF VVHDDEYVRL
IAQRGLRTPV DQVTLFAGPG STIYDLRQVG TARYHPDVQQ LPGWQKVPDS EIIRSWIGVP
LRVDQTTIGF LTIDKWIPNA FTAEDVGTAQ MFGEQVAAVI NNVRLLREAQ ERASQFQVLQ
QFTVRIGAVR DIDQLLDEAT QLLHRTFGYY QVLISVIEHD QLIVRAAYGR LMHCQSSEQV
FPPLPCDVGI SGWVIRHAQP AIVNDVLRDE RYVCHPFLPA TAAEMIVPIL VDERVFGVIT
IESDVRGVFA QSDLDLVTAM AHLIGVTIAN LQHDAELQRA REQLIERDRL RALGELSSGV
AHDFNNLLAS ILGHVQLLLN EYHDPRLQEG LRAIELAAID GAATIKRLQG FAQTSQSTPQ
GAVDLNQVVE ESLALTRPRW RDEAQSRGIV IEVRTDLEPL PIITGDAPAL RELIINLVLN
AIDALPNGGT ITIRTTPAPP EVFGAAGVLL VIQDNGVGID PALHERIFAP FFSTKGVRGT
GMGLAIVRGI VQQHGGRITL ESEPGKGTMF QIWLPVGQPP ELPTSSSQPS PMVPLHILVV
DDETAVRQVL TRILERQGHR VVEATSGEEA LAQYRPGRYA IICTDLGMPG MSGWELASRV
RQIDTTVRIV LVTGWSEQVD PADMQRYGVN AILAKPFTIQ SVQNLIASLV DAAEQV