Gene Cagg_0366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0366 
Symbol 
ID7268467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp456562 
End bp459855 
Gene Length3294 bp 
Protein Length1097 aa 
Translation table11 
GC content60% 
IMG OID643565234 
Producttranscriptional activator domain protein 
Protein accessionYP_002461748 
Protein GI219847315 
COG category[K] Transcription 
COG ID[COG2909] ATP-dependent transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.37715 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACCG CTATGGCGCG TGAAAAGACA GTGTTGTTAC CTGCGAAAGT CGCACCACCG 
CGACCACACC GCTACCGACT GGTCCGTCCG GCGGTGACGG CGCGACTCCG CGAGGCGTTT
GATTACCGGG TAACCCTGGT ACAGGCCGGG GCCGGCTATA GCAAAACGAC AGCACTGGCC
GAGTTAGCGA CAAGCATGGC GCCGGTGTGT TGGTATACCA TCGGCGAAGA TGATCGTGAT
CCGGTCACAT TTCTTACCTA CTTGACGGCA GCATGTGCGC CGGTGCTGCC GCAAGGTGTT
CCCGACGTAC TGGCGATGCT GCATGCCCGG CCCGCCGACC GCACGGTATG GACGCACGTG
CTCGACGAGT TGCTGAATGC GCTGGCCGGT GTTCCACCCA AACCCACCCT TCTTATTCTC
GATGACTACC ATTTTGTCAC CGTTTCTGCG GAGATTCGAG CCTTAACCGA ACGGTTAATT
ACCTATGCAC CGCCGTGGTT CAGTTTGTTG ATCGCCACCC GTTACCCGAT TGTCAGCGGC
GAGTTGGTAC GCTGGCGTGC GCGCGGTGAG GTGCTCGAAT TAAACCGCGA AGCTTTGGCC
TTTACCCGCG ACGAAATAGC TGCGCTCTTC ACCGAGGTTT TTGGCATCGT GCTGAGCGAC
ACCGACATCG ATCTACTCAG TCACCAGACC GAAGGATGGC CCATCGCACT GCAACTGGTC
TGGCAGGGAC TACGTAGTAG GCAGGTGCGG AGCGTGGCCG AGGTATTGGC AGACAGCCCA
ACGTCTTTGG CGGCACTGTT TGACTTTTTG GCGTCGGATG TACTGGCTCG ACAGCCACCA
GAGATTGCTG CCTTCTTACG GGACACAGCC CCGTTGCGGG TCTTAACAGC GGCAGCGTGT
GACGCGGTAC GACAAGCCCA TGATAGTTCC GAGCTACTCG CTCAGGTGCG TGATCGCGAT
CTCTTTATCG TCGAGCTAGG CGACGAGCAC TACCGCTATC ATCACCTCTT CCACGACTTT
TTGCGTCAGC AGATTCGGCA CGACCCCGAT ATAGCGGAGC GACACCGCCG AGCAGCGCGG
CATTATACCG CTATCGGTGC TGCAGAAGAA GCGATCCATC ATTGGTTTGC CGCCGGTGAA
GGCAGCATCG CCGCCGATGC GATTGAAAGC GCCGGCGAAG AGGTGTTGCG CAATGGCCGG
CTTGACACGT TAGCCGATTG GATTGACGCC CTCCCGGCTG AGATCATCGC CGCGCGGCCC
CGGCTGCAAT GCTACCTCGG TGATCTGTAT CGGTTACGCG CTCGCTTTGA TGAAGCACGG
CGCTGGTACG TCCAAGCCGA AGCGACGAGT CGGCAGCGCG GTGATCGGGC GGAATTGGCC
CGCGCCTTAT ACGGACTGGC GCAGGTTTAT ATCGATCAAG TGCAGCCATC CCAAGCCGAG
AGTGTGCTGC AAGAGGCGCT ACGGGTGAGC GAAGGGCTAG AAGATCATCT GGCCCGCGCC
CGCGTACTGG AGTTGCTGGC CGAAAACAAG CTGAATATGG GCCAGCCGGC TGAAGCGGAA
GCCTTGCAGC ACCAGGCTCA ACAGTTGCGT GCGGCCAGCC CTGCCGCCGA TTTACTGAGC
GCACGGGTTA AGCTGCGTAC CGGCCAGCTT GCCGCAGCAC GTGCCTTACT ACAGGCGTGG
CGCGATCATG AACGAATTGT GACGGCACGC GGTGTAACTC CGGCCCCACG CGGGCACCGT
GAAGCAGTGT TGGTACTGGC ATTGATCGCC GCATTTGAGG GCGATCCTGA ACAAGCCTTG
GCCTTTGCTA CCGAAGGGGT GCAGGTCGGG TTTGAACGTA ACTCTCAGTT CATCACCTCG
GTCGCATATG CGCGAATTGG TCATGCGTGG TTGTTAAAGA GTGCAACCGA GGGCATCGCA
GCACGAGCGC ACGCGCTCGA TTATTACCGG CAAGCCTTAG CCGAAGGTAT GGCATTGGGA
GTAGAACGCT TGCGGGTAGA ACCACTCTGG GGGATGACCC GTCTCTACGG CCTGGCCGGA
GATCTGGCCG CTGCCGAGAG CGCTGCCGCC GATGGCCAAG CGTTTTGCCG CTGGGCCGGT
GATCTCTGGT TGGGCGCAAT GATCCAGATT CAACGTGGGG TGAGTCATTT GCTGGTCGGT
GAGGTGGAGC GGGCGCTAGA GCTGTTGGTG ACGGCACGTG CCGATTTACG CGCTTGTAGC
GACCGATTTG GTCAGGCAGT AACAGCGCTC TGGCTGGCGT TGGGGTATCA CGAGTTGCGA
CAAGAGGTAG CAGCAATCGC TGCAATCGTC GAGGCCCTTG AGTTAAGTGC AACGCATGGC
TACGACTATC TGTTTACACG ACCTACGTTT CTTGGCTTGA TCGATCCACG CCGAGCATTG
CCGATCTTGT TAGCGGCACG TTCGCGTGGT CATCATCGCG AATACATCGA TCGGTTATTA
TCGATCCTTG GCCTACGCGG CCTCGACGTA CATCCCGGCT ACCAGTTACG GGTCCAGACG
CTCGGTGGCT TTCGCGTGTG GCGTGGCGAT CACGAGATCG AGGCGCGCGA ATGGCAGCGC
GACAAAGCCC GCCAACTCTT TCAAGCCCTA ATTATCCATC GCGACCGGTG GTTGCAACGT
GACGAATTGG TCGAAATGCT GTGGCCACAC CTTGCGCCAG AGACGGCCAT CCGCGATTTT
AAAGTGGCGC TCAGCACGCT TTACCGCGTC TTAGAGCCGA TCCGCACCGA TGCCCCCTCG
GCGTTCATTG TGCGTGATGG CAGTGCGTAC CGCCTGCGTC CAAACGCCGA CCTCTGGCTC
GACTGTGCCG AGTTTCGTAC CGGTTGCACG ACCGGTCTCC GTCTCCTCGA TCACGGGCGC
ATCGCTGAAG GCATCCACCA CCTGCACACT GCGCTCCAAT TGTACCAGGG TGATTTCTTA
CCCGACACCC TCTACGAAAG TTGGGCCATG ACCGAACGGG AACGATTACG CACTATGTTC
GTGCGTAGCG CAGACCGTCT GGCCCAGTTT CTGGCCGAGC AAGGTCGCGA CGACGACCTG
ATTGCGCTAT CCGAACGTAT TCTCGTCTCC GATCCGTGCT GGGAACGGGC CTACCGCTTT
CTGATGCTGG CCTACGCTCG GCGCGGGAAC CGCGCCACGG CCCTGCGGAT GTTCCAACGT
TGCCGTGAAA CGCTCGCCCG TGAGCTAGAC GTCGAACCGG CGCCGGAGAC CATCGCCTTT
GCCGAGCGAC TCCGTCACGG GGATCCTATC CTTCCCCCTG TTACCGATTT GTAA
 
Protein sequence
MTTAMAREKT VLLPAKVAPP RPHRYRLVRP AVTARLREAF DYRVTLVQAG AGYSKTTALA 
ELATSMAPVC WYTIGEDDRD PVTFLTYLTA ACAPVLPQGV PDVLAMLHAR PADRTVWTHV
LDELLNALAG VPPKPTLLIL DDYHFVTVSA EIRALTERLI TYAPPWFSLL IATRYPIVSG
ELVRWRARGE VLELNREALA FTRDEIAALF TEVFGIVLSD TDIDLLSHQT EGWPIALQLV
WQGLRSRQVR SVAEVLADSP TSLAALFDFL ASDVLARQPP EIAAFLRDTA PLRVLTAAAC
DAVRQAHDSS ELLAQVRDRD LFIVELGDEH YRYHHLFHDF LRQQIRHDPD IAERHRRAAR
HYTAIGAAEE AIHHWFAAGE GSIAADAIES AGEEVLRNGR LDTLADWIDA LPAEIIAARP
RLQCYLGDLY RLRARFDEAR RWYVQAEATS RQRGDRAELA RALYGLAQVY IDQVQPSQAE
SVLQEALRVS EGLEDHLARA RVLELLAENK LNMGQPAEAE ALQHQAQQLR AASPAADLLS
ARVKLRTGQL AAARALLQAW RDHERIVTAR GVTPAPRGHR EAVLVLALIA AFEGDPEQAL
AFATEGVQVG FERNSQFITS VAYARIGHAW LLKSATEGIA ARAHALDYYR QALAEGMALG
VERLRVEPLW GMTRLYGLAG DLAAAESAAA DGQAFCRWAG DLWLGAMIQI QRGVSHLLVG
EVERALELLV TARADLRACS DRFGQAVTAL WLALGYHELR QEVAAIAAIV EALELSATHG
YDYLFTRPTF LGLIDPRRAL PILLAARSRG HHREYIDRLL SILGLRGLDV HPGYQLRVQT
LGGFRVWRGD HEIEAREWQR DKARQLFQAL IIHRDRWLQR DELVEMLWPH LAPETAIRDF
KVALSTLYRV LEPIRTDAPS AFIVRDGSAY RLRPNADLWL DCAEFRTGCT TGLRLLDHGR
IAEGIHHLHT ALQLYQGDFL PDTLYESWAM TERERLRTMF VRSADRLAQF LAEQGRDDDL
IALSERILVS DPCWERAYRF LMLAYARRGN RATALRMFQR CRETLARELD VEPAPETIAF
AERLRHGDPI LPPVTDL