Gene Cagg_1953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1953 
Symbol 
ID7268869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2384687 
End bp2387938 
Gene Length3252 bp 
Protein Length1083 aa 
Translation table11 
GC content59% 
IMG OID643566791 
Producttranscriptional activator domain protein 
Protein accessionYP_002463284 
Protein GI219848851 
COG category[K] Transcription 
COG ID[COG2909] ATP-dependent transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00247822 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGCAACGTA GTACGTTGCT CGCGAAACTG ATCCCTACTA TCCTCATGGT ATCACAAGCG 
GTTACAGGTG AGTATAACGA ACATCTGTTG CGGCCTCGTC TATTGCCACC CCCACCACCG
TTGCACCGCA TCCGTCGGGT GCGGGTTGAG CAGCGGCTGG CGGTGGCTGT CGATGTCCCA
TTGACTATTG TAGTAGCACC TCCCGGTAGT GGCAAGACGG TTGCCTTAGC CGCTCTCGCG
ACCCATGGTG GCTGGCCGGC TGCATGGTGT CGTGCCGATA CCAACGATGA TCCATCGCGC
CTGCTCGCCC ACCTTGCTGC GGCTCTGAGT CGGGTAACAT CCCTCGACCC CAATCGTCTC
CCACGAACGA TTGATGGTTT GATTAACGCG CTAACCGGCG AGCTTGATGA TGAAACGTTG
TTGATTATCG ATGATGCGCA TCTGATCGAT GAGCAGCCAG AACTACGTGC CCTTATTGAA
CGGTTTATCG TGGCTCAGCC TCCAAGATTG CATCTGGTTC TCGCCGGACG GCGCGAACCC
AATTCGCCAC TTATCGCGAC GGCTCGTTTG CGTGGTGAAG CGTTGCTGAT CGAAGCCGCC
GATTTGCTTT TTACATCCGA TGAAGCAGCT CAGCTTTGGC AGCAGGCCGG CAAAACGCTG
CCGGTTGATT TCGATGAGTT AATGACATTC AGCGCCGGAT GGCCACTCGC CCTACGGATT
GCACTCGATG CGACCGATTG GCGGAGTGCG CTTGGGCTGC GTGAACAGAA GGGTAAGCCG
GCCCTTGACG ATTATCTGGA ACGGGAAGTG TTCGCGCTGC TGCCCGAGCC GTTACAACGT
TGCCTGCAAC GCAGCGCAGG TCTGCGCTGG ATCACCCCCG ATCTGTGTGC GGCCCTCGAT
CCAACCTTCA ACGCATCTGA ACTCATTGCC GAATTACGTC GTCGCCGCCT CTTTGTCGAA
CCGTTTGGTG AGCATGGTGT CCGGTTTCAG CCGCTGATCG CGGCGTGGCT GTCCCGTCGA
GCGGCTGCCG ATCCCGAATG GGTGCATCTG CACCGCCAGG CTGCGGCATA CTTTCAGCAG
CACGGTGATC ACGAAAGTGC GTTGTACCAT CAGATTGCTG CCGGTGATCC TGCCGCAGTG
ACCACCTTCC TCCCACTGGC CCGCACGCTG CTGGCCGAGC GTCGGGCCGA AGCTGTGCTC
GATTGGATGC GACGGTTAGC GTTGACCGGC GATGAAACCC CGGAATTGAT CGAGCTTCAA
GCCGCTGCAC TTCACCAGCT CAATCGGCTT GAAGCTGCCT TGGCTACCTA CCGCCGGGCC
GAGCGGGCTT TTGCCGCGAG CGGTGATCGA CTTGGTCAGT CGCGTTGTCT ACGTGGGCAG
GCTGCAATCT ACCTTGATAC CGTTCAACCG GCTCCGGCCA CCGATTTGTT ACGCCGGGCT
TTGAAGTTGT TGCCACGTCA GTGTGGTGCG GAACGGATCG AGTTGTTGCT GATGCAGGCC
GAGAATTGGG CTAATCGCGG GCGAGCCGAT ATCGGGTTGC GACTTGAGCA GACGGCGCGT
CAGTTGGCGA AAGCCCACGG GCTGTCCGCG CAGTACGATC CGCAGGCTGA AGTGCTCCGG
CCACGCTTGC TGTTGCGGGC CGGACGTCTG CGGGAAGCAC GGCAGTTGCT CGAAGAACGG
CTTTGGTCCG AACAGTCATG TACGCGAGCA ACAGGCCATC GTGAACCGCT TTTACTGCTG
GCGCTGATCA ACGCAATGCT TGGCGTGGGG CCACAAGCGC TAGCGTTTGC CCATCGAATG
TTGATCGAAG CGCAACAGAG CGGCAATTTG GTTACGGAAG CCCTGGCCGA GTTGCGGCTT
GGTCATGCCT ATCAGCTTAT CGCTCGTGGT GATGATCAAG CGGCGCTCCA ACACTATAGC
CGTGGTTTGC GTCTGTTGCA GCAGGTGACA GTGCCGCGAA CTCGTGCCGA GGGGTATCTG
GGCCTGACGC TCCTTCATGG TCATGCCGGC GATTTGGCCC GTGCCGAGGC CGATGCTCGC
GAGGGTCTGT TGCTGGCCGC TTCTGCCGGC GATGAGTGGG TCGCTGCCCT GATGTTGCTC
GCCCTCGGCA GTGTTACCCT TGTAGCCGAT GATCCGCGCG GTTACGAGTG GCTCGATCAG
GCCGAACAGC GATTTCAAGC CGGTCGTGAT ACCTTTGGCT TGTTTCTGGT GCATCTCTGG
CGAGCATTGG CAGCCTTGCG CGTCGGTCGG ACTGCTGCGG TCGATCCGTT GGTCGATCAG
GTCATGCACG AAGCGGTGGA GTATGGCTAC GAGCATGTCT TGATCGGACC GAGTCTGTTT
GGGCCGCGCG ATATTGCGGC GTTGGTGCCT TTGCTTCTAC GCGCCCGCAC GATGCCGGCC
CACCGCGATA CTGCTATTCG GCTGCTGCGG CAGGGTTTTC CCTCGATCGC CACCGATGAC
ACCGTTGATG ATTACCATCC CGGCTTTACA CTACGGGTGC AAATGTTGGG AGCATTCCGG
GTTTGGCGAG GTAATCAAGA GATTCAAGCT CGTGAGTGGC AGCGTGAAAA GGCGCGTCAG
CTTTTCCAGT TGCTGCTGAC GATGCGTGGT AATTGGGTAC AACGCGAGCA GATCTGCGCA
TGGCTGTGGC CTGATGCCGA TCTCGAAGCT GCCGAACGCC AGTTTAAGGT AACTTTGAAC
ACGCTTAACG CAGCGCTTGA GCCACACCGT CCGCCGCGCG TACCCCCGTT CTTTATCCGT
CGGCAGGGGT TGGCGTATAG CTTTGCCCCT TCCTTCGGCG TATGGATCGA TGTTGACGAA
TTTGAACTGC GTGCCAGTAG TGCATTGACG GCAACCGACC CCGATTTTGC GCGCCGCAGC
GCCCAAGCTG CGCTCCAACT CTACCGCGGT GATTATCTCG CCGAATCGCT GTATGATCCG
TGGACAACCG AAGAGCGTGA GCGTTTGTTG GCCCGTTATC TGGCAACGGC TGTTGCCTAT
GCCGAACGAC TGAGTGCCGA AGGAAAGCAC AACGAGGCGA TCCAGATCGC CGAACAGGTG
TTGCGTCGCG ATCGCTGTTA CGAGGAAGCC TACCAATTGC TGATGCGTGC CCATGCCCGC
GCCGGCAGCC GTTCGCAGGC GATGCGCGCC TATACACGTT GTGTCCAGGC GTTACGTGAG
GAATTAGGGA TTGAGCCGTT GGCTGAAACG GAAGCGCTCT ACCTGCGTAT CCGCTTGAAT
GAGCCGATTT GA
 
Protein sequence
MQRSTLLAKL IPTILMVSQA VTGEYNEHLL RPRLLPPPPP LHRIRRVRVE QRLAVAVDVP 
LTIVVAPPGS GKTVALAALA THGGWPAAWC RADTNDDPSR LLAHLAAALS RVTSLDPNRL
PRTIDGLINA LTGELDDETL LIIDDAHLID EQPELRALIE RFIVAQPPRL HLVLAGRREP
NSPLIATARL RGEALLIEAA DLLFTSDEAA QLWQQAGKTL PVDFDELMTF SAGWPLALRI
ALDATDWRSA LGLREQKGKP ALDDYLEREV FALLPEPLQR CLQRSAGLRW ITPDLCAALD
PTFNASELIA ELRRRRLFVE PFGEHGVRFQ PLIAAWLSRR AAADPEWVHL HRQAAAYFQQ
HGDHESALYH QIAAGDPAAV TTFLPLARTL LAERRAEAVL DWMRRLALTG DETPELIELQ
AAALHQLNRL EAALATYRRA ERAFAASGDR LGQSRCLRGQ AAIYLDTVQP APATDLLRRA
LKLLPRQCGA ERIELLLMQA ENWANRGRAD IGLRLEQTAR QLAKAHGLSA QYDPQAEVLR
PRLLLRAGRL REARQLLEER LWSEQSCTRA TGHREPLLLL ALINAMLGVG PQALAFAHRM
LIEAQQSGNL VTEALAELRL GHAYQLIARG DDQAALQHYS RGLRLLQQVT VPRTRAEGYL
GLTLLHGHAG DLARAEADAR EGLLLAASAG DEWVAALMLL ALGSVTLVAD DPRGYEWLDQ
AEQRFQAGRD TFGLFLVHLW RALAALRVGR TAAVDPLVDQ VMHEAVEYGY EHVLIGPSLF
GPRDIAALVP LLLRARTMPA HRDTAIRLLR QGFPSIATDD TVDDYHPGFT LRVQMLGAFR
VWRGNQEIQA REWQREKARQ LFQLLLTMRG NWVQREQICA WLWPDADLEA AERQFKVTLN
TLNAALEPHR PPRVPPFFIR RQGLAYSFAP SFGVWIDVDE FELRASSALT ATDPDFARRS
AQAALQLYRG DYLAESLYDP WTTEERERLL ARYLATAVAY AERLSAEGKH NEAIQIAEQV
LRRDRCYEEA YQLLMRAHAR AGSRSQAMRA YTRCVQALRE ELGIEPLAET EALYLRIRLN
EPI