Gene Cagg_2962 details


Gene Information

Locus tag: Cagg_2962
Symbol:
ID: 7266493
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3630756
End bp: 3633797
Gene Length: 3042 bp
Protein Length: 1013 aa
Translation table: 11
GC content: 58%
IMG OID: 643567784
Product: transcriptional activator domain protein
Protein accession: YP_002464258
Protein GI: 219849825
COG category: [R] General function prediction only
COG ID: [COG3899] Predicted ATPase
TIGRFAM ID:
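
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(3630756, 3633797)
print(length, protein_length(length))  # 3042 1013
```

Both results match the Gene Length and Protein Length values listed in the record.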


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000192637
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATTACGG TTCAGCTTTT TGGTTCGCCG CTGGTATTGC GTAACCATCA GCCGGTCGTA 
TTGTCGCGCC GGCGCAGTCG GGCACTACTC TACTACCTTG CAGCACATAC GCAGCCGGTG
CGGCGGGAGC AGGTGTTGAC CTTGTTGTGG CCCGACCACG AACGCAGCTC TGCCCAACAA
CTGCTGCGTT CAACACTCTA CACCGTTCGG CAAGTTCTCG GTGAGGCGCT AGTTGCCGAT
AATGAGCAAT TGGCGCTGAA GGCAACGGTT GATCTGCGCA CGCTGCAAGC CGTACTTGCC
GATCCACACG CAACGGTACA GATGTTAACG TCGGCATTGC CAACCCATGC CGTTGAGGTG
TTAGCCGGGT TTGATCTGCC CGACAGTGAG CCGTTTCAGT CGTGGCTCAC CGGTGTGCGT
GAACAAGCGC GGGTGCTGGT TGGGCGCGGT TGGTTGCGGC TGGCCCATTT GTATGAACAG
CACGGCGATT TGGCGGCTGC ACTGCATGCC CTTGATGTGG CGTTGAGCAT CGATCCACTA
CAAGAGGATG TGCAGCGTGA TGCGATGCGT TTGGCGTATC TGAACGGTGA TCGGGTAGGG
GCTATCCGTC GGTTTGAGCA GTTTCGCGAT CTGCTTGACC AAGAGTTGGG TGTGCCACCG
ATGCGCGAAA CGCAGGCGCT GTACGATGCG ATTATTACCG ATACTCTTCC CGGTACGTCG
GCTCCGGTGC GTCTGAGTGC GGTATCGCTG AACGGTGTGC TGCCCTTTAT TGGCCGTGAT
ACCGAACTGT TATTGTTACA AACAGAAGCC CAGCCCGGTC GTTTACTGTT GATTGAAGGT
GTGTCCGGGA TTGGTAAGAC GCGACTCGCC GAGGAGTTTC TTGCCCGTTG CGGTGGGCTG
GTGTTGAGCG GGAGTGCCCG TGAACTTGAG CAAAATTTGC CCTACCATCC TATCCGCACT
GCTCTTGCTG CGTTTGTAGC TCGGCCTGAA TGGCCTGCGC TGCACAGTCG ATTGCTGTTG
GCGCCGATCT GGTGGCATGA ACTGGGGCGG TTGGTGCCCG ATCTGGCGCT ACCGCCGACC
ACACCACCCG ATGAAGCGCG GTTGTGGGAA GCGATTGCCC GCTTCTTAAA CGAGTTGGCC
CGTCAGCGAC CGGTCTACCT CCTGATCGAT GATCTTCAGT GGGCTGATAC GAGTACCTTA
GGTTTGCTCG GCTATCTGTT GCGGCGTAGT ACCGATGTCG CGTTAACGGT TATCGCTACG
ACTCGCCCTC CCGAACCGCG TAGCGCTCTT GCTCATCTCA TCCGTGCTTT GATCCGGGAA
GAGCGCCTCG TGCGTGTCGT ACTGGAACGG CTGGCACCGG CTGCAATTCT GACGATTGCC
CGCCAGCTAA GCCCGGCCTT TGCCCACCCT CTCGCCGATT GGTTGGAGCG TAATGCAGAA
GGGAATCCGT ACATCGTTTC CGAACTGGTG CGCCACGCGC GTACCGCCGG TTGGCTGACA
CCCAATGGTA TGGTCGATCT AAGTGCATTC TCGGCTCAAT CGGTGGTTCC GCAGAGTGTG
TATCATCTCA TCGAAGCGCG CCTCAGTCGA TTGTCAGGTG CGGCCCGGCG CGTGCTCGAT
GCGGCGGTTG TCGTAGGTCG TGATTTCGAC TTTGCTGTTG TCACGAAAGT TGCCGCGCTT
TCCGAAGAGG CAACGCTAGA CGCCCTCGAT GAATTGCTGG CCGCTCGTTT GGTGTTGCCG
CTCAGCGATG GTCAGTTCCG TTTCGATCAT CCCTTGACTA TGGAAGTGGT CTATCGGAAT
CTCGGCGATC TGCGTCACCG TATGCTTCAC CGGCGCGTAG GGGAAGCTTT GGAGGCGTTG
TATCGCGATC GGATTGATGA GGTAGCCGGG CAAATCGTGT TGCACTTTAT TGAGGGTGGT
TGGTCGGAAC GGGCTGCCGG CTATGCGTTG CGGGCCGCCG AACGAGCCGC TGCGTTGGCA
GCGTGGCCGG AAGCGGCAGA GTTTTATCTC AAAGCTTTAG ACGGTGTGTT GCCGCCTCAA
CGTTCAACCG TCTTGCGTAA ACTCGGCTAT GCTCGGCTAT ACGCCGGTGC AGCGGGGTTG
GCTGCCGAAG CCTTTCGTGA AGCGATGGGT GTGGCGTCAA CCCTGCATGA AGCCGTTGTC
GCGCGATTAG CACTTGCCCA TGCCCTGATC CCGCAGGGTC GCTACGCCGA TGTGATTGCG
TTGGTGAGCC AGATCGATCC TGATATTGAT CCACGTCAAC AGGCAGAAGC TCTTTTTCTC
TGGGGGACGG CGCTCTCGTT GGAAGGTGCC GATCTCGGTG CTGCCACCGA ACGGTTAATG
ATGGCCGAAG CCGTGTTACG TGCTTGCCCG ACGGCTGAAG CAGCCGCTCT GGCGCAAGTC
TCGTTTGAGC TAGGCAATAT TGCCGCCCAG CAGGGCGATC TCACTACCGC CATTGCCCGC
TACGAGATGG CCCAACAGAT CGCCGACGAA GCCGGTGATA GTGCGCTGAC GTGGCGAGTA
TTGGCTCGCA ATAATCAGGC GTATCATCGT CTGCTGCTCG GTGAAATCTC GGCTGCCGAG
ACTATAATCA CTGCCGCATT GAGCCTGGCT GAAGAAGGTG GTTTGCTCTC GGTTATGCCA
TACCTTCTCT CGACCGCCGG TGAAGTAGCG TTGGCGCAAG GTGATCTGGA GACGGCTATC
GCTCGCTTCC AACAGGGGTT AGCGCTCGCC ACTCAATTCA GCATTCCTGA ACGGGTAGCC
GGTATCACGG CCAATCTTGG CTTAGTCGCG TTGCAACGTG GTGATACGAC CTGTGCTGTG
CATTACCTCT CGACGGCGTT AGCCCAAGCC GATACGCTTG GTGCCCGTCA TCTGGCTGCC
CAGATTCGGG TTTGGCTGGC ACCACTGTTA CCACCGCCTG AAGCGCGGAT ACGGCTGGCC
GAAGCCCAAG CAATTGCCGA GGCGGGCGGC AGACAACGCC TGCTGGCGGC AATTGCCCAA
GTTCGAGCAG CGTTGTCCGA TCAAGCGACT GTTCCGCGTT GA
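
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any calculation such as the GC content field. The sketch below shows one way to do that; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGATTACGG TTCAGCTTTT TGGTTCGCCG CTGGTATTGC GTAACCATCA GCCGGTCGTA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full sequence through the same two functions gives a value that can be compared with the GC content field in the record.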
 
Protein sequence
MITVQLFGSP LVLRNHQPVV LSRRRSRALL YYLAAHTQPV RREQVLTLLW PDHERSSAQQ 
LLRSTLYTVR QVLGEALVAD NEQLALKATV DLRTLQAVLA DPHATVQMLT SALPTHAVEV
LAGFDLPDSE PFQSWLTGVR EQARVLVGRG WLRLAHLYEQ HGDLAAALHA LDVALSIDPL
QEDVQRDAMR LAYLNGDRVG AIRRFEQFRD LLDQELGVPP MRETQALYDA IITDTLPGTS
APVRLSAVSL NGVLPFIGRD TELLLLQTEA QPGRLLLIEG VSGIGKTRLA EEFLARCGGL
VLSGSARELE QNLPYHPIRT ALAAFVARPE WPALHSRLLL APIWWHELGR LVPDLALPPT
TPPDEARLWE AIARFLNELA RQRPVYLLID DLQWADTSTL GLLGYLLRRS TDVALTVIAT
TRPPEPRSAL AHLIRALIRE ERLVRVVLER LAPAAILTIA RQLSPAFAHP LADWLERNAE
GNPYIVSELV RHARTAGWLT PNGMVDLSAF SAQSVVPQSV YHLIEARLSR LSGAARRVLD
AAVVVGRDFD FAVVTKVAAL SEEATLDALD ELLAARLVLP LSDGQFRFDH PLTMEVVYRN
LGDLRHRMLH RRVGEALEAL YRDRIDEVAG QIVLHFIEGG WSERAAGYAL RAAERAAALA
AWPEAAEFYL KALDGVLPPQ RSTVLRKLGY ARLYAGAAGL AAEAFREAMG VASTLHEAVV
ARLALAHALI PQGRYADVIA LVSQIDPDID PRQQAEALFL WGTALSLEGA DLGAATERLM
MAEAVLRACP TAEAAALAQV SFELGNIAAQ QGDLTTAIAR YEMAQQIADE AGDSALTWRV
LARNNQAYHR LLLGEISAAE TIITAALSLA EEGGLLSVMP YLLSTAGEVA LAQGDLETAI
ARFQQGLALA TQFSIPERVA GITANLGLVA LQRGDTTCAV HYLSTALAQA DTLGARHLAA
QIRVWLAPLL PPPEARIRLA EAQAIAEAGG RQRLLAAIAQ VRAALSDQAT VPR
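
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline: it builds the codon-to-amino-acid assignments used by translation table 11, which are the same as those of the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGATTACGG TTCAGCTTTT T"))  # MITVQLF
print(translate("GTTCCGCGTT GA"))            # VPR
```

The two fragments are the first 21 and last 12 bases of the gene sequence; their translations match the start (MITVQLF) and end (VPR) of the protein sequence in the record.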