Gene Cagg_3287 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3287 
Symbol 
ID7267761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3983943 
End bp3987077 
Gene Length3135 bp 
Protein Length1044 aa 
Translation table11 
GC content68% 
IMG OID643568103 
Producttranscriptional activator domain protein 
Protein accessionYP_002464576 
Protein GI219850143 
COG category[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family
[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0920226 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000903469 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTCGCAGC TCTCGCTCAC TTTATTGGGG ACACTGGCGA TCACCCTGGA TGGCCAGCCC 
GTCGCCGGCA TCGAGTCGGA CAAGGCGCGC GCGCTGCTGG TGCGCCTGGT GCTGGAGCCG
GAGCGCGCCT TCCGCCGCGA GGCGCTGAGC GCGCTCTTGT GGCCCGAAGC CGCGCCCGCA
CAGGCTGCCC AGAACCTCCG TCAGGCGCTC TATAGCCTGC GCCGCGCTCT CGGCCAAGCC
TTTTTGCTGA CGACGCCTCA AACCGTGCAG TTCAACGCCG CCGACGTGAC GGTGGACGCG
CTGACTTGGC GTCGCCTGTG GAGCGAGACG CAGACGCATC GCCACCGCCG CCGCGAGACC
TGCCGTCCCT GCCTGGAACG CCTGGAGCAG GCCGTGCCCC TCTATCGCGG CGACCTGCTG
GTCGGCTTCG CGCTCAAGGA CAGTGCAGAG TTCGATGACT GGCTGACGAT GGAGCGCGAG
CGGCTGCACG TACAGGCGCT GGAGGCGTTT ACCCTGCTGG CGAACGCTGC CGAACGGCGC
GGCGATTACC CTGCCGCGCA GGAGTACGTG CGGCGGCTGC TGGCGCTGGA ACCCTGGCAG
GAGGTCGCGC ATCGGCATCT GATGCGCCTG CTGGCCCTGG ACGGTCGGCG CGCGGCGGCG
CTGGAGCAGT TTGAGGTGTG CTGGCGCGTC CTGGCCGAAG AGTTGGGCCT GGAGCCGACC
GAGGAGACCC GCGCCCTCCA CGCGCGCATC CGCGCCGGCG AGCCGCTTTC CGCCGCGATG
CCGCTTCCCC CCGCGCCGCC CACCGATTTG CCGCTGCAAC TCACCTCGTT CATCGGCCGC
GAGCGCGAGG TGACGCTGTT GAGCGAACGC CTGGGCAACC CCGCCTATCG CCTCATCACC
CTCACCGGGC CGGGCGGGGT GGGAAAGACG CGCCTGGCGC TGCAACTCGC GGCGACCCTC
GCGGAACAGT TTGCCGATGG CGTCTGTTGG ATCTCGCTGA GCGACGCCGT CACCGAAAGC
AACCTGATTC TGGCGATTGC CGACGCGCTG CACCTGCGCC TTTCCGGTGC GCAAGCCCTG
CGCGCGCAAC TCTTGCAGGC GCTGCGTCAC GACCGGCGCG ACCTGCTGCT GGTGCTGGAC
AATTTCGAGC AACTGCTGTC CGTTGGCGGC GCGACGCTGG TGCTGGACGT GCTGCGCGCC
GCCCCGCGCT TCACCCTGCT GGTCACCTCG CGCGAGCGGC TGAATCTGCA GGCCGAGTCG
GTGCTCCCGC TGGAAGGCCT GGGCTGCGAT CTGCCTGCTC CTGACGCGCC GCCCTCTGAA
GCCGCGCAGT TGTTCGTCGA GCGCGCCGGA CGTGCCCGAA TGGACCTGAG CGTGGGCGCG
GCAGACCAGG CGACGGTCGC GGAGATTTGT CATCTGCTGG AAGGCTCGCC GCTGGGCATC
GAGCTGGCCG CCGCCTGGGC CGGTGAAATG TCGCTGGAAG GCATCGCTGA AGCCATCACC
GCCACGCGCG ATTTCCTCGC CTCCACCAGT CCCGACATGC CCGACCGTCA CCGTAGCCTG
CGCGCAGTCT TTGAAGGCTC CTGGCAATTG CTTTCTCCGG AAGAGCAGTT CGCGCTGATG
CGGGTTTCCA TCTTTCGCGG CGGCTTTCAG GCCGAGGCCG TGCAGCACGT CGCCGGGGTG
AGCGCGGCAA CGCTCAGCCG TCTGGTGCGC AAATCGTTGC TCTTCCTGGA TGAGCCGTGC
GGTCGTTACG GACTGCACGG CGACATCCGC TACTATGCGG CGGAGAAGCT GGCCGCGCAG
CCGTCCACCG CGCAGGAAGT GGCCGCGCGT CACGCCGCGT ATTTTGCTGA CCTGGTGAAG
CGACGAGAAC AGGCCCTGCG CGGACGCGCG CAGCAGGCGG TGCAGGCCGA GCTGGAACCC
GAATGGCAGA ACGTGCTCGC CGCTTGGCAA TGGGCCATCG CCCACGGCGA CGAGGCGCTG
CTCACCCACC TGACGCACGG GCTGTTTGCC TTCTGCGAAG CCAAATCCTG GTTCCGCGAA
GGTGCGACCC TCTTCCAGCC CGCTTTAGAG CGGATGCGGG AAGCGGCCCG CGCCGACCTG
GCTGCAGCGC GCCTCCTCCG CCGCCTGTTG GGACGGCAGG CTGTCTTTTG CCGACAACTC
TCGCAGTACG CGCAAGCGCA TCACCTGATT GAAGAGGGCC TGGCTCTGCC GGGCCTGCCT
GACGATGAGG AGCACGCTTT CCTGCTGTAT CAAAAGTCCT GGGTGGACTT TTTGCAGGCG
CGGTACGTGC AGGCGCGCAC GTGGGCCGAG GCGAGCCTGG AGCGTTACCG CGCGCTGGGG
CAGCCGGTGG GCATCGGCGA TAGCCTCTAT ATGCTCGGCT GGACGGCCTA CGAGTTGGGG
GATTTTGCCG CCGCCGAGGC GCTCTGCCTG GAGGCGCGGG CGGTATGCGC GCAGGCCGAT
TATGCCTGGG GAGTGCAGTA CGCCATCTAT GGGCTGGGGC TGGTGCGACG CGCGCAGGGG
GACTATGCTG CCGCCCGCCG CTGTTTCGAG GAGAACATGA CGTTTTGCGA CGCCATCGGC
TACCTGTGGG GCGTGGCACA GGCGCGCATC AATCTGGGGC TGGTGGCGCT GGCCCAGGAT
AAAGTGGAAG ATGCCGAAAC GCACTTTCAG AAAAGTCTGC TCATCGGTGA GCAAATCGGC
AATGAATGGG TCAACGCGCA GAGTCAGAAA GGCCTGAGCG CAGCGGCTTT GGCGCGCCGC
GACCTGCCCA CCGCGCTGAC CTTGGCGGAG CGCAGCCTGG CGCTCTATCA AGCGATGCAG
GATCGGGATG GGATGGCGGA TAGCCTGCTG CTGCTGAGCC AGATCGCGCT GGCAAGCGGT
GATCTCCCCG CCGCCCACCG CGCCCTGACG GAAGCCGAGG GGTTGATCCA GGCTACGGAA
AACGGCTTCC GCGCCGCCAG AGCGCTGGTG CAGCGGGCGG ACATCCTGCT GCGGGAAGGG
GAAACTGCGC AGGCGCGGGC GCTGCTGGAG GAAACGCTGC GCCATCCGGA CTGCGAGGCG
TCCATCCGTG CGTACGCCAC GGCGGCGCTG GCGCACAAAA TCAAAGGGGA CTGTCTATGC
GAATCACGTA CATAG
 
Protein sequence
MSQLSLTLLG TLAITLDGQP VAGIESDKAR ALLVRLVLEP ERAFRREALS ALLWPEAAPA 
QAAQNLRQAL YSLRRALGQA FLLTTPQTVQ FNAADVTVDA LTWRRLWSET QTHRHRRRET
CRPCLERLEQ AVPLYRGDLL VGFALKDSAE FDDWLTMERE RLHVQALEAF TLLANAAERR
GDYPAAQEYV RRLLALEPWQ EVAHRHLMRL LALDGRRAAA LEQFEVCWRV LAEELGLEPT
EETRALHARI RAGEPLSAAM PLPPAPPTDL PLQLTSFIGR EREVTLLSER LGNPAYRLIT
LTGPGGVGKT RLALQLAATL AEQFADGVCW ISLSDAVTES NLILAIADAL HLRLSGAQAL
RAQLLQALRH DRRDLLLVLD NFEQLLSVGG ATLVLDVLRA APRFTLLVTS RERLNLQAES
VLPLEGLGCD LPAPDAPPSE AAQLFVERAG RARMDLSVGA ADQATVAEIC HLLEGSPLGI
ELAAAWAGEM SLEGIAEAIT ATRDFLASTS PDMPDRHRSL RAVFEGSWQL LSPEEQFALM
RVSIFRGGFQ AEAVQHVAGV SAATLSRLVR KSLLFLDEPC GRYGLHGDIR YYAAEKLAAQ
PSTAQEVAAR HAAYFADLVK RREQALRGRA QQAVQAELEP EWQNVLAAWQ WAIAHGDEAL
LTHLTHGLFA FCEAKSWFRE GATLFQPALE RMREAARADL AAARLLRRLL GRQAVFCRQL
SQYAQAHHLI EEGLALPGLP DDEEHAFLLY QKSWVDFLQA RYVQARTWAE ASLERYRALG
QPVGIGDSLY MLGWTAYELG DFAAAEALCL EARAVCAQAD YAWGVQYAIY GLGLVRRAQG
DYAAARRCFE ENMTFCDAIG YLWGVAQARI NLGLVALAQD KVEDAETHFQ KSLLIGEQIG
NEWVNAQSQK GLSAAALARR DLPTALTLAE RSLALYQAMQ DRDGMADSLL LLSQIALASG
DLPAAHRALT EAEGLIQATE NGFRAARALV QRADILLREG ETAQARALLE ETLRHPDCEA
SIRAYATAAL AHKIKGDCLC ESRT