Gene Information |
Locus tag | Cagg_3287 |
Symbol | |
ID | 7267761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 3983943 |
End bp | 3987077 |
Gene Length | 3135 bp |
Protein Length | 1044 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643568103 |
Product | transcriptional activator domain protein |
Protein accession | YP_002464576 |
Protein GI | 219850143 |
COG category | [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family [COG3903] Predicted ATPase |
TIGRFAM ID | |
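
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon that is not translated.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # subtract the terminal stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(3983943, 3987077)
print(gene_len, protein_len)  # 3135 1044
```

Under these assumptions the result agrees with the listed Gene Length (3135 bp) and Protein Length (1044 aa).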
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0920226 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000903469 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | GTGTCGCAGC TCTCGCTCAC TTTATTGGGG ACACTGGCGA TCACCCTGGA TGGCCAGCCC GTCGCCGGCA TCGAGTCGGA CAAGGCGCGC GCGCTGCTGG TGCGCCTGGT GCTGGAGCCG GAGCGCGCCT TCCGCCGCGA GGCGCTGAGC GCGCTCTTGT GGCCCGAAGC CGCGCCCGCA CAGGCTGCCC AGAACCTCCG TCAGGCGCTC TATAGCCTGC GCCGCGCTCT CGGCCAAGCC TTTTTGCTGA CGACGCCTCA AACCGTGCAG TTCAACGCCG CCGACGTGAC GGTGGACGCG CTGACTTGGC GTCGCCTGTG GAGCGAGACG CAGACGCATC GCCACCGCCG CCGCGAGACC TGCCGTCCCT GCCTGGAACG CCTGGAGCAG GCCGTGCCCC TCTATCGCGG CGACCTGCTG GTCGGCTTCG CGCTCAAGGA CAGTGCAGAG TTCGATGACT GGCTGACGAT GGAGCGCGAG CGGCTGCACG TACAGGCGCT GGAGGCGTTT ACCCTGCTGG CGAACGCTGC CGAACGGCGC GGCGATTACC CTGCCGCGCA GGAGTACGTG CGGCGGCTGC TGGCGCTGGA ACCCTGGCAG GAGGTCGCGC ATCGGCATCT GATGCGCCTG CTGGCCCTGG ACGGTCGGCG CGCGGCGGCG CTGGAGCAGT TTGAGGTGTG CTGGCGCGTC CTGGCCGAAG AGTTGGGCCT GGAGCCGACC GAGGAGACCC GCGCCCTCCA CGCGCGCATC CGCGCCGGCG AGCCGCTTTC CGCCGCGATG CCGCTTCCCC CCGCGCCGCC CACCGATTTG CCGCTGCAAC TCACCTCGTT CATCGGCCGC GAGCGCGAGG TGACGCTGTT GAGCGAACGC CTGGGCAACC CCGCCTATCG CCTCATCACC CTCACCGGGC CGGGCGGGGT GGGAAAGACG CGCCTGGCGC TGCAACTCGC GGCGACCCTC GCGGAACAGT TTGCCGATGG CGTCTGTTGG ATCTCGCTGA GCGACGCCGT CACCGAAAGC AACCTGATTC TGGCGATTGC CGACGCGCTG CACCTGCGCC TTTCCGGTGC GCAAGCCCTG CGCGCGCAAC TCTTGCAGGC GCTGCGTCAC GACCGGCGCG ACCTGCTGCT GGTGCTGGAC AATTTCGAGC AACTGCTGTC CGTTGGCGGC GCGACGCTGG TGCTGGACGT GCTGCGCGCC GCCCCGCGCT TCACCCTGCT GGTCACCTCG CGCGAGCGGC TGAATCTGCA GGCCGAGTCG GTGCTCCCGC TGGAAGGCCT GGGCTGCGAT CTGCCTGCTC CTGACGCGCC GCCCTCTGAA GCCGCGCAGT TGTTCGTCGA GCGCGCCGGA CGTGCCCGAA TGGACCTGAG CGTGGGCGCG GCAGACCAGG CGACGGTCGC GGAGATTTGT CATCTGCTGG AAGGCTCGCC GCTGGGCATC GAGCTGGCCG CCGCCTGGGC CGGTGAAATG TCGCTGGAAG GCATCGCTGA AGCCATCACC GCCACGCGCG ATTTCCTCGC CTCCACCAGT CCCGACATGC CCGACCGTCA CCGTAGCCTG CGCGCAGTCT TTGAAGGCTC CTGGCAATTG CTTTCTCCGG AAGAGCAGTT CGCGCTGATG CGGGTTTCCA TCTTTCGCGG CGGCTTTCAG GCCGAGGCCG TGCAGCACGT CGCCGGGGTG AGCGCGGCAA CGCTCAGCCG TCTGGTGCGC AAATCGTTGC TCTTCCTGGA TGAGCCGTGC GGTCGTTACG GACTGCACGG CGACATCCGC TACTATGCGG CGGAGAAGCT GGCCGCGCAG 
CCGTCCACCG CGCAGGAAGT GGCCGCGCGT CACGCCGCGT ATTTTGCTGA CCTGGTGAAG CGACGAGAAC AGGCCCTGCG CGGACGCGCG CAGCAGGCGG TGCAGGCCGA GCTGGAACCC GAATGGCAGA ACGTGCTCGC CGCTTGGCAA TGGGCCATCG CCCACGGCGA CGAGGCGCTG CTCACCCACC TGACGCACGG GCTGTTTGCC TTCTGCGAAG CCAAATCCTG GTTCCGCGAA GGTGCGACCC TCTTCCAGCC CGCTTTAGAG CGGATGCGGG AAGCGGCCCG CGCCGACCTG GCTGCAGCGC GCCTCCTCCG CCGCCTGTTG GGACGGCAGG CTGTCTTTTG CCGACAACTC TCGCAGTACG CGCAAGCGCA TCACCTGATT GAAGAGGGCC TGGCTCTGCC GGGCCTGCCT GACGATGAGG AGCACGCTTT CCTGCTGTAT CAAAAGTCCT GGGTGGACTT TTTGCAGGCG CGGTACGTGC AGGCGCGCAC GTGGGCCGAG GCGAGCCTGG AGCGTTACCG CGCGCTGGGG CAGCCGGTGG GCATCGGCGA TAGCCTCTAT ATGCTCGGCT GGACGGCCTA CGAGTTGGGG GATTTTGCCG CCGCCGAGGC GCTCTGCCTG GAGGCGCGGG CGGTATGCGC GCAGGCCGAT TATGCCTGGG GAGTGCAGTA CGCCATCTAT GGGCTGGGGC TGGTGCGACG CGCGCAGGGG GACTATGCTG CCGCCCGCCG CTGTTTCGAG GAGAACATGA CGTTTTGCGA CGCCATCGGC TACCTGTGGG GCGTGGCACA GGCGCGCATC AATCTGGGGC TGGTGGCGCT GGCCCAGGAT AAAGTGGAAG ATGCCGAAAC GCACTTTCAG AAAAGTCTGC TCATCGGTGA GCAAATCGGC AATGAATGGG TCAACGCGCA GAGTCAGAAA GGCCTGAGCG CAGCGGCTTT GGCGCGCCGC GACCTGCCCA CCGCGCTGAC CTTGGCGGAG CGCAGCCTGG CGCTCTATCA AGCGATGCAG GATCGGGATG GGATGGCGGA TAGCCTGCTG CTGCTGAGCC AGATCGCGCT GGCAAGCGGT GATCTCCCCG CCGCCCACCG CGCCCTGACG GAAGCCGAGG GGTTGATCCA GGCTACGGAA AACGGCTTCC GCGCCGCCAG AGCGCTGGTG CAGCGGGCGG ACATCCTGCT GCGGGAAGGG GAAACTGCGC AGGCGCGGGC GCTGCTGGAG GAAACGCTGC GCCATCCGGA CTGCGAGGCG TCCATCCGTG CGTACGCCAC GGCGGCGCTG GCGCACAAAA TCAAAGGGGA CTGTCTATGC GAATCACGTA CATAG
|
Protein sequence | MSQLSLTLLG TLAITLDGQP VAGIESDKAR ALLVRLVLEP ERAFRREALS ALLWPEAAPA QAAQNLRQAL YSLRRALGQA FLLTTPQTVQ FNAADVTVDA LTWRRLWSET QTHRHRRRET CRPCLERLEQ AVPLYRGDLL VGFALKDSAE FDDWLTMERE RLHVQALEAF TLLANAAERR GDYPAAQEYV RRLLALEPWQ EVAHRHLMRL LALDGRRAAA LEQFEVCWRV LAEELGLEPT EETRALHARI RAGEPLSAAM PLPPAPPTDL PLQLTSFIGR EREVTLLSER LGNPAYRLIT LTGPGGVGKT RLALQLAATL AEQFADGVCW ISLSDAVTES NLILAIADAL HLRLSGAQAL RAQLLQALRH DRRDLLLVLD NFEQLLSVGG ATLVLDVLRA APRFTLLVTS RERLNLQAES VLPLEGLGCD LPAPDAPPSE AAQLFVERAG RARMDLSVGA ADQATVAEIC HLLEGSPLGI ELAAAWAGEM SLEGIAEAIT ATRDFLASTS PDMPDRHRSL RAVFEGSWQL LSPEEQFALM RVSIFRGGFQ AEAVQHVAGV SAATLSRLVR KSLLFLDEPC GRYGLHGDIR YYAAEKLAAQ PSTAQEVAAR HAAYFADLVK RREQALRGRA QQAVQAELEP EWQNVLAAWQ WAIAHGDEAL LTHLTHGLFA FCEAKSWFRE GATLFQPALE RMREAARADL AAARLLRRLL GRQAVFCRQL SQYAQAHHLI EEGLALPGLP DDEEHAFLLY QKSWVDFLQA RYVQARTWAE ASLERYRALG QPVGIGDSLY MLGWTAYELG DFAAAEALCL EARAVCAQAD YAWGVQYAIY GLGLVRRAQG DYAAARRCFE ENMTFCDAIG YLWGVAQARI NLGLVALAQD KVEDAETHFQ KSLLIGEQIG NEWVNAQSQK GLSAAALARR DLPTALTLAE RSLALYQAMQ DRDGMADSLL LLSQIALASG DLPAAHRALT EAEGLIQATE NGFRAARALV QRADILLREG ETAQARALLE ETLRHPDCEA SIRAYATAAL AHKIKGDCLC ESRT