Gene Cagg_2832 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2832 
Symbol 
ID7267538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3480210 
End bp3483647 
Gene Length3438 bp 
Protein Length1145 aa 
Translation table11 
GC content56% 
IMG OID643567653 
Producttranscriptional regulator, SARP family 
Protein accessionYP_002464130 
Protein GI219849697 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACAT GGCACGTTCA ACTTCTTGGC GGTTTCGACA TTCGACGCAA TGGTGTCTCG 
CTCAATAGGG CATTCCAAAC CGACAGCGCA CGAATCCTCT TTGCATGGCT GTGTTTTCAC
CAAAAGCAAG CCGTTCGCCG TGAAACCCTC GCTACGTTGC TCTGGTCAGA TAAGCCGCAG
AGCGCTGCAC AAAATGCGTT GCGTGTTACG CTCAGCCGTA TCCGGCAGGC CATAGGTTCA
ACCCTAAACA CGCTGCTTAC CGATCCTACT ACCGTCACTC TCAACCTCTC CAGCGACTGG
CACATTGATG CACTGGAGTT TCTCCGGGCC GTCACAACGG TTCGCAACCA TCCTCACCGC
TCTGTCGCCG GCTGCCCAAT CTGTCAAGCG CATCTGCACC GAGCTGCTAT GCTCTACCGA
GGCGCATTAC TGGCCGGAAT GACCCCTGAA AGTGAACCAT TGCTCGCATG GATAACCCAT
CAAAGTGAAA CGATGCATCG GGCCGCCATC GAGGTGGATG GACATCTGGC CGAATATGCC
CTGCGTGTAG GTGATTGGAG AGCTGCGCAG GACTACGCTG GCCGGCAATT ACAGCTTGAA
CCATGGTATG AAGCGGCACA TCGTCAACTG ATGGTTGCCC TCGCCCGGCA AGGGCAGCGC
ACGGCGGCGC TAGCACAGTA TCAGAACTGC TACGATCTTT TGCAGCACGA ATTCGGCATC
GAACCAGAGG ACGAGACCAA ACAGCTCGCG GAATCCATTC GTACCGGCCA AATCGCTCAT
TTTATTGAGC CGATTCTACC GTCACAACAC CAGGCCGATG CTATCAATCG ATTGCCACTC
ATCGGACGAG AACGGGAGGT CGCCACCCTG ATCGACTGGC TGAATCAAGC CGGTCTCCAA
CTGATTACCA TCACTGGGGC CGGTGGGATC GGCAAAACCC GACTGGCGCT GCGGGCCGCT
CATCTGATGC AGTATGCATT TCGCGATGGC GCACGATATA TCTTTTTGCA TCCAGAGGAT
GGTGCGGCGA TGACCAATGA CGATGCAACC GCGTCATTGG CCCGCGCTAT CGCCATTGCC
TGTCATATCG AAGTCAATGA CCAGTATCCG CTGCCGGCGC AGATCATTGC TGCATTACAA
TCACGCGCCT ATCTGCTGGT CTGCGACAGT TTTGAGCACG TCGCCGCTGC CAATTTATTT
GTCAACGAAC TGCTCACTGC CGCACCCGAC TGCGTCGTGT TGGTGACATC CCGCCAGCCA
CTGCACCTAC GCCGCGAGCA GATACTTAAG CTTGGGCCAC TCTCTACCAA AACTACTACG
GCAGAACCGA GTCCGAGTGC GCAGATGTTT ATCGCATTAG CCAAACGCAG CGGCGTCACC
GGCGAAGGTC AGTTGGCGCT AGCCGATATT GAACATATCT GCGCCGATCT CGATGGGATG
CCGTTGGGAA TTGAGCTGGC CGCGGCTGCG CTGCACACCA TGAATTTGCC CGAATTGCGG
CACACAATGC AGCAGCGCAT CCAGACGTTG CACAACCCTC TCACCGATGC ACCGGCGCGT
CACCGCAGCC TCTCTGCCAT CCTGGCTTCG ACATGGCAAA CCTTGACGGC AACCAGTCGG
CAGGCACTGG CGATGTTGAG TGTAGTGCAT GCGCCATGCC CGCTGGCAGC AGCACAAGCG
ATTATCGGTA GCGAGGAGGC ATTAGCCGAA CTATTCGATC TCGCCTTGGC CCGTCGGCTC
GATGATGGAA CGGTGTGGCT CCACGAGCAC GTGCGCCAAT GGGCCGGCGA ACGGCTGCTG
AGCAATTTTG ATCCCACCAT TGCCGATACA GCCCATCGTC GCCACGCCGA ATGGTTTCTG
GATTGGCTGG CCGGCGCATA TCACGCTATG CAAGGCACCG ATAGCTTTGT TATCCGTGAG
CGACTCCTCG CCAGTTCGAA GGACATTGAA GCCGCATGGT ACTGGGCGTT AATCCATGGG
GCGTGGGATC GAGTCAGCGC TGCCGTACCC GCCTACGAAG TTCTGTTTTA CCTCAGTGGA
CGGTTCTTTG ATGGTCTTGA GCGATTTCAG CAGAGTCTGG CTTATGTCAG TCAGCCTGAC
CAACCGGAAA CCAAGCGGCT ACGAGCGAGA TTGCTGATCG GTCAAGCGAC TTTGCAACGC
TTACGCAGCA GTGGACCGGC TAGTGAGGCG ATGATTCAGG AAGCAGTTAC CCTGGCCGAG
CAATTGGCCG ACCAACAACT CCTGGCGAAC GCACTCTTAC GGCTTGGTAC GCTCCAGAAC
ACCGGAAGAT ATAACGCCAG CGGTCGTGCC ACACTCGACC GCCTCCGTGA AGTGCTCGAT
CAGCAGATTA CCATGCCGCT GAACGAACTG TACGTGATGG AAAGCGCATA CTGGCGGCTG
ATGAGTTGGC TTGAACTTCA CGCTGGCCGG AACGAGATGG CCATCGCGTA TGCCCAGCAG
GCGCGGGAAC TGGCCGAACA AGCTCATCAG TACGTCATGA TCGCTCATTG CTACGAGTCC
CTCAGCGCGG TCTTCAGTAC GATAGGCAAT TTTGCCGACT CCGAGCACTA TCTGCAACAG
GCACTGACCA TCTATCAGCA GCTTCGTTTG ACGTATCATC AAACCAATGT GCTTGATTTA
CTGGCGCAAA ACGCTGATGC ACGTGGCGAT TACGAACAAG CTCAACGTTA CTATTGGCAA
GAATTGGTGC TGGCACGTGA ATGCGGTAAT CTTGATGCTG AATTAGTAGC GCACATCAAT
CTCGGTATTT CTTACGACCA GATGGGGCAC TACGAACAAG CACTCTCCCA CACGCAGATT
GCAATGGCGC TTTGCGACAA AGTGGGTAAT ACCAAACATC ACACGGTGAT ACTGGCCAAT
TTAAGCTTGC ACGCGCACCA CAACCATCGC CACGAGCTAG CTCTTGCCTA CGCTCGCAGC
GCCACCGAAC AAGCCGCAAA TCTTCCCGAT CTGCAAGCTT ACGGGTACGA TTTTCAAGGA
CATGCCTTAC TTGCGCTGGG CCGTCTCGAT GAGGCTGAGC AGGCCTACCA TCAAGCCAAA
ACGATTCGCC AACAGATCAA CTACCCTGTA CTCGTGCTCG AATCGCAGGC AGGTCTGACC
CGTGTCGCGC TAACCCGCAA CGATGCCGCA GAGGCATTAC GTCGCGCTAC CCCGATTGTC
GAACACCTGC TGGCCGGCGG CCATCTCTAC GGTACCGAAG AGACGCTGCG GATCTATTGG
ACAGCGTATC AGGTGTTGGC AGCCAATCAG GACCCACGGT CCGAGGCTGT TCTCGAGCTG
GCGCGCAACG TGGTGCGTGA ACGGGCCAAC CGGTTAAGCG ATCAGACCAA CCGTACAATC
TTTTTGAACG CTGAGTTCAA CCGTCGAATT ATGACTGCCG GCAAACCAGA GGCTGTATAT
CATCCGTCGG CAGCCTGA
 
Protein sequence
MTTWHVQLLG GFDIRRNGVS LNRAFQTDSA RILFAWLCFH QKQAVRRETL ATLLWSDKPQ 
SAAQNALRVT LSRIRQAIGS TLNTLLTDPT TVTLNLSSDW HIDALEFLRA VTTVRNHPHR
SVAGCPICQA HLHRAAMLYR GALLAGMTPE SEPLLAWITH QSETMHRAAI EVDGHLAEYA
LRVGDWRAAQ DYAGRQLQLE PWYEAAHRQL MVALARQGQR TAALAQYQNC YDLLQHEFGI
EPEDETKQLA ESIRTGQIAH FIEPILPSQH QADAINRLPL IGREREVATL IDWLNQAGLQ
LITITGAGGI GKTRLALRAA HLMQYAFRDG ARYIFLHPED GAAMTNDDAT ASLARAIAIA
CHIEVNDQYP LPAQIIAALQ SRAYLLVCDS FEHVAAANLF VNELLTAAPD CVVLVTSRQP
LHLRREQILK LGPLSTKTTT AEPSPSAQMF IALAKRSGVT GEGQLALADI EHICADLDGM
PLGIELAAAA LHTMNLPELR HTMQQRIQTL HNPLTDAPAR HRSLSAILAS TWQTLTATSR
QALAMLSVVH APCPLAAAQA IIGSEEALAE LFDLALARRL DDGTVWLHEH VRQWAGERLL
SNFDPTIADT AHRRHAEWFL DWLAGAYHAM QGTDSFVIRE RLLASSKDIE AAWYWALIHG
AWDRVSAAVP AYEVLFYLSG RFFDGLERFQ QSLAYVSQPD QPETKRLRAR LLIGQATLQR
LRSSGPASEA MIQEAVTLAE QLADQQLLAN ALLRLGTLQN TGRYNASGRA TLDRLREVLD
QQITMPLNEL YVMESAYWRL MSWLELHAGR NEMAIAYAQQ ARELAEQAHQ YVMIAHCYES
LSAVFSTIGN FADSEHYLQQ ALTIYQQLRL TYHQTNVLDL LAQNADARGD YEQAQRYYWQ
ELVLARECGN LDAELVAHIN LGISYDQMGH YEQALSHTQI AMALCDKVGN TKHHTVILAN
LSLHAHHNHR HELALAYARS ATEQAANLPD LQAYGYDFQG HALLALGRLD EAEQAYHQAK
TIRQQINYPV LVLESQAGLT RVALTRNDAA EALRRATPIV EHLLAGGHLY GTEETLRIYW
TAYQVLAANQ DPRSEAVLEL ARNVVRERAN RLSDQTNRTI FLNAEFNRRI MTAGKPEAVY
HPSAA