Gene Cagg_2054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2054 
Symbol 
ID7269213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2514492 
End bp2518097 
Gene Length3606 bp 
Protein Length1201 aa 
Translation table11 
GC content56% 
IMG OID643566889 
Producttwo component transcriptional regulator, AraC family 
Protein accessionYP_002463378 
Protein GI219848945 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators
[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.695732 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACACCA TTGCTGCAAC GAAGCAACAC CGACGACGCC CAACGGTGGG AGCCATCGCT 
GGCTGGCAGT TTTATGGCAC TATGCTGACG ATGAGCTACC TTAGCCCGAT TTATCAGGGT
ATTCGTCAGG CTGCGTATGA TCTCGGCTGT AACCTCTTGT TAGCCTGCGG TATGGGTCCG
TCCGGTCAGA TCAACGAGCC ACTACGTCCG GCTTGGTTCG ATGTGATCGA AGATACCGAT
TTCGTACCGG TGGGGCCATG GAACACCCAC GGACTGATTA TCATCAACCC CCTGCAACGC
CCCGAACGGT CGCAGGCGGT ACAGGCTATC CGTGCATCCG GTCATCCGGT GATCTTTGTG
GGGTCGGGGG AACGAGGACC AACCATTGTC GCCGACAACG TCGGTGGGGT CTACGCAGCA
CTGCGCCATA TCGTAGAACA TGGTCATCGC CAGATTGCCT TTATTGCCGG TAGTCAGACC
GATCTCGAAG GTGATTCTGG TGATCGATTA CGTGCTTTTC AGGCGGCGAT GATCGAGTTT
GGCTTGCCAA TCGATCCGCG GCTGATCGTG TTTGGTCGGC ATGTCTTTAC CGGTGGTTAT
GAAGCGATGC GCCGGTTATT GGCGACCGGC GTACCGTTCA CCGCGGTGTT GGTGAGTAAT
GACGAGTCGG CAATTGGAGC CATCACAGCT CTGCGTGAAG CCGGTAAGCG AGTGCCGCAC
GATATTGCGA TCATCGGCTT TGATGATCGG ATCGAGGGAT TGTTGCAAGA ACCGGCCCTA
ACAACGGTAC GGATCCCTCT CTTCAAATTA GGCTATCAGG CGCTTGAGCT GATGGTGCGC
CATATACGTG AGCGTGAACC GTTGCCCGAA TTGGTGCAGG TACCTACGCG CTTGATCGTC
CGTGAGTCAT GTGGGTGCAG CCAGAGTGCT GTATTGGCCG ACACACTGGA GATGATTGCC
CACCACCCAA CTGAACCGAC TGTGGCCAAC TTGCCCGTAC AAGCCGCCCA AGCGATGAGC
ACGATGCTGG CGCCTAAAAT GCATGGGTTA ACAGCCAGCG AAGTGTCTCG ACTTTGCCAA
CAGCTTATCA CGGCATTCAG GCAGAGTGTG CAATCTGCTC GTCCGGTTGC CTTTCGTACC
GAGCTGGCAC AGGTGCTCAA GCAAGTCGCT GCTGCCGGCG ATGACACCCA CGATTGGCAA
ATTGCTATTT CCTTCTTGCG CGATCTTGTG CCGCGGATGT TCGATCCGAC GTTGCAACCG
CTTGCCTTGG AGTTACTCGA CGAAGCTCGT GTGACGACAA GTGCGGCGAT GCGTTGGCAG
TATTGGCAGT ATTTCAATCG GCAACAACAG ACCAACAATC GGGTAGGGCG ATTGACCGCA
CGCTTGTTAC ATACGCTGGC TGAAAGTGAT ATTTACGAGA TTTTGGCTCA CCATCTCCCC
GAACTCAATA TTCCCTTGCT CTGGATTGGC TTCTTTGAGG CCGAAAACGG TGATCCGGTC
GCGTGGTGTC GGCTGCGCGC CGTGACGGCG CCACAGCAGC CCGTTGTGCG CATTCGCAGC
CGGACCTTTC CGCCTCGGCA GTGGTTACCG ACTCGCCAAT CGTTCCAGCT TGCCTTGATC
CCACTTGGTG GGACGGGTAG TGAGGCGGGT TTTGTGACGT TTGATGCATC ACGCCTTGAA
CTCTACGGCA CCATTACCCA ACAGATCAAT GCCGCTCTCA ACACTGCTCG GCTGTACCGA
GCGGCGAGTG AGGGGCAGCG GCTGGCCGAA GAGGCTAATC AACTCAAAAG TCGCTTTCTG
TCAATGGTAA GCCACGAATT GCAGACACCG CTGAATCTGA TCGTCGGAAT GAGTGGTCTC
CTTCTGCGCG AGATTGCTCA GAGCGGAGAC CCATTACCGT CATCGATCCG TGATGATCTC
AGGCGCATCT ATGCTAGTGC TCGACATCTC GGTCGGTTGA TTAGCGATGT GCTCGATTTG
GCAAGCAGCG ATGCCGGCCA ATTGCGGCTG AACTGTGAGG TAGTTGATCT CGGTGAAGTG
ATGCGGGTAG TGGCTGATGC CGGTCGGCAA ATGGCTGCCG ATAAACAGTT GACGTGGTAC
GACTCGATTC CTGCCGAAGG GCCGTGGGTT TGGGGTGATC GGACCCGCCT CCAACAGATC
GGGCTTAATC TCGTTGTCAA TGCGATAAAG TTCACTGCTC GCGGTCGTGT TGGCCTGATT
GTTACCCCTG AAACCGATGC AGTTACCGTT ACTGTGCGTG ATACCGGCAT TGGCTTGCCG
CCTACCGAAC AGGCGCATAT CTTTGAGGAG TTTCAGCGTT CAGAACGGAG TGTCAGTCAG
GGTTATGGTG GGATCGGCCT GGGGTTGGCA ATCTGTAAAC GGTTGGTCGC GATGCACGGT
GGGGAGATTG GAGTGCGTTC GCGTGGTATT GAGGGTGAAG GGGCAGAGTT TTTCTTCCGC
TTGCCGACTA TCACTGCCCC CACACCACGT CGGCGACACA ATCCACCGGC GTTGCCTGTC
CAGCCGCGTG TGTTGCTCCT TTCTGCCGAC CACGATGAAC CGTTGCAGCG TTACCTCGAA
CAGCGAGGCT TTATCGTGAC CGTCTTTTCC ATTGACGAGA GTGCTGCATG GTTGAATGAA
CTGCTTCACC GTAGCTACAG TGCGGTCGTT TTGCATGCCA CCCAAGGTGA GCGGGCATGG
TGGCAGACGA TCCAAGTGTT GAAAGCCAAC CCCACGACAC GCGATTTGCC ACTCTTGTGT
TACGCCATGA ACGAGCAGCA CGGTGCGGTG ATGGAGTTTA ACTACCTGAC CAAGCCGATT
GAGTTGGCCG ATTTGTCGCG TGCGCTCGAT CAGTACTGGC GAGTTACCGG TACCATCGGT
ACGCCCCAGA CCATCCTCGT GGTTGATGAC GATCCCGATA CGCTCGATCT GCATGCGCGC
CTGATACAGG CCCACGGTAT CGCAAAACTG GTCTTGCGGG CACGTTCGGG GCGCGAAGCG
CTGGAATTGA TGGAACAGCA GCGCGTCGAT TTGGTGTTGC TTGATCTGAT GATGCCGGAG
ATGGACGGGT TTGACGTGCT GGCGGCAATG CGGAACCATA AGCAGATGCG GGAGATTCCG
GTAATCGTGA TTACCGGACG TGTCCTCAGC GAAGAGGATA TGGCTCGTCT CAATCAAGGG
GTAACTGCGG TACTGAGCAA AGGTGTGTTC AGCGCGCACG AGACTCTCGC CCGCCTGCAA
GCTGCGCTCG AACGTCGCCG ACGGCTTAGC GATCAGGCGC AGACACTGGT ACGTAAGGCC
ATGGCGTATA TTCACAGCCA TTACGACCAC CCCCTCACAC GGCAAGATAT TGCGCGTTAC
GTCGGCATGA GTGAAGACTA TCTTACCCAC TGTTTTCGGC AAGAACTCGG CACCACGCCG
GTTGATTATC TCAACCGCTA TCGGGTGTTG CAGGCACGCC GACTGCTGCT CGAGAGCGAT
AAGAGCATTA CCAATATTGC GCTAGAAGTT GGCTTTTCGA GCAGCAGTTA TTTCAGTCGC
GTGTTTCGTA AGGAGACCGG TCAATCACCG GAAGAATATC GCCGACAGGG GCGAAGTACG
ATCTGA
 
Protein sequence
MHTIAATKQH RRRPTVGAIA GWQFYGTMLT MSYLSPIYQG IRQAAYDLGC NLLLACGMGP 
SGQINEPLRP AWFDVIEDTD FVPVGPWNTH GLIIINPLQR PERSQAVQAI RASGHPVIFV
GSGERGPTIV ADNVGGVYAA LRHIVEHGHR QIAFIAGSQT DLEGDSGDRL RAFQAAMIEF
GLPIDPRLIV FGRHVFTGGY EAMRRLLATG VPFTAVLVSN DESAIGAITA LREAGKRVPH
DIAIIGFDDR IEGLLQEPAL TTVRIPLFKL GYQALELMVR HIREREPLPE LVQVPTRLIV
RESCGCSQSA VLADTLEMIA HHPTEPTVAN LPVQAAQAMS TMLAPKMHGL TASEVSRLCQ
QLITAFRQSV QSARPVAFRT ELAQVLKQVA AAGDDTHDWQ IAISFLRDLV PRMFDPTLQP
LALELLDEAR VTTSAAMRWQ YWQYFNRQQQ TNNRVGRLTA RLLHTLAESD IYEILAHHLP
ELNIPLLWIG FFEAENGDPV AWCRLRAVTA PQQPVVRIRS RTFPPRQWLP TRQSFQLALI
PLGGTGSEAG FVTFDASRLE LYGTITQQIN AALNTARLYR AASEGQRLAE EANQLKSRFL
SMVSHELQTP LNLIVGMSGL LLREIAQSGD PLPSSIRDDL RRIYASARHL GRLISDVLDL
ASSDAGQLRL NCEVVDLGEV MRVVADAGRQ MAADKQLTWY DSIPAEGPWV WGDRTRLQQI
GLNLVVNAIK FTARGRVGLI VTPETDAVTV TVRDTGIGLP PTEQAHIFEE FQRSERSVSQ
GYGGIGLGLA ICKRLVAMHG GEIGVRSRGI EGEGAEFFFR LPTITAPTPR RRHNPPALPV
QPRVLLLSAD HDEPLQRYLE QRGFIVTVFS IDESAAWLNE LLHRSYSAVV LHATQGERAW
WQTIQVLKAN PTTRDLPLLC YAMNEQHGAV MEFNYLTKPI ELADLSRALD QYWRVTGTIG
TPQTILVVDD DPDTLDLHAR LIQAHGIAKL VLRARSGREA LELMEQQRVD LVLLDLMMPE
MDGFDVLAAM RNHKQMREIP VIVITGRVLS EEDMARLNQG VTAVLSKGVF SAHETLARLQ
AALERRRRLS DQAQTLVRKA MAYIHSHYDH PLTRQDIARY VGMSEDYLTH CFRQELGTTP
VDYLNRYRVL QARRLLLESD KSITNIALEV GFSSSSYFSR VFRKETGQSP EEYRRQGRST
I