Gene Information |
Locus tag | Cagg_1953 |
Symbol | |
ID | 7268869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 2384687 |
End bp | 2387938 |
Gene Length | 3252 bp |
Protein Length | 1083 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643566791 |
Product | transcriptional activator domain protein |
Protein accession | YP_002463284 |
Protein GI | 219848851 |
COG category | [K] Transcription |
COG ID | [COG2909] ATP-dependent transcriptional regulator |
TIGRFAM ID | |
|
|
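The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2384687, 2387938)
print(length, protein_length_aa(length))  # 3252 1083
```

Under these assumptions the result agrees with the Gene Length (3252 bp) and Protein Length (1083 aa) fields.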
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00247822 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGCAACGTA GTACGTTGCT CGCGAAACTG ATCCCTACTA TCCTCATGGT ATCACAAGCG GTTACAGGTG AGTATAACGA ACATCTGTTG CGGCCTCGTC TATTGCCACC CCCACCACCG TTGCACCGCA TCCGTCGGGT GCGGGTTGAG CAGCGGCTGG CGGTGGCTGT CGATGTCCCA TTGACTATTG TAGTAGCACC TCCCGGTAGT GGCAAGACGG TTGCCTTAGC CGCTCTCGCG ACCCATGGTG GCTGGCCGGC TGCATGGTGT CGTGCCGATA CCAACGATGA TCCATCGCGC CTGCTCGCCC ACCTTGCTGC GGCTCTGAGT CGGGTAACAT CCCTCGACCC CAATCGTCTC CCACGAACGA TTGATGGTTT GATTAACGCG CTAACCGGCG AGCTTGATGA TGAAACGTTG TTGATTATCG ATGATGCGCA TCTGATCGAT GAGCAGCCAG AACTACGTGC CCTTATTGAA CGGTTTATCG TGGCTCAGCC TCCAAGATTG CATCTGGTTC TCGCCGGACG GCGCGAACCC AATTCGCCAC TTATCGCGAC GGCTCGTTTG CGTGGTGAAG CGTTGCTGAT CGAAGCCGCC GATTTGCTTT TTACATCCGA TGAAGCAGCT CAGCTTTGGC AGCAGGCCGG CAAAACGCTG CCGGTTGATT TCGATGAGTT AATGACATTC AGCGCCGGAT GGCCACTCGC CCTACGGATT GCACTCGATG CGACCGATTG GCGGAGTGCG CTTGGGCTGC GTGAACAGAA GGGTAAGCCG GCCCTTGACG ATTATCTGGA ACGGGAAGTG TTCGCGCTGC TGCCCGAGCC GTTACAACGT TGCCTGCAAC GCAGCGCAGG TCTGCGCTGG ATCACCCCCG ATCTGTGTGC GGCCCTCGAT CCAACCTTCA ACGCATCTGA ACTCATTGCC GAATTACGTC GTCGCCGCCT CTTTGTCGAA CCGTTTGGTG AGCATGGTGT CCGGTTTCAG CCGCTGATCG CGGCGTGGCT GTCCCGTCGA GCGGCTGCCG ATCCCGAATG GGTGCATCTG CACCGCCAGG CTGCGGCATA CTTTCAGCAG CACGGTGATC ACGAAAGTGC GTTGTACCAT CAGATTGCTG CCGGTGATCC TGCCGCAGTG ACCACCTTCC TCCCACTGGC CCGCACGCTG CTGGCCGAGC GTCGGGCCGA AGCTGTGCTC GATTGGATGC GACGGTTAGC GTTGACCGGC GATGAAACCC CGGAATTGAT CGAGCTTCAA GCCGCTGCAC TTCACCAGCT CAATCGGCTT GAAGCTGCCT TGGCTACCTA CCGCCGGGCC GAGCGGGCTT TTGCCGCGAG CGGTGATCGA CTTGGTCAGT CGCGTTGTCT ACGTGGGCAG GCTGCAATCT ACCTTGATAC CGTTCAACCG GCTCCGGCCA CCGATTTGTT ACGCCGGGCT TTGAAGTTGT TGCCACGTCA GTGTGGTGCG GAACGGATCG AGTTGTTGCT GATGCAGGCC GAGAATTGGG CTAATCGCGG GCGAGCCGAT ATCGGGTTGC GACTTGAGCA GACGGCGCGT CAGTTGGCGA AAGCCCACGG GCTGTCCGCG CAGTACGATC CGCAGGCTGA AGTGCTCCGG CCACGCTTGC TGTTGCGGGC CGGACGTCTG CGGGAAGCAC GGCAGTTGCT CGAAGAACGG CTTTGGTCCG AACAGTCATG TACGCGAGCA ACAGGCCATC GTGAACCGCT TTTACTGCTG GCGCTGATCA ACGCAATGCT TGGCGTGGGG CCACAAGCGC TAGCGTTTGC CCATCGAATG 
TTGATCGAAG CGCAACAGAG CGGCAATTTG GTTACGGAAG CCCTGGCCGA GTTGCGGCTT GGTCATGCCT ATCAGCTTAT CGCTCGTGGT GATGATCAAG CGGCGCTCCA ACACTATAGC CGTGGTTTGC GTCTGTTGCA GCAGGTGACA GTGCCGCGAA CTCGTGCCGA GGGGTATCTG GGCCTGACGC TCCTTCATGG TCATGCCGGC GATTTGGCCC GTGCCGAGGC CGATGCTCGC GAGGGTCTGT TGCTGGCCGC TTCTGCCGGC GATGAGTGGG TCGCTGCCCT GATGTTGCTC GCCCTCGGCA GTGTTACCCT TGTAGCCGAT GATCCGCGCG GTTACGAGTG GCTCGATCAG GCCGAACAGC GATTTCAAGC CGGTCGTGAT ACCTTTGGCT TGTTTCTGGT GCATCTCTGG CGAGCATTGG CAGCCTTGCG CGTCGGTCGG ACTGCTGCGG TCGATCCGTT GGTCGATCAG GTCATGCACG AAGCGGTGGA GTATGGCTAC GAGCATGTCT TGATCGGACC GAGTCTGTTT GGGCCGCGCG ATATTGCGGC GTTGGTGCCT TTGCTTCTAC GCGCCCGCAC GATGCCGGCC CACCGCGATA CTGCTATTCG GCTGCTGCGG CAGGGTTTTC CCTCGATCGC CACCGATGAC ACCGTTGATG ATTACCATCC CGGCTTTACA CTACGGGTGC AAATGTTGGG AGCATTCCGG GTTTGGCGAG GTAATCAAGA GATTCAAGCT CGTGAGTGGC AGCGTGAAAA GGCGCGTCAG CTTTTCCAGT TGCTGCTGAC GATGCGTGGT AATTGGGTAC AACGCGAGCA GATCTGCGCA TGGCTGTGGC CTGATGCCGA TCTCGAAGCT GCCGAACGCC AGTTTAAGGT AACTTTGAAC ACGCTTAACG CAGCGCTTGA GCCACACCGT CCGCCGCGCG TACCCCCGTT CTTTATCCGT CGGCAGGGGT TGGCGTATAG CTTTGCCCCT TCCTTCGGCG TATGGATCGA TGTTGACGAA TTTGAACTGC GTGCCAGTAG TGCATTGACG GCAACCGACC CCGATTTTGC GCGCCGCAGC GCCCAAGCTG CGCTCCAACT CTACCGCGGT GATTATCTCG CCGAATCGCT GTATGATCCG TGGACAACCG AAGAGCGTGA GCGTTTGTTG GCCCGTTATC TGGCAACGGC TGTTGCCTAT GCCGAACGAC TGAGTGCCGA AGGAAAGCAC AACGAGGCGA TCCAGATCGC CGAACAGGTG TTGCGTCGCG ATCGCTGTTA CGAGGAAGCC TACCAATTGC TGATGCGTGC CCATGCCCGC GCCGGCAGCC GTTCGCAGGC GATGCGCGCC TATACACGTT GTGTCCAGGC GTTACGTGAG GAATTAGGGA TTGAGCCGTT GGCTGAAACG GAAGCGCTCT ACCTGCGTAT CCGCTTGAAT GAGCCGATTT GA
|
Protein sequence | MQRSTLLAKL IPTILMVSQA VTGEYNEHLL RPRLLPPPPP LHRIRRVRVE QRLAVAVDVP LTIVVAPPGS GKTVALAALA THGGWPAAWC RADTNDDPSR LLAHLAAALS RVTSLDPNRL PRTIDGLINA LTGELDDETL LIIDDAHLID EQPELRALIE RFIVAQPPRL HLVLAGRREP NSPLIATARL RGEALLIEAA DLLFTSDEAA QLWQQAGKTL PVDFDELMTF SAGWPLALRI ALDATDWRSA LGLREQKGKP ALDDYLEREV FALLPEPLQR CLQRSAGLRW ITPDLCAALD PTFNASELIA ELRRRRLFVE PFGEHGVRFQ PLIAAWLSRR AAADPEWVHL HRQAAAYFQQ HGDHESALYH QIAAGDPAAV TTFLPLARTL LAERRAEAVL DWMRRLALTG DETPELIELQ AAALHQLNRL EAALATYRRA ERAFAASGDR LGQSRCLRGQ AAIYLDTVQP APATDLLRRA LKLLPRQCGA ERIELLLMQA ENWANRGRAD IGLRLEQTAR QLAKAHGLSA QYDPQAEVLR PRLLLRAGRL REARQLLEER LWSEQSCTRA TGHREPLLLL ALINAMLGVG PQALAFAHRM LIEAQQSGNL VTEALAELRL GHAYQLIARG DDQAALQHYS RGLRLLQQVT VPRTRAEGYL GLTLLHGHAG DLARAEADAR EGLLLAASAG DEWVAALMLL ALGSVTLVAD DPRGYEWLDQ AEQRFQAGRD TFGLFLVHLW RALAALRVGR TAAVDPLVDQ VMHEAVEYGY EHVLIGPSLF GPRDIAALVP LLLRARTMPA HRDTAIRLLR QGFPSIATDD TVDDYHPGFT LRVQMLGAFR VWRGNQEIQA REWQREKARQ LFQLLLTMRG NWVQREQICA WLWPDADLEA AERQFKVTLN TLNAALEPHR PPRVPPFFIR RQGLAYSFAP SFGVWIDVDE FELRASSALT ATDPDFARRS AQAALQLYRG DYLAESLYDP WTTEERERLL ARYLATAVAY AERLSAEGKH NEAIQIAEQV LRRDRCYEEA YQLLMRAHAR AGSRSQAMRA YTRCVQALRE ELGIEPLAET EALYLRIRLN EPI
|
| |
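The gene and protein sequences above are printed in space-separated blocks of ten characters, and the gene starts with TTG while the protein starts with M. A minimal sketch of how such a record could be cleaned, translated, and checked for GC content follows. It assumes the standard codon assignments (which translation table 11 shares with table 1), treats only ATG, GTG, and TTG as start codons for simplicity, and uses hypothetical helper names.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: only the most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s) if s else 0.0


def translate_cds(seq: str) -> str:
    """Translate a CDS; a recognized start codon in first position becomes M."""
    s = clean(seq)
    codons = [s[i:i + 3] for i in range(0, len(s) - len(s) % 3, 3)]
    protein = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE.get(codon, "X")  # X marks an unknown codon
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks and the last blocks of the gene sequence in this record.
print(translate_cds("TTGCAACGTA GTACGTTGCT CGCGAAACTG"))  # MQRSTLLAKL
print(translate_cds("GAGCCGATTT GA"))                     # EPI
```

The two fragments reproduce the first ten and last three residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.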