Gene Information |
Locus tag | Cagg_2962 |
Symbol | |
ID | 7266493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 3630756 |
End bp | 3633797 |
Gene Length | 3042 bp |
Protein Length | 1013 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643567784 |
Product | transcriptional activator domain protein |
Protein accession | YP_002464258 |
Protein GI | 219849825 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
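
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 3630756
END_BP = 3633797


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3042, matches "Gene Length"
print(protein_length(length_bp))  # 1013, matches "Protein Length"
```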
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.0000192637 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTACGG TTCAGCTTTT TGGTTCGCCG CTGGTATTGC GTAACCATCA GCCGGTCGTA TTGTCGCGCC GGCGCAGTCG GGCACTACTC TACTACCTTG CAGCACATAC GCAGCCGGTG CGGCGGGAGC AGGTGTTGAC CTTGTTGTGG CCCGACCACG AACGCAGCTC TGCCCAACAA CTGCTGCGTT CAACACTCTA CACCGTTCGG CAAGTTCTCG GTGAGGCGCT AGTTGCCGAT AATGAGCAAT TGGCGCTGAA GGCAACGGTT GATCTGCGCA CGCTGCAAGC CGTACTTGCC GATCCACACG CAACGGTACA GATGTTAACG TCGGCATTGC CAACCCATGC CGTTGAGGTG TTAGCCGGGT TTGATCTGCC CGACAGTGAG CCGTTTCAGT CGTGGCTCAC CGGTGTGCGT GAACAAGCGC GGGTGCTGGT TGGGCGCGGT TGGTTGCGGC TGGCCCATTT GTATGAACAG CACGGCGATT TGGCGGCTGC ACTGCATGCC CTTGATGTGG CGTTGAGCAT CGATCCACTA CAAGAGGATG TGCAGCGTGA TGCGATGCGT TTGGCGTATC TGAACGGTGA TCGGGTAGGG GCTATCCGTC GGTTTGAGCA GTTTCGCGAT CTGCTTGACC AAGAGTTGGG TGTGCCACCG ATGCGCGAAA CGCAGGCGCT GTACGATGCG ATTATTACCG ATACTCTTCC CGGTACGTCG GCTCCGGTGC GTCTGAGTGC GGTATCGCTG AACGGTGTGC TGCCCTTTAT TGGCCGTGAT ACCGAACTGT TATTGTTACA AACAGAAGCC CAGCCCGGTC GTTTACTGTT GATTGAAGGT GTGTCCGGGA TTGGTAAGAC GCGACTCGCC GAGGAGTTTC TTGCCCGTTG CGGTGGGCTG GTGTTGAGCG GGAGTGCCCG TGAACTTGAG CAAAATTTGC CCTACCATCC TATCCGCACT GCTCTTGCTG CGTTTGTAGC TCGGCCTGAA TGGCCTGCGC TGCACAGTCG ATTGCTGTTG GCGCCGATCT GGTGGCATGA ACTGGGGCGG TTGGTGCCCG ATCTGGCGCT ACCGCCGACC ACACCACCCG ATGAAGCGCG GTTGTGGGAA GCGATTGCCC GCTTCTTAAA CGAGTTGGCC CGTCAGCGAC CGGTCTACCT CCTGATCGAT GATCTTCAGT GGGCTGATAC GAGTACCTTA GGTTTGCTCG GCTATCTGTT GCGGCGTAGT ACCGATGTCG CGTTAACGGT TATCGCTACG ACTCGCCCTC CCGAACCGCG TAGCGCTCTT GCTCATCTCA TCCGTGCTTT GATCCGGGAA GAGCGCCTCG TGCGTGTCGT ACTGGAACGG CTGGCACCGG CTGCAATTCT GACGATTGCC CGCCAGCTAA GCCCGGCCTT TGCCCACCCT CTCGCCGATT GGTTGGAGCG TAATGCAGAA GGGAATCCGT ACATCGTTTC CGAACTGGTG CGCCACGCGC GTACCGCCGG TTGGCTGACA CCCAATGGTA TGGTCGATCT AAGTGCATTC TCGGCTCAAT CGGTGGTTCC GCAGAGTGTG TATCATCTCA TCGAAGCGCG CCTCAGTCGA TTGTCAGGTG CGGCCCGGCG CGTGCTCGAT GCGGCGGTTG TCGTAGGTCG TGATTTCGAC TTTGCTGTTG TCACGAAAGT TGCCGCGCTT TCCGAAGAGG CAACGCTAGA CGCCCTCGAT GAATTGCTGG CCGCTCGTTT GGTGTTGCCG CTCAGCGATG GTCAGTTCCG TTTCGATCAT CCCTTGACTA TGGAAGTGGT CTATCGGAAT 
CTCGGCGATC TGCGTCACCG TATGCTTCAC CGGCGCGTAG GGGAAGCTTT GGAGGCGTTG TATCGCGATC GGATTGATGA GGTAGCCGGG CAAATCGTGT TGCACTTTAT TGAGGGTGGT TGGTCGGAAC GGGCTGCCGG CTATGCGTTG CGGGCCGCCG AACGAGCCGC TGCGTTGGCA GCGTGGCCGG AAGCGGCAGA GTTTTATCTC AAAGCTTTAG ACGGTGTGTT GCCGCCTCAA CGTTCAACCG TCTTGCGTAA ACTCGGCTAT GCTCGGCTAT ACGCCGGTGC AGCGGGGTTG GCTGCCGAAG CCTTTCGTGA AGCGATGGGT GTGGCGTCAA CCCTGCATGA AGCCGTTGTC GCGCGATTAG CACTTGCCCA TGCCCTGATC CCGCAGGGTC GCTACGCCGA TGTGATTGCG TTGGTGAGCC AGATCGATCC TGATATTGAT CCACGTCAAC AGGCAGAAGC TCTTTTTCTC TGGGGGACGG CGCTCTCGTT GGAAGGTGCC GATCTCGGTG CTGCCACCGA ACGGTTAATG ATGGCCGAAG CCGTGTTACG TGCTTGCCCG ACGGCTGAAG CAGCCGCTCT GGCGCAAGTC TCGTTTGAGC TAGGCAATAT TGCCGCCCAG CAGGGCGATC TCACTACCGC CATTGCCCGC TACGAGATGG CCCAACAGAT CGCCGACGAA GCCGGTGATA GTGCGCTGAC GTGGCGAGTA TTGGCTCGCA ATAATCAGGC GTATCATCGT CTGCTGCTCG GTGAAATCTC GGCTGCCGAG ACTATAATCA CTGCCGCATT GAGCCTGGCT GAAGAAGGTG GTTTGCTCTC GGTTATGCCA TACCTTCTCT CGACCGCCGG TGAAGTAGCG TTGGCGCAAG GTGATCTGGA GACGGCTATC GCTCGCTTCC AACAGGGGTT AGCGCTCGCC ACTCAATTCA GCATTCCTGA ACGGGTAGCC GGTATCACGG CCAATCTTGG CTTAGTCGCG TTGCAACGTG GTGATACGAC CTGTGCTGTG CATTACCTCT CGACGGCGTT AGCCCAAGCC GATACGCTTG GTGCCCGTCA TCTGGCTGCC CAGATTCGGG TTTGGCTGGC ACCACTGTTA CCACCGCCTG AAGCGCGGAT ACGGCTGGCC GAAGCCCAAG CAATTGCCGA GGCGGGCGGC AGACAACGCC TGCTGGCGGC AATTGCCCAA GTTCGAGCAG CGTTGTCCGA TCAAGCGACT GTTCCGCGTT GA
|
Protein sequence | MITVQLFGSP LVLRNHQPVV LSRRRSRALL YYLAAHTQPV RREQVLTLLW PDHERSSAQQ LLRSTLYTVR QVLGEALVAD NEQLALKATV DLRTLQAVLA DPHATVQMLT SALPTHAVEV LAGFDLPDSE PFQSWLTGVR EQARVLVGRG WLRLAHLYEQ HGDLAAALHA LDVALSIDPL QEDVQRDAMR LAYLNGDRVG AIRRFEQFRD LLDQELGVPP MRETQALYDA IITDTLPGTS APVRLSAVSL NGVLPFIGRD TELLLLQTEA QPGRLLLIEG VSGIGKTRLA EEFLARCGGL VLSGSARELE QNLPYHPIRT ALAAFVARPE WPALHSRLLL APIWWHELGR LVPDLALPPT TPPDEARLWE AIARFLNELA RQRPVYLLID DLQWADTSTL GLLGYLLRRS TDVALTVIAT TRPPEPRSAL AHLIRALIRE ERLVRVVLER LAPAAILTIA RQLSPAFAHP LADWLERNAE GNPYIVSELV RHARTAGWLT PNGMVDLSAF SAQSVVPQSV YHLIEARLSR LSGAARRVLD AAVVVGRDFD FAVVTKVAAL SEEATLDALD ELLAARLVLP LSDGQFRFDH PLTMEVVYRN LGDLRHRMLH RRVGEALEAL YRDRIDEVAG QIVLHFIEGG WSERAAGYAL RAAERAAALA AWPEAAEFYL KALDGVLPPQ RSTVLRKLGY ARLYAGAAGL AAEAFREAMG VASTLHEAVV ARLALAHALI PQGRYADVIA LVSQIDPDID PRQQAEALFL WGTALSLEGA DLGAATERLM MAEAVLRACP TAEAAALAQV SFELGNIAAQ QGDLTTAIAR YEMAQQIADE AGDSALTWRV LARNNQAYHR LLLGEISAAE TIITAALSLA EEGGLLSVMP YLLSTAGEVA LAQGDLETAI ARFQQGLALA TQFSIPERVA GITANLGLVA LQRGDTTCAV HYLSTALAQA DTLGARHLAA QIRVWLAPLL PPPEARIRLA EAQAIAEAGG RQRLLAAIAQ VRAALSDQAT VPR
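
The relationship between the gene and protein sequences above can be sketched in a few lines of Python. This is an illustrative example, not the tool the database uses. It assumes the sequence is pasted in the 10-character blocks shown here, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (table 11 differs in the alternative start codons it allows; this gene starts with ATG, so that does not matter here). The helper names are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; '*' marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks and last two blocks of the gene sequence above.
print(translate("ATGATTACGG TTCAGCTTTT TGGTTCGCCG"))  # MITVQLFGSP
print(translate("GTTCCGCGTT GA"))                     # VPR
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the Protein sequence and GC content fields of this record.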