Gene Information |
Locus tag | Rcas_4424 |
Symbol | |
ID | 5541937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 5688342 |
End bp | 5691509 |
Gene Length | 3168 bp |
Protein Length | 1055 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640896522 |
Product | transcriptional activator domain-containing protein |
Protein accession | YP_001434458 |
Protein GI | 156744329 |
COG category | [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family [COG3899] Predicted ATPase |
TIGRFAM ID | |
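The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(5688342, 5691509)
print(length)                     # 3168
print(protein_length_aa(length))  # 1055
```

Both values match the Gene Length and Protein Length rows of the record.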
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.518741 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCACCA TTCTCCTTCT CGGTCCGCCG CAGATCCTGT CCGATGGCAT CCCGGTAACG GTGTCGCGCC GTCGTGCGCG GGCGCTGGTC TACTATCTGG CAGCGCAGGA CAAACCGGTT CACCGCGAGC GCTTGATCGA CCTGTTCTGG CACGACCACG ACCACGCTGC TGCCCGACAG TTGCTCCGCA CCACGCTCCA CAGCGTGCGA CGGGTGCTTG GATCGGCTGT CGAAGGTGAT GAGGAAGTTA CTCTCGCCCT CGATGCCGAC GTAGACTATC GCGCACTGGT CACTGCCGTC ACTGCGCCCA TCGGCAATGA GTCGATACTG TCCGCCGCGC TGGAGCGCTA CCGTGATGAT CTGCTGACCG GTTTCACCCT TCCCGATGCT CCGCTATTTG CCGATTGGCT CGAAGCAGAG CGAGAGCGGG CGCGTTTGCT GGCGATGCGG GGATATACAC ACCTGGCGCG CATCGCCGAA ATGCGCGGCG ATCTTGCTGC GGCGCTGACG GCGCTCGACC GCGCGCTGAC GTTCGATCCG CTCCAGGAAG ACCTGCAACG CGAAATAATG CGCCTGCATT ACCTGTCAGG CGATCGGGTC GGCGCCATCC GGCGCTACGA GAACCTGCGC GATCTGCTCG ACGCCGAATT GGGTGTGCCG CCGATGCGTG AGACCCGCGA ACTCTACGAT GCGATCGTGA CCGATCGCCT GCTCGATAGT GCATCGACGC AGTATCGCAG TCTTGAGAGT GAACGGTATA AGCAGCGGAA CGTGTCGAGC AATCGTCCAC CGCCAGCGCG TCACACCCTG CCGTCCCTGC TGCCATTCAT TGGACGGTCG GCGGAGATGG AAGCGATCGA GATGGTTGGT GCGGGACGGC TTATGCTGAT TGAAGGTGAA ACCGGGATCG GCAAGACCCG TCTGGCGTTC GAGGCGCTGG AACGACATAC AGCGCGGGGC GGATTGACCC TTATCGCGGC TGCGCGCGAA CTGGAGCAGG GATTGCCCTA CCGACCATGG ATCGACCTGT TGCGTGATCT GCTGGCGCGT CCCAACTGGC ATATGCTGCG CGCACACCTC AATCTCGATC CACTCTGGTT CGGCGAGGCG GCGCGGCTGT TGCCGGAACT GGCGCCTGGA TCATCGGCGA CGACACAGGC GGACGAAGCG CGCCTGTGGG AAGGAGTGAC ACGGTTGTTG ATTGCGCTGG CGAGTCTGAA GCCGCTCATG TTGCTCTTCG ACGATCTCCA TTGGGCGGAC GCAAGCAGTC TTGGTCTGCT GGGATATGTG GTGCGGCGGG CTGAGGAAGC GTCGGTTCGC CTGATTGCAA CTGCGCGCAC GACGGATCAT CAGGCGGCGC TCCGTATCTT GCTGAATGCG CTGACCCGTG AGGGGCGTCT GGAACGCATA CTGCTGCGCC GTCTGAGTAC GACTGACACC GAAGCGCTGG CGCGCGCGCT GAGTCCCGAT GATGCCGCGC GATTGGCTTC CTGGCTCTAT CGTAATACCG AAGGCAATCC TTTCGTTATT GCCGAATTGG TGCGCCACGC CCGCACCACC GGGTTGCTGT CCGCCGATGG GCGATTGAGT CCGGTTCTGC CCGATGAACC GGTTGTGCCT GTGTCGGTAT ATGGTCTAAT CCAGTCACAA TTGGCGCGTC TTTCCGACGA AGCGCGCCGG GTACTCGACA CGGCAGCGAC AGTCGGGCGG GTCTTTTCAT TTGATGTCGT GGCGCGCGCA GCGGCGCTTT CCGAAGAGGC GGCGCTCGAC GCACTCGATG AACTGCGCGC TGCCCGCCTC 
ATCGAGCCGT TAGCCAATGG TCGTTTTCAG TTCGACCATA GTTTGACGAT GGAGGTCGCG TACCGCGAGA TGGGTGAGCC GCGCCATCGG GCGCTGCACC GGCGCGTCGC CGAGGCGCTC GAAGCGCTCA ACCGCGACTG TCTGGATGAT GTGGCAGGAT TGATCGCCTG GCATTTTGCC GAAGGCGGGG TTCCTGAGCG CGCGGCGACC TACGCGGTGC GCGCCGGTCG CCGCGCGGCG CGCGTCGCCG CCTGGACGGA AGCGATTGCG TTTTACGAAC AGGCGCTGGC AGGTTTAGGG GCTTCACAGC GGTTCGATGC GCTGATGAAC CTGGGAGAAG CGTTAGTGAT GGGCGGGAAA GCGGCGCAGG CAGCGGAACG GTTCCGCGAA AGCCTGGCGC TGGCGCGCAC TCCTGCGGAA GCGCGTCGGG CGCGGTTGAG TCTGGCGCGC GCACTGGCGC CTCAGGGACG GTACGCTGAA ATGATCGAAG CAGTGCGCGG GTTGGAACAA CGTGGTGATC TCCGTGATCG GATCACGGCA CTGTTCCTGT GGGGCACAGC GCTATCGCTC GAAGGGTCCG ATCTGATCGG TGCGGCGCTC CGGTTGCGTG AAGCGGCGCG TCTGATTCTG GCGCAACCCG CGCCCGATCC CATCGCGCTA GCGCAGGTGC GCTTCGAACT CGGCAGCGTG GCGGCTCAGC AGGGCGACCT GTCCGCCGCA GTCGCCTCCT ACCGCGAGGC GCTGGCAGCC ACCGACAGCG CCACCGCGCA TCCAGAAGCG CTCACATGGC GCATTCTGGC GTGCAACAAC CTTGCCTATC ATCTGCACCT GCAAGGGCGC CTGGACGAAG CCGAACACTG GCTGGACGAA GGTCTGCGCC TGGCGAATGA GTATGGCATG CTGGGGCTTC AACCATATCT GCTCTCGACC CAAGGCGAGA TCGCGCTGGC GCGCGGCAAC CTCGATGCGG CTGAGTCCAG TTTTCTGGCG GGACTGACCC TTGCCGAACG CATGGTCGTT TCTGAACGGG TGGCCGGCAT CACCGCCAAT CTCGGACTCG TCGCATTGCA CCGCGGACAC GCCACACTCG CTATTCACCA CCTCTCGATA GCCCTGGCGC GCGCCGATAC GCTGGGCACA CGTCATCTCG CCGCACAGAT CCGCATCTGG CTGGCGCCGC TCCTCCCGCC TGATGAAGCG CGCACCGTTC TCGCGCAGGC GCGCACCATC GCCGAAAGCG GCGGTCGTCG TCGTCTGCTA GCGGATATAG CCCGTGTGGA ATACCAATTA CGATTAAGGA TGCCGCATGC TGGTCATTCC GAGCGCAGCG AGGAATCTGA GCGGGTTGCG CAAGACCCCT CGCTCTGA
|
Protein sequence | MLTILLLGPP QILSDGIPVT VSRRRARALV YYLAAQDKPV HRERLIDLFW HDHDHAAARQ LLRTTLHSVR RVLGSAVEGD EEVTLALDAD VDYRALVTAV TAPIGNESIL SAALERYRDD LLTGFTLPDA PLFADWLEAE RERARLLAMR GYTHLARIAE MRGDLAAALT ALDRALTFDP LQEDLQREIM RLHYLSGDRV GAIRRYENLR DLLDAELGVP PMRETRELYD AIVTDRLLDS ASTQYRSLES ERYKQRNVSS NRPPPARHTL PSLLPFIGRS AEMEAIEMVG AGRLMLIEGE TGIGKTRLAF EALERHTARG GLTLIAAARE LEQGLPYRPW IDLLRDLLAR PNWHMLRAHL NLDPLWFGEA ARLLPELAPG SSATTQADEA RLWEGVTRLL IALASLKPLM LLFDDLHWAD ASSLGLLGYV VRRAEEASVR LIATARTTDH QAALRILLNA LTREGRLERI LLRRLSTTDT EALARALSPD DAARLASWLY RNTEGNPFVI AELVRHARTT GLLSADGRLS PVLPDEPVVP VSVYGLIQSQ LARLSDEARR VLDTAATVGR VFSFDVVARA AALSEEAALD ALDELRAARL IEPLANGRFQ FDHSLTMEVA YREMGEPRHR ALHRRVAEAL EALNRDCLDD VAGLIAWHFA EGGVPERAAT YAVRAGRRAA RVAAWTEAIA FYEQALAGLG ASQRFDALMN LGEALVMGGK AAQAAERFRE SLALARTPAE ARRARLSLAR ALAPQGRYAE MIEAVRGLEQ RGDLRDRITA LFLWGTALSL EGSDLIGAAL RLREAARLIL AQPAPDPIAL AQVRFELGSV AAQQGDLSAA VASYREALAA TDSATAHPEA LTWRILACNN LAYHLHLQGR LDEAEHWLDE GLRLANEYGM LGLQPYLLST QGEIALARGN LDAAESSFLA GLTLAERMVV SERVAGITAN LGLVALHRGH ATLAIHHLSI ALARADTLGT RHLAAQIRIW LAPLLPPDEA RTVLAQARTI AESGGRRRLL ADIARVEYQL RLRMPHAGHS ERSEESERVA QDPSL
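The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in the permitted start codons, so a standard codon table is sufficient here. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the code assumes the sequence is given in frame, in the 10-base blocks used in this record.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks between 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGCTCACCA TTCTCCTTCT CGGTCCGCCG"))  # MLTILLLGPP
print(translate("CAAGACCCCT CGCTCTGA"))               # QDPSL
```

The two fragments reproduce the first ten and last five residues of the listed protein sequence. Applying `gc_fraction` to the full gene sequence would give a value to compare against the GC content row.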