Gene Rcas_4424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4424 
Symbol 
ID5541937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5688342 
End bp5691509 
Gene Length3168 bp 
Protein Length1055 aa 
Translation table11 
GC content64% 
IMG OID640896522 
Producttranscriptional activator domain-containing protein 
Protein accessionYP_001434458 
Protein GI156744329 
COG category[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family
[COG3899] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.518741 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCACCA TTCTCCTTCT CGGTCCGCCG CAGATCCTGT CCGATGGCAT CCCGGTAACG 
GTGTCGCGCC GTCGTGCGCG GGCGCTGGTC TACTATCTGG CAGCGCAGGA CAAACCGGTT
CACCGCGAGC GCTTGATCGA CCTGTTCTGG CACGACCACG ACCACGCTGC TGCCCGACAG
TTGCTCCGCA CCACGCTCCA CAGCGTGCGA CGGGTGCTTG GATCGGCTGT CGAAGGTGAT
GAGGAAGTTA CTCTCGCCCT CGATGCCGAC GTAGACTATC GCGCACTGGT CACTGCCGTC
ACTGCGCCCA TCGGCAATGA GTCGATACTG TCCGCCGCGC TGGAGCGCTA CCGTGATGAT
CTGCTGACCG GTTTCACCCT TCCCGATGCT CCGCTATTTG CCGATTGGCT CGAAGCAGAG
CGAGAGCGGG CGCGTTTGCT GGCGATGCGG GGATATACAC ACCTGGCGCG CATCGCCGAA
ATGCGCGGCG ATCTTGCTGC GGCGCTGACG GCGCTCGACC GCGCGCTGAC GTTCGATCCG
CTCCAGGAAG ACCTGCAACG CGAAATAATG CGCCTGCATT ACCTGTCAGG CGATCGGGTC
GGCGCCATCC GGCGCTACGA GAACCTGCGC GATCTGCTCG ACGCCGAATT GGGTGTGCCG
CCGATGCGTG AGACCCGCGA ACTCTACGAT GCGATCGTGA CCGATCGCCT GCTCGATAGT
GCATCGACGC AGTATCGCAG TCTTGAGAGT GAACGGTATA AGCAGCGGAA CGTGTCGAGC
AATCGTCCAC CGCCAGCGCG TCACACCCTG CCGTCCCTGC TGCCATTCAT TGGACGGTCG
GCGGAGATGG AAGCGATCGA GATGGTTGGT GCGGGACGGC TTATGCTGAT TGAAGGTGAA
ACCGGGATCG GCAAGACCCG TCTGGCGTTC GAGGCGCTGG AACGACATAC AGCGCGGGGC
GGATTGACCC TTATCGCGGC TGCGCGCGAA CTGGAGCAGG GATTGCCCTA CCGACCATGG
ATCGACCTGT TGCGTGATCT GCTGGCGCGT CCCAACTGGC ATATGCTGCG CGCACACCTC
AATCTCGATC CACTCTGGTT CGGCGAGGCG GCGCGGCTGT TGCCGGAACT GGCGCCTGGA
TCATCGGCGA CGACACAGGC GGACGAAGCG CGCCTGTGGG AAGGAGTGAC ACGGTTGTTG
ATTGCGCTGG CGAGTCTGAA GCCGCTCATG TTGCTCTTCG ACGATCTCCA TTGGGCGGAC
GCAAGCAGTC TTGGTCTGCT GGGATATGTG GTGCGGCGGG CTGAGGAAGC GTCGGTTCGC
CTGATTGCAA CTGCGCGCAC GACGGATCAT CAGGCGGCGC TCCGTATCTT GCTGAATGCG
CTGACCCGTG AGGGGCGTCT GGAACGCATA CTGCTGCGCC GTCTGAGTAC GACTGACACC
GAAGCGCTGG CGCGCGCGCT GAGTCCCGAT GATGCCGCGC GATTGGCTTC CTGGCTCTAT
CGTAATACCG AAGGCAATCC TTTCGTTATT GCCGAATTGG TGCGCCACGC CCGCACCACC
GGGTTGCTGT CCGCCGATGG GCGATTGAGT CCGGTTCTGC CCGATGAACC GGTTGTGCCT
GTGTCGGTAT ATGGTCTAAT CCAGTCACAA TTGGCGCGTC TTTCCGACGA AGCGCGCCGG
GTACTCGACA CGGCAGCGAC AGTCGGGCGG GTCTTTTCAT TTGATGTCGT GGCGCGCGCA
GCGGCGCTTT CCGAAGAGGC GGCGCTCGAC GCACTCGATG AACTGCGCGC TGCCCGCCTC
ATCGAGCCGT TAGCCAATGG TCGTTTTCAG TTCGACCATA GTTTGACGAT GGAGGTCGCG
TACCGCGAGA TGGGTGAGCC GCGCCATCGG GCGCTGCACC GGCGCGTCGC CGAGGCGCTC
GAAGCGCTCA ACCGCGACTG TCTGGATGAT GTGGCAGGAT TGATCGCCTG GCATTTTGCC
GAAGGCGGGG TTCCTGAGCG CGCGGCGACC TACGCGGTGC GCGCCGGTCG CCGCGCGGCG
CGCGTCGCCG CCTGGACGGA AGCGATTGCG TTTTACGAAC AGGCGCTGGC AGGTTTAGGG
GCTTCACAGC GGTTCGATGC GCTGATGAAC CTGGGAGAAG CGTTAGTGAT GGGCGGGAAA
GCGGCGCAGG CAGCGGAACG GTTCCGCGAA AGCCTGGCGC TGGCGCGCAC TCCTGCGGAA
GCGCGTCGGG CGCGGTTGAG TCTGGCGCGC GCACTGGCGC CTCAGGGACG GTACGCTGAA
ATGATCGAAG CAGTGCGCGG GTTGGAACAA CGTGGTGATC TCCGTGATCG GATCACGGCA
CTGTTCCTGT GGGGCACAGC GCTATCGCTC GAAGGGTCCG ATCTGATCGG TGCGGCGCTC
CGGTTGCGTG AAGCGGCGCG TCTGATTCTG GCGCAACCCG CGCCCGATCC CATCGCGCTA
GCGCAGGTGC GCTTCGAACT CGGCAGCGTG GCGGCTCAGC AGGGCGACCT GTCCGCCGCA
GTCGCCTCCT ACCGCGAGGC GCTGGCAGCC ACCGACAGCG CCACCGCGCA TCCAGAAGCG
CTCACATGGC GCATTCTGGC GTGCAACAAC CTTGCCTATC ATCTGCACCT GCAAGGGCGC
CTGGACGAAG CCGAACACTG GCTGGACGAA GGTCTGCGCC TGGCGAATGA GTATGGCATG
CTGGGGCTTC AACCATATCT GCTCTCGACC CAAGGCGAGA TCGCGCTGGC GCGCGGCAAC
CTCGATGCGG CTGAGTCCAG TTTTCTGGCG GGACTGACCC TTGCCGAACG CATGGTCGTT
TCTGAACGGG TGGCCGGCAT CACCGCCAAT CTCGGACTCG TCGCATTGCA CCGCGGACAC
GCCACACTCG CTATTCACCA CCTCTCGATA GCCCTGGCGC GCGCCGATAC GCTGGGCACA
CGTCATCTCG CCGCACAGAT CCGCATCTGG CTGGCGCCGC TCCTCCCGCC TGATGAAGCG
CGCACCGTTC TCGCGCAGGC GCGCACCATC GCCGAAAGCG GCGGTCGTCG TCGTCTGCTA
GCGGATATAG CCCGTGTGGA ATACCAATTA CGATTAAGGA TGCCGCATGC TGGTCATTCC
GAGCGCAGCG AGGAATCTGA GCGGGTTGCG CAAGACCCCT CGCTCTGA
 
Protein sequence
MLTILLLGPP QILSDGIPVT VSRRRARALV YYLAAQDKPV HRERLIDLFW HDHDHAAARQ 
LLRTTLHSVR RVLGSAVEGD EEVTLALDAD VDYRALVTAV TAPIGNESIL SAALERYRDD
LLTGFTLPDA PLFADWLEAE RERARLLAMR GYTHLARIAE MRGDLAAALT ALDRALTFDP
LQEDLQREIM RLHYLSGDRV GAIRRYENLR DLLDAELGVP PMRETRELYD AIVTDRLLDS
ASTQYRSLES ERYKQRNVSS NRPPPARHTL PSLLPFIGRS AEMEAIEMVG AGRLMLIEGE
TGIGKTRLAF EALERHTARG GLTLIAAARE LEQGLPYRPW IDLLRDLLAR PNWHMLRAHL
NLDPLWFGEA ARLLPELAPG SSATTQADEA RLWEGVTRLL IALASLKPLM LLFDDLHWAD
ASSLGLLGYV VRRAEEASVR LIATARTTDH QAALRILLNA LTREGRLERI LLRRLSTTDT
EALARALSPD DAARLASWLY RNTEGNPFVI AELVRHARTT GLLSADGRLS PVLPDEPVVP
VSVYGLIQSQ LARLSDEARR VLDTAATVGR VFSFDVVARA AALSEEAALD ALDELRAARL
IEPLANGRFQ FDHSLTMEVA YREMGEPRHR ALHRRVAEAL EALNRDCLDD VAGLIAWHFA
EGGVPERAAT YAVRAGRRAA RVAAWTEAIA FYEQALAGLG ASQRFDALMN LGEALVMGGK
AAQAAERFRE SLALARTPAE ARRARLSLAR ALAPQGRYAE MIEAVRGLEQ RGDLRDRITA
LFLWGTALSL EGSDLIGAAL RLREAARLIL AQPAPDPIAL AQVRFELGSV AAQQGDLSAA
VASYREALAA TDSATAHPEA LTWRILACNN LAYHLHLQGR LDEAEHWLDE GLRLANEYGM
LGLQPYLLST QGEIALARGN LDAAESSFLA GLTLAERMVV SERVAGITAN LGLVALHRGH
ATLAIHHLSI ALARADTLGT RHLAAQIRIW LAPLLPPDEA RTVLAQARTI AESGGRRRLL
ADIARVEYQL RLRMPHAGHS ERSEESERVA QDPSL