Gene Rcas_4327 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4327 
Symbol 
ID5541840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5577109 
End bp5580174 
Gene Length3066 bp 
Protein Length1021 aa 
Translation table11 
GC content63% 
IMG OID640896433 
ProductATP-dependent transcription regulator LuxR 
Protein accessionYP_001434369 
Protein GI156744240 
COG category[K] Transcription 
COG ID[COG2909] ATP-dependent transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCATT ATCGTCACAC CCATGTCGTC GACGACAACA CTCTCCTCCA GGCGAACGGG 
AGCGACGCCA TTCCGCTTGG TTCGCCCGCC TGGTATGCCT GGCTGAACGA CGACCAGACC
ACCTCCTTCG TGTACCGCAG CGCACGTGGC GGATTTACAG CGCGCCGCGA ACGTCAGCGC
AATGGCTGGT ACTGGTATGC CTACCGACGC ATCGGTGGAA GATTGCGCAA ACGCTATCTG
GGGCGCGCCG GCGATCTCAA CGCCGAACGA CTGCGCCGTG TCGATATGGC GTTCATCAAC
GGCGAGACGC CAACGCATGC GCCATACCAT TCCAGCGATC AGGCGCTGAC AACTGCCAAC
CGACTGCGTC CACCGCCGCC GCGCCCGTCG TTGATTGCCC GCCCTCGCCT GATCGAGCGC
ATCGATGATA CGCTTCACGC CGCGCTCACC CTGATCTCTG CTCCAGCCGG GTTTGGGAAG
ACGACTCTGC TCACCGAATG GGCGCTGCAC GTGCAGAGCG CAGCCCGCGC AGCAGTCGCA
TGGATCTCGC TCGACGCGAC CGATGATGAT CCGGTGCGTT TCTGGAGCGC GATAACACTG
GCGTTGAGCG TCGTTCGCAC TGACATTGGC GCCCATGCAC GCTCACTCCT GGAGTCGCCG
CAGCGCCCGC CGCATATGCT GCTGGCGCAT GCCCTGCTTG TCGATCTGAA CGCGGTGGCA
GCGCCGATCA TCGTTGTGCT CGACGATTAT CACCTGATCA CCTCAGCGGA GGTACACGAA
TCGCTGACGA CGTTCATCGA ACATCTGTCG CCGCACGTAC ACCTGGTCAT CGCCACACGC
GCCGATCCGC CGCTGCCGCT GGCGCGCTGG CGGGTGCGCG GGCGCCTGGC TGAACTGCGC
GCTGCCGATC TGCGCTTCAC CACCGAAGAG GCTGCGGCTT TCCTTCACTC AACCATGGGG
CTGGATTTGC CACTCGACGT GATTCGTGCG CTTGAGGACC GCACCGAGGG ATGGGTGGCG
GCGCTGCAAC TGGCGGCGCT TTCCATCCGT GATCGCGCCG ATGTCGAGGG GTTCATCCGG
CGATTCGCCG GAAGCCACCG GGCGATTGTC GATTACCTTG CCGAAGAGGT CATCGGTCGG
CAACCGGATC ACATCCAGAC ATTCCTGCTG CGCACATCTA TCGTTGATCG CATGTCGGCG
GAGTTGTGCG ATGCGCTGCT CACACCTGCC GCAGATGAAC CGGGTGCAAC AATCCCGGCG
CAGGCAATGC TCGACCACCT GGAACGCGCG AACCTGTTCG TGACGCCGCT CGACGAAGAG
CGACGCTGGT ATCGCTATCA TCATCTCTTT GCCGAAATGC TGCGCGACCG GTTGCAACGC
ACTCAGCCGG CGCTGGTTGC AGAACTCCAT CGGCGCGCTG CGCGCTGGCA CCGCGAGGCG
GGGTTCGCCG ACGCTGCGAT CCATCATTTT CTTCAGGCAG GAGATGCCGC CAACGCCGCC
GATGTCATTG AAGCAATCGC CAACGCGACC CTCTGGGAGC AGGGCGATGC GCAGACGCTT
CTGCGCTGGA TCGCCGCATT GCCCGATGAC ACTGTTCTGG CGCGTCCGCG TCTGGCGCTC
GATCAGGTGT GGGCATTGCT GGCAAGCATT CAATTCGATG CAGCAGAAGC ACGCGCCGCT
GAACTTGACC ATGTGCTGAC TCGGCGAGAT GCCCAAATGC GATCAATCGT CATTGGTGAA
CTGGCGGCGG TGCGTTCGGT TGCTGCGCGC GTGCGCGGGG ATGTGCCGCG CGCACTGGAG
TACGCCACGC ATGCGCGCGA TCATCTGGCA AGCATCGATT GCAGCCTGGC GCGCGCGATT
GCCGTCATGA ACCTCGTAGA AGCCCGCCTG ATGCAGGGCG ATCTGTCGAG CGCGGAACAG
CTCTGTGACG AATTGCGCGT TGCATCGCTG AGCCGCAACC TGGTTATCGC CCTGATCGGG
GTATTGTCGC GCTCAGAAGT GTACCGCAAT CAGGGTCGGC TCACCGATGC GCGCGCGCTC
CTCGATGAAG CGCTACGCCT CCTCGACCGA CGCGGCGCTT CGGAACGCCC GATCGTCGCG
CTCATCCATG TGGCGCTGGC AGACATTTTC TATGAGCAGA ACAGGCTCGA TGAAGCGGAA
TACTACGCAA ACCTGGCACT CAGACGCGCA GAACGCTGGT GGAACAACGA CATCCTCATC
CACAGCACAG GGCTGCTCTC TGCCATCCGT CGGGCGCAGG GCGATGAAGC CGCTGCTGCA
AAACTTGCCG AACGAGTCGA GCAACTCAGC CTCGAATACC GGGTTGGATG GATTTCCAAT
CACACCCTTG CGGGGCGCGC CGAGCGTCTC TTGCGCACCG GTGATCGGCA GGCTGCTGAA
CGGTGGGCGG CAGGGTGCGG ACTCACGCTG GCAGATGATC CACCGCCCGA ACGGTTCTAC
GAATATCTGG TGCTGGCGCG TGTCGCGCTG GCGCGTGGCG AAGGACGCAC AGCGCTTCCG
TTGATCCGGC GGTTACTGAA ACAGAGTGAG GAAGGGCGAC ATGCCATCCG CATGATCGAA
ACGTTGAAAC TTCTGGCGCT CACGAACCGT CAGATCGGTG ATGCCGCCAG CGCGCGCCAG
GCGCTCATTC GTGCATTACG CCTCGCTGAA CCGGGCGGTC TGGTGCGCAC ATTCGTTGAT
GAAGGCGAAG CCATGAAATG GTTGCTCGCC GATTGTGGCG CCTCGATTGC GCCACAGGCA
CAGCAAGGCG ACGGAGATGC GCGTCGTCTT CTGGCGTATG TCGATGATTT GATGGGAATG
TTTCGCAGCA TACCAACAAC CCAAACCGCC TCTCCGCTCG TCCACGCCAA CGAGCCGCTC
ACAGCGCGTG AGATCCAGGT GTTGCGCCTG CTCGCCACCG GGCGCACCGA TCAGGAAATC
GCCACAAGCC TGGTCATCGC GGTCAGCACC GTGCGATCAC ATATCAAGCG CATCTACGCC
AAAATCAACG CACGCAACCG CACCCAGGCT GCTGCCCGCG CCCACGATCT GGGCATCATC
GACTGA
 
Protein sequence
MRHYRHTHVV DDNTLLQANG SDAIPLGSPA WYAWLNDDQT TSFVYRSARG GFTARRERQR 
NGWYWYAYRR IGGRLRKRYL GRAGDLNAER LRRVDMAFIN GETPTHAPYH SSDQALTTAN
RLRPPPPRPS LIARPRLIER IDDTLHAALT LISAPAGFGK TTLLTEWALH VQSAARAAVA
WISLDATDDD PVRFWSAITL ALSVVRTDIG AHARSLLESP QRPPHMLLAH ALLVDLNAVA
APIIVVLDDY HLITSAEVHE SLTTFIEHLS PHVHLVIATR ADPPLPLARW RVRGRLAELR
AADLRFTTEE AAAFLHSTMG LDLPLDVIRA LEDRTEGWVA ALQLAALSIR DRADVEGFIR
RFAGSHRAIV DYLAEEVIGR QPDHIQTFLL RTSIVDRMSA ELCDALLTPA ADEPGATIPA
QAMLDHLERA NLFVTPLDEE RRWYRYHHLF AEMLRDRLQR TQPALVAELH RRAARWHREA
GFADAAIHHF LQAGDAANAA DVIEAIANAT LWEQGDAQTL LRWIAALPDD TVLARPRLAL
DQVWALLASI QFDAAEARAA ELDHVLTRRD AQMRSIVIGE LAAVRSVAAR VRGDVPRALE
YATHARDHLA SIDCSLARAI AVMNLVEARL MQGDLSSAEQ LCDELRVASL SRNLVIALIG
VLSRSEVYRN QGRLTDARAL LDEALRLLDR RGASERPIVA LIHVALADIF YEQNRLDEAE
YYANLALRRA ERWWNNDILI HSTGLLSAIR RAQGDEAAAA KLAERVEQLS LEYRVGWISN
HTLAGRAERL LRTGDRQAAE RWAAGCGLTL ADDPPPERFY EYLVLARVAL ARGEGRTALP
LIRRLLKQSE EGRHAIRMIE TLKLLALTNR QIGDAASARQ ALIRALRLAE PGGLVRTFVD
EGEAMKWLLA DCGASIAPQA QQGDGDARRL LAYVDDLMGM FRSIPTTQTA SPLVHANEPL
TAREIQVLRL LATGRTDQEI ATSLVIAVST VRSHIKRIYA KINARNRTQA AARAHDLGII
D