Gene Rcas_3701 details


Gene Information

Locus tag: Rcas_3701
Symbol:
ID: 5541203
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4837476
End bp: 4840670
Gene Length: 3195 bp
Protein Length: 1064 aa
Translation table: 11
GC content: 68%
IMG OID: 640895812
Product: SARP family transcriptional regulator
Protein accession: YP_001433759
Protein GI: 156743630
COG category: [R] General function prediction only; [T] Signal transduction mechanisms
COG ID: [COG3629] DNA-binding transcriptional activator of the SARP family; [COG3903] Predicted ATPase
TIGRFAM ID:
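
The start and end coordinates, gene length, and protein length listed above are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(4837476, 4840670)
print(length)                     # 3195
print(protein_length_aa(length))  # 1064
```

Both values agree with the Gene Length and Protein Length fields of the record.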


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000037794
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGCTACTAT GCTACGAGAA TTTAGAATTC GTGTCGCAGC TCTCGCTCAC TCTATTGGGG 
ACACTGGCGA TCACCCTGGA TGGCCAGCCC GTCGCCGGCA TCGAGTCGGA CAAGGCGCGC
GCGCTGCTGG TCCGGCTGGC GCTGGAGCCG GAGCGCGCTT TCCGACGCGA GGCGCTGAGC
GCGCTCTTGT GGCCCGAAGC CGCGCCCGCA CAGGCTTCCC AGAACCTCCG TCAGGCGCTC
TACAACCTGC GCCGCGCCCT CGGCGAAGCC TTTTTGCTGA CGACGCCCCA CACCGTGCAG
TTCAACGCGG CTGCCGACGT GACGGTGGAC GCGCTGACCT GGCGCCGCCT GTGGAGTGAG
ACGCAGACGC ATCGCCACCG CCGCCGCGAG ACCTGCCGTC CCTGCCTGGA ACGCCTGGCG
CAGGCCATCG CCCTCTATCG CGGCGACCTG CTGGCCGGGT TCGCGCTCCA AGATAGCGCA
GAGTTCGACG ACTGGCTGGC CGTCGAGCGC GAACGGCTGC ACGTGCAGGC CCTGGAGGCG
CTGACGTTGC TGGCGAACGC CGCCGAGCGG CGCGGCGATT ACCCTGCCGC GCAGGAGTAC
GTGCGGCGAT TGTTGGCCCT GGAACCCTGG CAGGAGGCCG CGCATCGGCA TCTGATGCGC
CTGCTGGCCC TGGACGGCCG GCGCGCGGCG GCGCTGGCGC AGTTCGAGGT CTGTCGCCGC
GCGCTGGCCG ACGAGCTGGG CCTGGCGCCG GATGAGGAAA CCCGCGCCCT CCACGCGCGC
ATCCGCGCCG GCGAGCCGCT CTCCGCCGCG ATGCCGCTTC CCCCCGCGCC GCCCACCGAC
CTGCCGCTGC AACTCACCTC GTTCATCGGC CGTGAGCGCG AGTTGACGCT GTTGAGCGAG
CGCCTGAGCA ACCCCGCCTA CCGCCTCATC ACCCTCACCG GGCCGGGCGG GGTGGGAAAG
ACGCGCCTGG CGCTGCAACT CGCGGCGACC CTAGCGGAAC AGTTTGCCGA TGGCGTCTCT
TGGATCTCGC TGAGCGACGC CGTCACCGAA AGCAACCTGA TTCTGGCGAT TGCCGACGCG
CTGCACCTGC GCCTTTCCGG CGCACAAGAC CTGCGCGCGC AACTCTTGCA GGCGCTGCGT
CACGACCGGC GCGACCTGCT GCTGGTGCTG GACAATTTCG AGCAACTGTT GCCTGTCGGC
GGCGCGACGC TGGTACTGGA GGTGATGCGC GCCGCCCCGC GCCTCACGCT GCTGGTCACC
TCGCGCGAAC GGCTGAATCT GCAAGCCGAG TCGGTGCTCC CGCTAGAGGG CCTGGGCTAC
GATCTGCCCG CTCCTGACGC GCCGCCCTCC GAAGCCGCGC AGTTGTTCGT CGAGCGCGCC
GGACGTGCCC GAATGGACCT GAGCGTAGGC GCGGCAGACC AGACGATGAT CGCGGAGATT
TGTCATCTGC TGGAAGGCTT GCCGCTGGGC ATTGAGCTGG CCGCCGCTTG GGCCGGTGAA
ATGTCGCTGG AAGGCATCGC TGAAGCCATC ACCGTCACGC GCGATTTCCT CGCCTCCAGC
AGTCCCGACA TGCCCGACCG TCACCGCAGC CTGCGCGTGG TCTTTGAAGG CTCCTGGCAA
TTGCTTTCTC CGGAAGAGCA GTTCGCGCTG ATGCGGGTTT CCATCTTTCG CGGCGGCTTT
CAATCTGAGG CCGCGCAGCA CGTCGCCGGG GTGAGCGCGG CAATGCTCAG CCGTCTGGTG
CGCAAATCGT TGCTCTTCCT GGACGGGCCG CGCGTCCGCT ACGGGCTGCA CGGCGACATC
CGCTACTATG CGGCGGAGAA ACTGGCGGCG CAGCCGTCCA CTGCCCAGGA GATGGCCGCG
CGTCACGCCG CGTATTTTGC CGACCTGGTG CAGCAGCGAG AACAGGCCCT GCGCGGACGC
GCGCAGCAGG CAGTGCAGGC CGAGCTGGAA CCCGAATGGC AGAACGTGCT CGCCGCGTGG
CAATGGGCCA TCGCCCACGG CGACGAGGCG CTGCTCACCC GCCTGACCCA CGGGCTGTTT
GCCTTTTGCG AAGCCAAATC CTGGTTCCGC GAAGGTGCGG CCCTCTTCCA GCCCGCTTTA
GAGCGGATGC GGGAAGCGGC CCGCGCCGAC CTGGCGGCCG CCGCGCTGCT CCGTCGCCTG
CTGGGGCGGC AGGCCGTCTT TTGCCGACAA CTCTCGCAGT ACGCGCAAGC GCATCAGTTA
ATTGAAGAGG GCCTAGCCTT GCCGGGCCTG CCCGACGACG AGGAGCGCGC GTTCCTGCTG
TATCAAAAGT CCTGGGTGGA TTTTTTGCAG GCGCGGTACG CGCAGGCGCG CCAGTGGGCC
GAGGCAAGCC TGGAGCGTTA CCGCGCACTG GGGCAGCCGG TGGGCATCGG CGACAGCCTC
TATATGCTCG GCTGGACGGC CTACGAGTTG GGGGATTTTG CCGCCGCCGA GGCGCTCTGC
CTGGAGGCGC GGGCGGTGTG TGCGCAGGCC GATTATGCCT GGGGAGTGCA GTACGCCATC
TATGGGCTGG GGCTGGTGCG ACGCGCGCAG GGGGACTATG CCGCCGCCCG CCGCTGTTTC
GAGGAGAACA TGACGTTTTG CGACGCCATC GGCTACCTGT GGGGCGTGGC ACAGGCGCGC
ATCAACCTGG GGCTGGTGGC ACTGGCCGAG GGCAACGTGG AAGAGTCCGA AGCGCACTTT
CAAAAGAGCC TGCTCATCGG CGAGCAGATC GGCAATGCAT GGGTCAACGC GCAAAGCCAG
AAGGGCCTGA GCGATGCGGC TTTGGTGCGC CGCGACCTAT CCACTGCGCT GACCCTGGCG
GAGCGCAGCC TGGCGCTCTA TCAAGCGATG CAGGATCGGG ATGGGATGGC GGATAGCTTG
CTGCTGCTGA GCCAGGTCGC GCTGGCAAGC GGTGACCTCC CCGCCGCCCA CCGCGCCCTG
ACGGAAGCCG AGGGGTTGAT CCAGGCTACG GAAAACGGCT TCCGCGCCGC CAGGGCGCTG
GTTCAGCGGG TGGGCATCCT GCTGCGGGAA GGGGAAACTG CGCAGGCGCG GGCGCTGCTG
GAGGAAACGC TGCGCCATCC GGCCTGCGAG GCGTCCATCC ACAAGCGGGC TATGGAGGCG
CTGCGTGGAG CCGGCTACCT TGCCCCGCCT GCCGAAAGCG ACATTCGTCA CAGAACACCT
GAGTGGAAGC CATAG
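
The gene sequence is displayed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical. Only the first line of the sequence is used as sample input, so the printed fraction describes that 60-base fragment, not the whole-gene GC content reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed (sample input only).
first_line = "GTGCTACTAT GCTACGAGAA TTTAGAATTC GTGTCGCAGC TCTCGCTCAC TCTATTGGGG"

fragment = clean_sequence(first_line)
print(len(fragment))                      # 60
print(f"{gc_fraction(fragment):.1%}")     # GC fraction of this fragment only
```

Passing the full sequence block through `clean_sequence` should give a string whose length can be compared with the Gene Length field.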
 
Protein sequence
MLLCYENLEF VSQLSLTLLG TLAITLDGQP VAGIESDKAR ALLVRLALEP ERAFRREALS 
ALLWPEAAPA QASQNLRQAL YNLRRALGEA FLLTTPHTVQ FNAAADVTVD ALTWRRLWSE
TQTHRHRRRE TCRPCLERLA QAIALYRGDL LAGFALQDSA EFDDWLAVER ERLHVQALEA
LTLLANAAER RGDYPAAQEY VRRLLALEPW QEAAHRHLMR LLALDGRRAA ALAQFEVCRR
ALADELGLAP DEETRALHAR IRAGEPLSAA MPLPPAPPTD LPLQLTSFIG RERELTLLSE
RLSNPAYRLI TLTGPGGVGK TRLALQLAAT LAEQFADGVS WISLSDAVTE SNLILAIADA
LHLRLSGAQD LRAQLLQALR HDRRDLLLVL DNFEQLLPVG GATLVLEVMR AAPRLTLLVT
SRERLNLQAE SVLPLEGLGY DLPAPDAPPS EAAQLFVERA GRARMDLSVG AADQTMIAEI
CHLLEGLPLG IELAAAWAGE MSLEGIAEAI TVTRDFLASS SPDMPDRHRS LRVVFEGSWQ
LLSPEEQFAL MRVSIFRGGF QSEAAQHVAG VSAAMLSRLV RKSLLFLDGP RVRYGLHGDI
RYYAAEKLAA QPSTAQEMAA RHAAYFADLV QQREQALRGR AQQAVQAELE PEWQNVLAAW
QWAIAHGDEA LLTRLTHGLF AFCEAKSWFR EGAALFQPAL ERMREAARAD LAAAALLRRL
LGRQAVFCRQ LSQYAQAHQL IEEGLALPGL PDDEERAFLL YQKSWVDFLQ ARYAQARQWA
EASLERYRAL GQPVGIGDSL YMLGWTAYEL GDFAAAEALC LEARAVCAQA DYAWGVQYAI
YGLGLVRRAQ GDYAAARRCF EENMTFCDAI GYLWGVAQAR INLGLVALAE GNVEESEAHF
QKSLLIGEQI GNAWVNAQSQ KGLSDAALVR RDLSTALTLA ERSLALYQAM QDRDGMADSL
LLLSQVALAS GDLPAAHRAL TEAEGLIQAT ENGFRAARAL VQRVGILLRE GETAQARALL
EETLRHPACE ASIHKRAMEA LRGAGYLAPP AESDIRHRTP EWKP
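
The gene begins with GTG, yet the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code) GTG can serve as an alternative start codon, and an initiating codon is read as methionine. The sketch below illustrates this on the first 60 bases and the last 15 bases of the gene. The codon table is built from the standard compact amino-acid string; the start-codon set and the function name `translate_cds` are assumptions made for illustration, not taken from the source page.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in table 11
# as in the standard code); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons assumed for table 11; an initiating codon is translated as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(raw: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_line = "GTGCTACTAT GCTACGAGAA TTTAGAATTC GTGTCGCAGC TCTCGCTCAC TCTATTGGGG"
print(translate_cds(first_line))         # MLLCYENLEFVSQLSLTLLG
print(translate_cds("GAGTGGAAGC CATAG"))  # EWKP
```

The two outputs match the first 20 and the last 4 residues of the protein sequence shown above.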