Gene RoseRS_1015 details


Gene Information

Locus tag: RoseRS_1015
Symbol:
ID: 5207961
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 1242622
End bp: 1246272
Gene Length: 3651 bp
Protein Length: 1216 aa
Translation table: 11
GC content: 62%
IMG OID: 640594629
Product: SARP family transcriptional regulator
Protein accession: YP_001275374
Protein GI: 148655169
COG category: [T] Signal transduction mechanisms
COG ID: [COG3629] DNA-binding transcriptional activator of the SARP family
TIGRFAM ID:
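
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, excluding one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1242622, 1246272)
print(length)                     # 3651
print(protein_length_aa(length))  # 1216
```

Both results match the Gene Length and Protein Length fields listed in the record.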


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATAGCG CAAGACGACA AAAGCGATGT CTGAACAGGT ATAATACCAT CACACGCAGA 
GTATCCGGTT CAATCCGCCC GAATCGCAAA CGCATCGTCA TGATCGGTCC GCATCATCCA
CATCCATCGC TCGAACTCCG GGTTTTCGGC ACGCCGCGCC TTTTGCTCGA CCAGCACGAC
GTGCCGTTCC GCCGGCGACA ACCGCTGGCA ATTATGGCTG TGCTGGCGCT CAGCGAACGT
TCCGTCACCC GCGATGAACT GACGTACCTG TTGTGGCCCG ACTCGCCGCA GACCGTCGGA
CGTCAACGGT TACGCCGTTT ACTCTCACAA CTGCGTCAGT CGGTCGGTCC GCTGGCTGAC
CGGTTGTTGA CCGGTGAAGC CGGGATCGGC AGTGGCGTCA TTCGTTTCGA TGCCAGTATG
TGCCGGGTCG ATGCCCGTGA GTTCGTGCGC TTATCCGGTC AGGCGCGCAC ATTGCCCGAT
CCGGACGGCT TACGCGCTGC CGAGCAGGCG ACGCGACTCT ATGCCGGACC GCTGCTGAGC
GGTCTGGAAC TCGAAGAGTC GCCGGAGTTC GAACACTGGC TCCTCCAGCA GCGCGAACGC
TTCGAGCGCA TGATCCTCGA TATCTGGCGG CGTCTCGTCG ATGGATACGC CAGCACAGGC
GATTTTGACC GCGCTATTGC TGCGGTTGAA CATGCGCTCG TTCTCGATCC GCTCTCTGAA
ATGCTGCACC GCAAGGCGAT ATGGCTCTAT GCAAAAACCG GTCGTCGCAG TGACGCCATT
CGGCAGTTTA CCGTCTGTGT TGCGTTGCTC GAACGCGAAC TGGCGCTTGA GCCGGACGCA
GACACTGCGG CGCTCTATCG GGCTATTCTG AATAACCAGC TCGATACTGC GCGGTCGCTG
GCATTCCGGG ATCAGTCGCT TCCATCCGCC ATCCGCAGTT CCACCGGTCG TGCGGAACAT
ACCTCGCCAG CGACACTCAC GACGACGTTG ACAGCAGATC TGGTCGCGGC CATGCGCACC
GCGCTCACCG GATCGTCGCC GGTTGTCGTT GTGGAGGGAC CGGCTGGCAG CGGGAAAACG
CGCATGGTTC GCCAGGCGCT GGATCAGATT CGACAGACAA CGACCAATCT TCTGGTCTGG
CTATCCGGTG CACGGCGATC AGGTCAGCAA CGCCCCTTTG GCGTCATGAT CGACCTGCTC
GATACGGCGC TGCGCGAGCG ATTGAACCAT CCTGACGCAG AGCGTGCTTT GCCGACCGAC
GTGCGGATGA CTGAGGCCGT CCGCGTGCTG CCGGAGTTGC GCGCCGTTTT CCTCCGATTG
CAACCCATTC GCCAGCCCAC AGAAGAATCA CCATCGTCCC ACCACCTCTT GCAGCGGCGG
TTGTTGCAGG CGCTGCCACG CGCCGTCCAT GCGCTGGCAG GCGGAGAGCC GGTTATCGTG
GCGCTGGAAG ACCTGGATCA GGCCGATCCA CTCTCGATAG AAGCCGTCGC CTGGCTGGCG
CGTTCGCTGC ACGGAACGAA TCTGGCGCTG GTGATCACGT GTCGCACCGC AGAGGGCGCC
CTCAACGCGA TGCTCTCCGA TCTGCGCGCA CGCGGCATGC TGCAATCGTT GACGCTGACT
GCATTCGATC ACCCGACAGT CATCGGTCTG GCGCAGCATG CCGGTCTGCC GACGACGACG
GCTGAACAGA TCTGGCGGCA GACGGGCGGT GCGCCGCTGG CAACCCGCGA GATGGTGCGT
GCGATGGTTG CTGCCGGAAA AGACCTTTCC GCACTTCCTT TGTCACTGCA CGAAGCCATT
CAGATTCAGT TGAAATCGCT TGCCCCGACC ATCCGCCAGG TTATGGAAGC AGCAGCGGTG
CTGCAAAGCG GTGATGCGCT TGAGATTCAG CAGGTCAGCG GACGCACCGC CGACGAGGTG
GAACACGCCT GTGAGGAACT TCAGGCACGC GACTGGCTCA CGTTTGACGG ATCACGGTAC
GTCGTCGCGC ATCCAGAGGT GCGGGAAGCA GTGCTGGAAA GCCTGAGTCC GGCGCGTCGC
CAGCGGCTGC ACCGTCAGGC TGCGTTGGTG ATGCGTCAGC ACAACGCGGA CCCTGCGCGA
ATTGCATCCC ATCTTGAGTC CGCCGATCAA CCAGATGAGG CAGCCGCGAT GTGGCTTCAG
GCGGCGCGGC GCGCCCGGTC GCTCTATGCG CGCGATGCTG CACTCACGGC GCTTCAGTGT
GGTCTGGGGC TGGTTCGTGA TCGACACGTG CTGTTCGAAT TGTTGAGCGA ACAGGAATCA
ATTCTGCACG AACACGGGTT GCGCGATGAA CAACGCGCGA CGCTGGAGAC ACTGGAGCGT
TTCGTCGAAC AGTCACCCGA TCACCCCGAC TGGCGCGCCG AAGTCTATCG CAAACGCGGG
CGTCTTGCCC TTGCGTGCAA CGAGTGGAAT GCCGCTATCG ATGCGCTGCG TCGAGCGGCA
GTGTTCACAC TTCACAGCGA CTGGGCAACG TTGTGTCTGC TGGCGCGCGC CCTCGGACAC
AACCAGCAGT GGCACGAAGC GGACGAGATA CTCCAACGCG CGCTGGCGCT GGCACAGCAG
CAGCGTGATC GTGAAGCGCA GGCGCGCTGC TGGCTGACCC GCGCCGACAT CGAGCAGGGA
CGGGAGCGTT TCGACGCTGC CGAAAGCGCC TTGAAGCACG CCGTGCACCT GATCGAACCC
TCATCACCAA CATTGCCGCA ACTGATGTTG AACCTTGGGA ATATGGCTAC CGTGCGCAAC
GATTTCGTCA GTGCGCTGAC GTATGGACAG GAGGCGCAAC GTCTGTTTGC GCAGCGGGGC
GCGCCCGATA GCGAAGCCGC CGCCTGGGTG CTGGTCGCCC GGATGCACGC CCGCCTGGGG
CAGTTTGAAG CGGCATTCGA AGCGTACCAG TCCGCCTATG CGGGGTATGC TGCGCTCGAA
CTGCGCCAGG GTATGGCAGC CGCTCGTATC AATGCGTGCA CCCTGGCGTT GCGCATTGGC
AATTTTGACA ACGGGTTGCG CCTGGCAGAG GAAGCCTGGG AACTGTTCCA GGCGATCAAC
GATGCGCGTG GCATGTGTGT CACCGCCAGC AACAGGGGTG CGGCGCTGGT CTGGATGGGA
CGCGGCGCCG AAGCCGAGCC ATGGCTGCGC GAGTCGTATG AGCGTGCGGT TGCGATTCCG
CTGCCTGCGC AACAGGCGGC AGCGCTGGCG AACCTGGGCG CGGCGCTGCT CCAGCAGGGA
CGGCTGGAAG AAGCCCGTCG GTTGATGGAA CAGGGACTGG CGTTGCGCGT CGCGCAGGGG
CATATCGATG TGAGTGTCGA CCGCGCATTT CTGGCGATAG CCTGCCTGCG TCTTGGCGAC
ATCGAGGCTG CTGACCGGTA TAGCCTGGAG GCTGTGGAGT ACCTGGCGCG AGCGCCACAG
GTAGAAAACC CACAGCAGGT CTGGTTTGCA CGTGCTCAGG TCCTCCGCGC TCAGGGTTTG
ATAACCGAAG CTAACAATGC TTTGCAGTCC GCCGTGGAAT GTCTATACCG CAGCGAGCAA
CAACTGCCGC CACCGTACCG CGAACGCTAC CGCAGCGTGT TTTCGTTCAA TCGTGCGATC
CTGCATGCCT TCGATAATGG CGTCTGGCCC GAACCGCCGA TGCTGGTGTG A
 
Protein sequence
MHSARRQKRC LNRYNTITRR VSGSIRPNRK RIVMIGPHHP HPSLELRVFG TPRLLLDQHD 
VPFRRRQPLA IMAVLALSER SVTRDELTYL LWPDSPQTVG RQRLRRLLSQ LRQSVGPLAD
RLLTGEAGIG SGVIRFDASM CRVDAREFVR LSGQARTLPD PDGLRAAEQA TRLYAGPLLS
GLELEESPEF EHWLLQQRER FERMILDIWR RLVDGYASTG DFDRAIAAVE HALVLDPLSE
MLHRKAIWLY AKTGRRSDAI RQFTVCVALL ERELALEPDA DTAALYRAIL NNQLDTARSL
AFRDQSLPSA IRSSTGRAEH TSPATLTTTL TADLVAAMRT ALTGSSPVVV VEGPAGSGKT
RMVRQALDQI RQTTTNLLVW LSGARRSGQQ RPFGVMIDLL DTALRERLNH PDAERALPTD
VRMTEAVRVL PELRAVFLRL QPIRQPTEES PSSHHLLQRR LLQALPRAVH ALAGGEPVIV
ALEDLDQADP LSIEAVAWLA RSLHGTNLAL VITCRTAEGA LNAMLSDLRA RGMLQSLTLT
AFDHPTVIGL AQHAGLPTTT AEQIWRQTGG APLATREMVR AMVAAGKDLS ALPLSLHEAI
QIQLKSLAPT IRQVMEAAAV LQSGDALEIQ QVSGRTADEV EHACEELQAR DWLTFDGSRY
VVAHPEVREA VLESLSPARR QRLHRQAALV MRQHNADPAR IASHLESADQ PDEAAAMWLQ
AARRARSLYA RDAALTALQC GLGLVRDRHV LFELLSEQES ILHEHGLRDE QRATLETLER
FVEQSPDHPD WRAEVYRKRG RLALACNEWN AAIDALRRAA VFTLHSDWAT LCLLARALGH
NQQWHEADEI LQRALALAQQ QRDREAQARC WLTRADIEQG RERFDAAESA LKHAVHLIEP
SSPTLPQLML NLGNMATVRN DFVSALTYGQ EAQRLFAQRG APDSEAAAWV LVARMHARLG
QFEAAFEAYQ SAYAGYAALE LRQGMAAARI NACTLALRIG NFDNGLRLAE EAWELFQAIN
DARGMCVTAS NRGAALVWMG RGAEAEPWLR ESYERAVAIP LPAQQAAALA NLGAALLQQG
RLEEARRLME QGLALRVAQG HIDVSVDRAF LAIACLRLGD IEAADRYSLE AVEYLARAPQ
VENPQQVWFA RAQVLRAQGL ITEANNALQS AVECLYRSEQ QLPPPYRERY RSVFSFNRAI
LHAFDNGVWP EPPMLV
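
The sequences above are displayed in whitespace-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the exact counting method the database used for its reported percentage is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C; assumes a plain A/C/G/T sequence."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First displayed row of the gene sequence: six blocks of 10 bases.
first_row = "ATGCATAGCG CAAGACGACA AAAGCGATGT CTGAACAGGT ATAATACCAT CACACGCAGA "
dna = clean_sequence(first_row)
print(len(dna))  # 60
print(round(gc_fraction(dna) * 100))  # GC percentage of this row only
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` gives a value that can be compared with the GC content reported in the Gene Information section.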