Gene RoseRS_1735 details


Gene Information

Locus tag: RoseRS_1735
Symbol:
ID: 5208692
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 2135835
End bp: 2137025
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 59%
IMG OID: 640595341
Product: putative transcriptional regulator
Protein accession: YP_001276075
Protein GI: 148655870
COG category: [K] Transcription
COG ID: [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen
TIGRFAM ID:
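
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2135835
end_bp = 2137025

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein length.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1191 bp) and Protein Length (396 aa).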


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.256151
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.15639
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATATGT GGGAACTCCA ACGGCGGATT GCGCGCTGGG AAGATATACA TACTGAGTTC 
AAAGAGCAGG ATGTCCATAC CGATGACATT GCTGCGGCGT TGGTGGCTTT TGCCAACACC
GATGGCGGAC AGTTGATCTT CGGCATCAAT CAAAACCGGG CCATTATCGG CGTTGATGAC
CCTGATCGCC TGATGCAGCG CGTTGATCAG ATTGCCTGGA ACAACTGCGA GCCGCCGCTC
ACCGTCCTGC AAGAAACCAT TCGCAGCGAG GAAGGCCGCG TCGTGGTGGT TGTCAACATC
CCTAAAGGGG ATCAGCGCCC CTATCGTACC ATCAGAGGCG ACTACTTCAT ACGCACCACC
TCGGGACGCC GACGGGCTTC CCGGCAAGAA CTGCTCCGCC TGTTTCAATC GACGGAGAGT
CTCTATTACG ATGAGACCGT GGTCTGGCGC GCCACGTTAC GCGATCTGGA CGAACAGCGT
TTTGCCGATT TCTTCCGGCG GTCCTATAAC CGCGAGATCA CGTCAGAGCA AGAAACAGAG
CGCCTGATGA AAAACATGCG CTTGCTGGAA GAACGTGAGG GCGCATGGCG TCCCACACTG
GCGGGCCTGC TCTGCTTCGG ACGAGAGCCG CAGCGATTTC TGCCGTATGC GCAGATCAGC
GCTGCCCGCA TCCCCGGTGA GACGCTGGCG CTGGCGCCTT CCGATGCCAG GACGATCGGC
GGCACGTTGT TCGACATGCT GGAAGATGCC GCCCGCTTTC TGCGGATTCA TCTGCGCCGC
CCGCACGTCA TCCAGGGATT TGAGCCTGAA GAACGCCCGG AGATCCCCGA AGAAGCCTTG
CGCGAGTTGC TGGTCAACGC GCTGGTGCAT CGCGATTACA CCGTCACTTC TCCGATTCGC
GTCTTGATCT TCGATGATCG CATCGAAATC CGCACACCGG GCAACCTGCC CAACACAGTT
ACGATCGAGG CAATTCTTCT GGGCGCTGCG CATGTTTTGC GCAATCCCAT CATCTACACC
ATGTTCAGTC GCGCCGGACT GGTCACTCAC CTCGGCAGCG GCGTGTTGCG CGCCAGACAA
CTCATTGAGC AGGACGCGCG CGCCACACTG CGCCTGGAAG TTGTGGCGAA CGAGTTCGTG
GTTTCTGTTT CCCGTCCCGA AATGTGGCAT GGACCGGGCG GACAGCAATA G
 
Protein sequence
MDMWELQRRI ARWEDIHTEF KEQDVHTDDI AAALVAFANT DGGQLIFGIN QNRAIIGVDD 
PDRLMQRVDQ IAWNNCEPPL TVLQETIRSE EGRVVVVVNI PKGDQRPYRT IRGDYFIRTT
SGRRRASRQE LLRLFQSTES LYYDETVVWR ATLRDLDEQR FADFFRRSYN REITSEQETE
RLMKNMRLLE EREGAWRPTL AGLLCFGREP QRFLPYAQIS AARIPGETLA LAPSDARTIG
GTLFDMLEDA ARFLRIHLRR PHVIQGFEPE ERPEIPEEAL RELLVNALVH RDYTVTSPIR
VLIFDDRIEI RTPGNLPNTV TIEAILLGAA HVLRNPIIYT MFSRAGLVTH LGSGVLRARQ
LIEQDARATL RLEVVANEFV VSVSRPEMWH GPGGQQ
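
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is an illustrative sketch, not the database's own method: the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, which assigns the same amino acids to internal codons as translation table 11 (the tables differ in permitted start codons, which does not matter here because this gene begins with ATG). Only the first line of the gene sequence is used as input; the full sequence can be pasted in the same way.

```python
from itertools import product

# Standard codon table built from the usual TCAG ordering.
# Assumption: sufficient for table 11, whose internal codon assignments match.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = clean_sequence(
    "ATGGATATGT GGGAACTCCA ACGGCGGATT GCGCGCTGGG AAGATATACA TACTGAGTTC"
)
first_peptide = translate(first_line)
print(first_peptide)
print(round(gc_fraction(first_line), 2))
```

The translated fragment corresponds to the first 20 residues of the protein sequence (MDMWELQRRI ARWEDIHTEF). Applying `gc_fraction` to the full gene sequence gives a figure that can be compared with the GC content field in the record.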