Gene Rcas_2849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2849 
Symbol 
ID5540338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3694858 
End bp3697743 
Gene Length2886 bp 
Protein Length961 aa 
Translation table11 
GC content61% 
IMG OID640894978 
Productprotein serine phosphatase with GAF(s) sensor(s) 
Protein accessionYP_001432938 
Protein GI156742809 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2203] FOG: GAF domain
[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.230006 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACACAAT CTGTGAGCGA TCCAATCGCC GAACTCCAGC GGGAGATCGA TGCGCTGCGC 
ATTGATCGAG AGCGCGACCT GCGGGCATTG AATGTGCTTC GCACCGTCAG CATCGCATGC
CTGGGGAAAC CCAACGCGCG CGCCATCTTC GAAGCCATGC ATCAGGCGCT CCTGCCTGTG
TTCCGGTACG ATGCGGCATA CTTCGCCGTC TGTGATCCCG GCAATCGGGA AACATTTCGC
GCCGCGTTGC TCGTTGATGA AGAGAGCGTC GAATATCTGG AACACACGCC GTATGGTCCG
CTGACCGGCA TGCTGGTGCA GCATCGTCAA CCATTGCTCT TTTACGACCT GCTGCACGAG
CGCACGCTGC TGACACCGCC GCCGGAACGA TTTGGCAATC AGCAGAAGCA TTCACGATCC
TGGATGGGGA CGCCGCTGCT GCTCGGCTAT GAGACGGTTG GGGTCATGTC GATCCAGAGT
TATCGGCCAG GAATCTACAA TCAGGATCAC CTCGATCTGT TGCAGCGGAT AGGCAACCTG
CTGGCGGTGG CGCTGGAGAA TGCGGCACTG GTGGAACAGC AACAGACGTT GAGCAACGAA
CTGACGGAGC GTATCGCGGC GCGCACCCGC GAACTGACCA TGCTGACGGC GCTCGCCGAA
GAACTCGCGT TCCAGCGCCC ACTCTCCGAG TTGCTGGATC GCGCGCTATC GCTCCTGGTG
GATCATCTTG CCATGAGTGG CGGCGCTGTG CGATTGCTCT CTTCCGATGG CGCGGAACTG
ACCCTCGCCG CCGCCTATGG GTTTCCTCCG GCGTATGCGG ATCAGTTGGC GCGAATCCCG
GTTGCGAGCA GTCTGATGCG CAGCGTCATT GAAGAACGTC GCCCGCTGAT CATTGATAAT
CTTGCGCAGA GCGAGCACTG GCAGCGTCTT GGGTTGCACT ACCGCTCGTT GATCGCCGTT
CCGCTTCAGA TCGGTGATCG GGCGCTGGGT TCCTTGCTGC TGGTCGATAC GCGCCAGCGC
GCCATTCAAG ACCAGGAGAT CGAATTTGTG CGCACCGTTG GGCATCAGAT CGCTATCGGC
ATCAAGACTG CGCAGTTGCT TGCCGAGCGC GAACGGCAGG TCGCCGAACT ATCGGCGCTG
AGCGACATTA GCCACGCGGC GAGCACCACG CTGGAACTGC GCGCGCTCCT GCGCCACATC
GCGCGCGCCC TCGCCCGCTT CATGCGCGCC GACGTTTTTT CCGTCGCAGT CTACGATGCT
GAACGAGACG TTATCAGCGA CGGCTTCTCG ATCGATGATG GCGAAGAGCA TTCCTTCTGG
CAGTTTCAAC CGCCGCCGTC GGACTCGTTG ACCGCGTGGG TGCTGCGCAA CCAACGCGCG
TTGCGTTTCG ATAACCTGCT CGACGAGATC GGACGTTACC CGGAACTGCG GTTGCACGCT
ATCGGTACAG ATCGCATGGC GCTGTCGTGG CTCGGTGTAC CGCTGATCGG GCGCGAGGGG
CAACCGATTG GCGTTCTCAC GGTGCAGGCA TACTCACCCT GTGCATTCGA TGATCGCGAT
GAGCGGTTTC TGGCGGCGGT CGCGCGCCAG GTGGCGTTGC ACGTGCAGAA TGTCATGCTG
TTCGCCGCCG AGCAGGAGGC GCGGCGTACC GCCGACACGC TGCGCGAAGT GGCGCGTGTA
TTGAGCGCTG CGTTCAGTTC GGGTGAAGTG CTGAACGTGA TCCTGCGCGA ACTGAAAAAG
GTCATTCCCT ACGATAGCGC CTCCATTATG TTGCGCGAAG GGAGGGTGCT GCGCGCCGCT
GCCGTTGCTC CTGAGCATAT CGCCACCCGA CTGGACGCAA TGGACCGTCC ACTGAATGAA
CCCGGCGCTG CTGTATGGGT TGTACACCAC AAACAACCAC TTTTGATCGA GGACGTCAGC
GTATCGCACA TCTGGGCGCC CAAACCGCAG ATCAGCAACA TCCAATCGTG GATCGGCGTC
CCATTGCTTG TCAAAGGTGA GGTGCTCGGC GTGCTGAACA TCGACTCAAC CCAAAAGGGG
CGTTTCACCC AGCGCGATGT CGAGGTGGCG CAGGTCTTCG CCGACAACGC CGCGATTGCT
ATCGAAAACG CCCGGCTGTA TGCTGAGTCG ATTGCGCGGC GCGAACAGGA GCTTGAGATC
GCGCGCCGTA TCCAGTCGAA CCTGTTCCCC CGCGAGCTGC CGCGGCGTCA GGGGATCGCG
CTGGCGGCGC GTTGCCTGCC TGCCCGTGAA GTCGGCGGCG ATTTCTTCGA CTGTTTTGCG
CTTGGCGAAC GCCGCTCGAT TGCGGTGATG ATCGGCGATG CGTCGGGCAA GAGTGTGGCA
GGAGCGCTCC TGATGGCGAT GGCGCGGTCG GTTGGACGCG CCGAAGCGTT CGATCATGAG
GAACCGGCGC TGGTGCTGCG CGAAACGAAT CGGAGCGTGG CGCACGATGT GCCGCGCGGC
ACTTTCGTGG CGCTCTGCTA TGCGACGATT GATCCGTCGG GGCGCATAGC GGTGGCGAAC
GCCGGACAGT TGACGCCGGT GCTGGTGCAT GCCGACGGTG CTATTCAGTA TCTCTATCCG
CCAGGACCAA CCCTGCCGCT CGGCATTCAG TCCAATATCG GGTATGAAAC GCTTGCCATC
GATCTGGCGC CGGGCGATAC GGTCGTGTTC TTTACCGATG GGCTGGTCGA GGCGCACAAT
GCAACTGGCG AGTTGTTCGG CTTCGAGCGC CTCGATGCGC TCCTGACGAA AGAAGCCGGT
CTGCCGCCCG ACGCCCTCAT CGACCGCATC ATCGACGAAG TGATAGCCTT CATGGGCGAT
ACGGTGCAAC ACGATGACAT GACCCTGGTT GTCGTGCAAT TCGGCGGGGA ACGGACGAAC
GATTGA
 
Protein sequence
MTQSVSDPIA ELQREIDALR IDRERDLRAL NVLRTVSIAC LGKPNARAIF EAMHQALLPV 
FRYDAAYFAV CDPGNRETFR AALLVDEESV EYLEHTPYGP LTGMLVQHRQ PLLFYDLLHE
RTLLTPPPER FGNQQKHSRS WMGTPLLLGY ETVGVMSIQS YRPGIYNQDH LDLLQRIGNL
LAVALENAAL VEQQQTLSNE LTERIAARTR ELTMLTALAE ELAFQRPLSE LLDRALSLLV
DHLAMSGGAV RLLSSDGAEL TLAAAYGFPP AYADQLARIP VASSLMRSVI EERRPLIIDN
LAQSEHWQRL GLHYRSLIAV PLQIGDRALG SLLLVDTRQR AIQDQEIEFV RTVGHQIAIG
IKTAQLLAER ERQVAELSAL SDISHAASTT LELRALLRHI ARALARFMRA DVFSVAVYDA
ERDVISDGFS IDDGEEHSFW QFQPPPSDSL TAWVLRNQRA LRFDNLLDEI GRYPELRLHA
IGTDRMALSW LGVPLIGREG QPIGVLTVQA YSPCAFDDRD ERFLAAVARQ VALHVQNVML
FAAEQEARRT ADTLREVARV LSAAFSSGEV LNVILRELKK VIPYDSASIM LREGRVLRAA
AVAPEHIATR LDAMDRPLNE PGAAVWVVHH KQPLLIEDVS VSHIWAPKPQ ISNIQSWIGV
PLLVKGEVLG VLNIDSTQKG RFTQRDVEVA QVFADNAAIA IENARLYAES IARREQELEI
ARRIQSNLFP RELPRRQGIA LAARCLPARE VGGDFFDCFA LGERRSIAVM IGDASGKSVA
GALLMAMARS VGRAEAFDHE EPALVLRETN RSVAHDVPRG TFVALCYATI DPSGRIAVAN
AGQLTPVLVH ADGAIQYLYP PGPTLPLGIQ SNIGYETLAI DLAPGDTVVF FTDGLVEAHN
ATGELFGFER LDALLTKEAG LPPDALIDRI IDEVIAFMGD TVQHDDMTLV VVQFGGERTN
D