Gene RoseRS_0520 details


Gene Information

Locus tag: RoseRS_0520
Symbol:
ID: 5207457
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 653276
End bp: 654946
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 59%
IMG OID: 640594140
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_001274894
Protein GI: 148654689
COG category: [T] Signal transduction mechanisms
COG ID: [COG2205] Osmosensitive K+ channel histidine kinase
TIGRFAM ID:
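
The start, end, gene length, and protein length listed in this section can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a gene length that includes the stop codon; the function and variable names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information section of this record.
start_bp = 653276
end_bp = 654946
gene_length_bp = 1671
protein_length_aa = 556


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare a derived value with the value stated in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # True True
```

Under these assumptions, 654946 - 653276 + 1 = 1671 bp, and 1671 / 3 - 1 = 556 aa, matching the listed values.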


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.936608
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.731504
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCAGG CAATCGGTAC TGCGGTTTCC AAAACGCCAG CCGGCGCCTC ACCGCCACGT 
TTGCTGATCG TCGATGATGA ACAGCATATG TGCGATGTAT GCTCGCGCAC CCTGCAACGC
GCCGGCTACG ACGTGCTTGC CACCAACGAT CCGCTTGTCG CCATTGAGGC GCTCAACAAT
GGGCAGCACT TCGATCTGCT GCTGACCGAT ATCAAAATGC CGGCGATGAG TGGTCTCGAT
CTGGCGCGAA TCGCGCGCGA AAAAGACCCG GAGATTGCCA TTATCATCAT GACCGGGTAT
GCGTCGCTTG AAAATCTCCA CCAATCGGTG CGGCGCGGCG TGGCGGATTT TCTCGCAAAA
CCGTTCGAAC TCGAGCAACT GCGCCTGGCG GTCGATCAGG CATTGCACAA ACGCGCGATG
CAGCAGGACA ACCTGCGATT GCGGACGCTC GAACAACTGC TCGCGGTCAG CGAAGCGCTC
AGCGCCACGC TAGAACTGTC GGAACTGGTC CATACCGTCC TCGATGCCAT GATCGAACGG
AGCGGGTTTC AGACCGGGTT CCTGCTGCTG GGGGATGAAC CTGCAATGTT GCATCTTGCA
GCAACAACAC CAGAGACGGC GCACCTGACC GACGAGGGAC GCGCGCTGGC TGAGCGCGCA
TTCACCCTCC AGCAAACACG CTATGAAGAG ACTGCGTGCT ACGGAACACA GCCTGATCAG
ACACTGCACG CAGCGCTCGC CGTGCCGCTG CGCGCTCAAG GGCGAGTCAA CGGCGTGGTT
GTCCTGTGCA ACCAGCACTC CACCACCCTG CGACCGGGCG TCCAACAGGG GTTAATGTTG
CTCGCCAACC AGGCTGGCGC CGCGTTGCGC AACGCAACCC TCTACCGTCA ACTCGATGAA
GCGTATCAGC GCCGACAGGA ACTCGATCAC CTCAAAAGCG AGTTCATCGC CATCGCCTCG
CACGAACTGC GCACTCCACT CTCGATTGTA CTTGGATATA CTATGATGGT CCGTGATCAG
GTCGAGGGAA GTCAGCGCGA CTACCTTCAG CGCGTAATGG AGAGCGCCCA ACGTATCAAA
GCGATCGTCG ATGATATGGT CAATCTGCAC TACATCGATA CTGGTGAGTC ACAACCGCAA
CTCGCGCTGG TCGATCTGGG CGAGACGATC TACCAGACGG TGCAGAATCT GCGCAGTGCG
GCGGAACTCG CGGGGCAAAC GATCACGGTC AACCTGCCCG ATGCGCTCCC GCCATTCCTG
ACTGATCGCG AAAAGGTGAT GCTGGTGCTC AATCATCTGT TGTCCAATGC GATCAAGTTT
ACGCCGCAGC ACGGACGGAT CACGATTACG GTGAGCATAC GGCAGTATCA CGAACTGGAG
TCGCTGCGCG AAGTATCGGT CATCACGCCT TCGGCGTCGC TTCGCGCATT ACCCTGGGTT
GTGACCGATG TCAGGGACAC CGGGATCGGC ATCCCGATGC ACGAACGAAC GCGGATCTTC
GAACGTTTTT ATCAGGTCGG CGACTCGCTT ACGCGCGAGC GTGGCGGCGT CGGGCTTGGA
TTGGCGCTGG TGCGCGAGTT GATAGCCTCA CTGGGCGGGG CGGTGTGGGT TGTCAGTCGA
GAGGGTGAAG GCAGCACCTT CTCGTTTGCG CTCCCCCTCC GCCGGACTTA G
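
The gene sequence is listed in space-separated blocks of 10 bases, so it needs to be cleaned before any calculation such as GC content. The following is a minimal Python sketch of that step; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as a short demonstration.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a grouped listing and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence in this record (six blocks of 10 bases).
first_line = "ATGAGTCAGG CAATCGGTAC TGCGGTTTCC AAAACGCCAG CCGGCGCCTC ACCGCCACGT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 1671 bp listing through the same two functions gives a figure that can be compared with the GC content stated in the record.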
 
Protein sequence
MSQAIGTAVS KTPAGASPPR LLIVDDEQHM CDVCSRTLQR AGYDVLATND PLVAIEALNN 
GQHFDLLLTD IKMPAMSGLD LARIAREKDP EIAIIIMTGY ASLENLHQSV RRGVADFLAK
PFELEQLRLA VDQALHKRAM QQDNLRLRTL EQLLAVSEAL SATLELSELV HTVLDAMIER
SGFQTGFLLL GDEPAMLHLA ATTPETAHLT DEGRALAERA FTLQQTRYEE TACYGTQPDQ
TLHAALAVPL RAQGRVNGVV VLCNQHSTTL RPGVQQGLML LANQAGAALR NATLYRQLDE
AYQRRQELDH LKSEFIAIAS HELRTPLSIV LGYTMMVRDQ VEGSQRDYLQ RVMESAQRIK
AIVDDMVNLH YIDTGESQPQ LALVDLGETI YQTVQNLRSA AELAGQTITV NLPDALPPFL
TDREKVMLVL NHLLSNAIKF TPQHGRITIT VSIRQYHELE SLREVSVITP SASLRALPWV
VTDVRDTGIG IPMHERTRIF ERFYQVGDSL TRERGGVGLG LALVRELIAS LGGAVWVVSR
EGEGSTFSFA LPLRRT
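
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). The following is a minimal Python sketch of that translation using only the standard library. It assumes the codon assignments of table 11, which are the same as those of the standard code; table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAGTCAGG CAATCGGTAC TGCGGTTTCC"))  # MSQAIGTAVS
```

The output matches the first block of the protein sequence listed in this record.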