Gene RoseRS_3817 details


Gene Information

Locus tag: RoseRS_3817
Symbol:
ID: 5210799
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4775600
End bp: 4776778
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 61%
IMG OID: 640597413
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_001278121
Protein GI: 148657916
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
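
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record. The function names are hypothetical and chosen only for this illustration.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp, has_stop=True):
    """Residue count for a CDS of the given length.

    Assumes the CDS is a whole number of codons and, when has_stop
    is True, that the last codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


length = gene_length_bp(4775600, 4776778)
print(length)                     # 1179
print(protein_length_aa(length))  # 392
```

Under these assumptions both values agree with the Gene Length (1179 bp) and Protein Length (392 aa) given in the record.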


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.59485
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.347226
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATGGA TCATTGCCAC CCTCGTGATT GCGCTCGGAC TGTCGATCGT CTGGGGATGG 
CGTCAGCGGC AGATTGCATT ACAATCGCCG CCGCCCCCTC CACAGCATGA TTCATTTTCT
CCGCTTTTTC ACGCCGCCGC CCATGCGTTC GATGATGGAG TTGTGCTGGT CGATGCAGAT
CGTCGCGTTC GCTACATTAA CCGGCAGGCC GAAGAATTGT TGAACCTCAG TCATACCATC
AGCATCGGGC AGGGACTGAT CACGCTTGCG CGTGATTACC AGGTCGATGC TATGGTCGAA
GAGGCGATTG CGTCCGCCGA ACCCCGCGAA AGTATCTTTC AACCGCTTGG GCGCCAGCGA
ACGCTGCGCC TGCGCGCAAT TCCGCTCGAC GACGGCGCAC GCGGCGCGTT GCTGCTGGTG
CGCGATGTGA CGCAACTGAG CCTGCTGGAA CGCGCGCGGC GCGATCTGGT CGCCAACGTC
TCTCACGAAC TCCGCACACC GCTGGCGTCG ATGAAACTCC TGGTCGAAAC GCTGCAATCA
GACCCGCCGC CGCCGGTGGC GAAACGCATG CTGGATCAGA TCGCGCAGGA AGTTGATGCC
ATTACCCAAC TTGTCGAGGA GCTGCACGAA CTGGCGCAGA TCGAGTCAGG GCGCGTGGCA
CTCCAGTTAG TACCCGCGCC GCTGGCACCG CTGGTAGCGC GCACCATTGA GCGCATCCGC
CCGCAAATGG ACCGAAAACA TTTACAGATG ACGACGGATG TCCCTGATGA TCTGCCCCAG
GCGCTGATCG ATTGCAATCG TGTTGGTCAG GTGCTGCTCA ATCTGCTTCA CAACGCCTCG
AAATTCACCC CCGAAGGCGG GTGCGTATCG ATTGCCGCCC GAGTGATTAC GGTCGGCGAC
GGTTCCCCCC CGCCGCCTGG ACTACCACCA TCGCATGCCC CAGGTCAGTG GCTGCTCCTG
TCAGTCACCG ATAACGGCAT CGGCATTCCA GCAACCGACC TCTCGCGGAT TTTCGAGCGC
TTCTACAAGG TTGATCGTGC GCGAACGCGC AATGCTGGCG GCACAGGGTT AGGACTGGCA
ATCGCCAAAC ATCTGGTCGA AGGGCACGGC GGGCGCATCT GGGCGACCAG CATCGAAGGC
GAGGGCAGCA CGTTTTACCT GACATTGCCG GTTGCATAG
 
Protein sequence
MEWIIATLVI ALGLSIVWGW RQRQIALQSP PPPPQHDSFS PLFHAAAHAF DDGVVLVDAD 
RRVRYINRQA EELLNLSHTI SIGQGLITLA RDYQVDAMVE EAIASAEPRE SIFQPLGRQR
TLRLRAIPLD DGARGALLLV RDVTQLSLLE RARRDLVANV SHELRTPLAS MKLLVETLQS
DPPPPVAKRM LDQIAQEVDA ITQLVEELHE LAQIESGRVA LQLVPAPLAP LVARTIERIR
PQMDRKHLQM TTDVPDDLPQ ALIDCNRVGQ VLLNLLHNAS KFTPEGGCVS IAARVITVGD
GSPPPPGLPP SHAPGQWLLL SVTDNGIGIP ATDLSRIFER FYKVDRARTR NAGGTGLGLA
IAKHLVEGHG GRIWATSIEG EGSTFYLTLP VA
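
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the following code removes the whitespace, computes a GC fraction, and translates codons up to the first stop. It assumes the usual codon assignments, which translation table 11 shares with the standard code (the two differ in the permitted start codons, which does not matter here because this gene begins with ATG). The names `clean`, `gc_fraction` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGGAATGGA TCATTGCCAC CCTCGTGATT"))  # MEWIIATLVI
print(translate("GTTGCATAG"))                         # VA
print(gc_fraction("ATGGAATGGA"))                      # 0.4
```

The two translated fragments match the start (MEWIIATLVI) and end (VA) of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.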