Gene RoseRS_3762 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3762 
Symbol 
ID5210744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4704645 
End bp4705721 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content57% 
IMG OID640597358 
Producthistidine kinase 
Protein accessionYP_001278066 
Protein GI148657861 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00011366 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000145112 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
GTGACGGTGA ATGATACTGC GCCATCCCTC AAAGAACTCT ACGATAAATT GAATGCCGAT 
CTGGAAAAAA CGCGCCGGGA ACTGATCGAG ATCGAAGCGC TTCTGCGGCA GACATCCAAC
GAGGTCGAAA AACTCCAGCA GCGCGAACTC ACCGTCTCGA ATCGCCTGCG CGATCTGGAT
GTCAATGTTG ATCGGTACAG CAAGGCGGAC ATTAAGAATT TTTATGCTTC GGCGCAGGAG
GTGCAAATGC GTCTCCTGAC GATGCGCAGC CAGCTTGAGC AGTTGCAATA CCGTCAGCAG
GCCGCCAGGC AACGGCAGAG TCAGGTGTTT GAACTGATCA CCGCGCTCGA ACCGTTGCTG
GGTGTTGCTG CACCTGCCGG CGGCGCTCCC GGCATGATCG GAAGCGACCA GTTGATTGCC
GATATTATTC AGGCGCAGGA AAAAGAACGC TTGCGCATTT CGCTTCAGAT GCACGACGGT
CCGGCGCAAT CGATGAGCAA CCTTGTGCTG CGCGCCGAAA TCTGCCAGCG TCTGCTCGAT
CGCGATGTTG AGATGGCGCG CGCCGAACTT GGCGCCCTCA AAAATGCCAT CAATGCCACC
CTCCAGGACA CGCGCCGCTT CATCTTCGAT CTCCGCCCCA TGATCCTTGA TGATCTCGGT
CTTGCGCCGA CGTTACGGCG CTATGTTCAG CAGGTGAGCG AAAAGAACAA ACTCGACATC
AACCTGATGG TGCAGAATCT TGATATGCGC CTGCCTTCCC ACTACGAAGT CGCCATTTTT
CGCTTCATTC AAGAGGCGCT CAACAATGTC ATCAAGCACG CCAATGCCAC CCAGGCGCGT
GTGCTCGTCA GCGTGCGCGA TGATGTCGGT GGAACGCGCA TGATCCACGT ATCGGTCGAG
GATGACGGCA GCGGATTCCA CGTCAGCGAC GTGCTGGCTG ATGACAGCGG ACGGCGCAAC
ATGGGAATCG CCACGCTCCG TCAACAGGTC GAAACGCTGC TCCGCGGCGA GTTCGGCATC
GAAAGCGCCA TCGGGCGTGG TACGCGCGTC GAAGCGTTGA TTCCCCTGCC GATGTAG
 
Protein sequence
MTVNDTAPSL KELYDKLNAD LEKTRRELIE IEALLRQTSN EVEKLQQREL TVSNRLRDLD 
VNVDRYSKAD IKNFYASAQE VQMRLLTMRS QLEQLQYRQQ AARQRQSQVF ELITALEPLL
GVAAPAGGAP GMIGSDQLIA DIIQAQEKER LRISLQMHDG PAQSMSNLVL RAEICQRLLD
RDVEMARAEL GALKNAINAT LQDTRRFIFD LRPMILDDLG LAPTLRRYVQ QVSEKNKLDI
NLMVQNLDMR LPSHYEVAIF RFIQEALNNV IKHANATQAR VLVSVRDDVG GTRMIHVSVE
DDGSGFHVSD VLADDSGRRN MGIATLRQQV ETLLRGEFGI ESAIGRGTRV EALIPLPM