Gene RoseRS_3059 details


Gene Information

Locus tag: RoseRS_3059
Symbol:
ID: 5210027
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 3842132
End bp: 3844057
Gene Length: 1926 bp
Protein Length: 641 aa
Translation table: 11
GC content: 60%
IMG OID: 640596651
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_001277373
Protein GI: 148657168
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
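
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3842132
end_bp = 3844057

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1926 641
```

Under these assumptions the computed values agree with the listed Gene Length (1926 bp) and Protein Length (641 aa).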


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00716616
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATGGGGT CTGGTGTGGA TGCAGCTGAA CGGATCAGTC TGCTCCTCAT CGAGGACGAT 
GAGTCGCATG TTGCGCTGAT CGAGCGCGCC TTCGAGTCAT GGCGATCATT GTTCGATCTG
TCGGTTGTGC ACACGTTGCA GGAGGCATAC GCGCTGCTCT CCGGTGATTC GTCCGACTTC
GATCTGATCG TGTGCGATTG GCGATTGCCC GACGGCGAAG GGTTGGAACT GCTGGACTTC
AACCGTGCGC TGCCGGTCAT TCTGATGACC GGGTATGGCG ATGAACGGGT GGCAGTCGAA
GCGATCCGTT CTGGCGCGCT CGATTATATT GTGAAGAGTG ATGCCGCATT CGCCGATCTG
CCCCATGTGG CGCAGCGTGC CATCCGCCAG TGGCGGGCGA TCCAGGCGCA ACGTCGCGCC
GAACAGGAGT TGCGCGAAAT TGAGGCGCGC TACCGCCTCA TTACCGAGAA CACGTCTGAT
CTGATCGCCA TTCTCGACGA CCAGCACCAC TTCCAGTACA TCAGCCCTTC GGTCACGACG
CTGTTGGGGC ATACCGCAGA AGCGTTGGTC GGACAGGATG CATTCTTCCT TATCCATCCC
GATGACCTTC CGTACAGCGA GGAGTACTGG TGCACCATGC TCCAGCGTAC CCGCGCCACT
GCGACGTTTC GCTATCGTCA TGCCAGCGGC GCCTGGCGCT GGATCGAATG CAATATCAGG
GCGGTCGAGC AAGGCGGCGG GTTGACGGCG ATTGTCGTGG GGCGCGATAT TACCGAGCGC
CGTGAACTGG AAGAACGTTT ACTCCAGATC CAGAAAATGG ATGCGCTTGG TCGGTTGTCG
GGCGGCATTG CGCACGATTT CAACAACATG CTGGCAGTGA TTGCCGGTTG CACCGAACTG
GCGCGTCAAC TGATCCCTGA TGACCACCCG GCGACCGGAG AACTGATCGA AATCCAGCAT
GCGACCGAGC GCGCAACGGC GCTGACCCGT CAGCTGCTGG CTTTTGCGCA CCGACAGCGG
TTCGAGCCGC GCCCCATCGA TCTGAACACG CTTATTCTCG ACATGCAGAA ACTGCTGCGT
CGCCTGATCC GCGAGAATAT CACCCTGAAC ACCCGTCTTG CCCCCGATCT CTGGCTTGTG
CGCGCCGATC CTGGTCAGAT CGAGCAGGTG CTGGTCAATC TGGCGATCAA CGCGCGTGAT
GCGATGGTCG ATGGCGGTGT GTTGACGATT ACAACCGAAA ATGCTGTGAT CGACGATGCG
TTCGACAACC GCCATCCTTC GCTGAATCCG GGCATGTATG TCCGGTTGAC GGTCAGTGAC
ACCGGCGTCG GCATGGACGA AGAAACGCGC AGGCGCGCCT TCGAGCCGTT CTTTACCACG
AAGAAGCCCG GCGAGGGAAC CGGTCTCGGT CTGGCGACCT GCTACGGGAT TGTCGTCCAG
CACGGCGGCG CCATCGAACT GACGAGCGAA CCGGGATGCG GCACAACGGT CATTATCTAT
CTGCCGCGCG CCTATCCCTC CGAAGAGACA GTCGCCCTGG AGACTGGCGA GCACATCGAT
CTCAAAGGCT CGGAAACGAT CCTGCTGGTC GAGGATGATC CGGCGGTGCG CACGCTGACA
GCGCGGGTGC TGCGCACCCA TGGGTATACC GTTCTGGAAG CGGGTGACGG ACATGAAGCG
TTGACGCTCG CCGGTGAACG TCCGATCCAC CTTCTCTTGA GTGACCACGT TATTCCGCAT
ATGAGTAGTG AGTCGCTGGC ACACTATCTG ACCGCTCTGA TCCCGCAGAT GAAGGTGCTC
TTTATGAGCG GGTATGTCGT TGAGCCGCGC CCCGATGTTC AGGCGCTCCA GGAAGCGACG
GTTCTTCAGA AGCCATTCAC TCCAACTGCG CTCCTTCAGA GGGTTCGCGC TGTCCTGGAT
TCGTGA
 
Protein sequence
MMGSGVDAAE RISLLLIEDD ESHVALIERA FESWRSLFDL SVVHTLQEAY ALLSGDSSDF 
DLIVCDWRLP DGEGLELLDF NRALPVILMT GYGDERVAVE AIRSGALDYI VKSDAAFADL
PHVAQRAIRQ WRAIQAQRRA EQELREIEAR YRLITENTSD LIAILDDQHH FQYISPSVTT
LLGHTAEALV GQDAFFLIHP DDLPYSEEYW CTMLQRTRAT ATFRYRHASG AWRWIECNIR
AVEQGGGLTA IVVGRDITER RELEERLLQI QKMDALGRLS GGIAHDFNNM LAVIAGCTEL
ARQLIPDDHP ATGELIEIQH ATERATALTR QLLAFAHRQR FEPRPIDLNT LILDMQKLLR
RLIRENITLN TRLAPDLWLV RADPGQIEQV LVNLAINARD AMVDGGVLTI TTENAVIDDA
FDNRHPSLNP GMYVRLTVSD TGVGMDEETR RRAFEPFFTT KKPGEGTGLG LATCYGIVVQ
HGGAIELTSE PGCGTTVIIY LPRAYPSEET VALETGEHID LKGSETILLV EDDPAVRTLT
ARVLRTHGYT VLEAGDGHEA LTLAGERPIH LLLSDHVIPH MSSESLAHYL TALIPQMKVL
FMSGYVVEPR PDVQALQEAT VLQKPFTPTA LLQRVRAVLD S
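
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the gene sequence relates to the protein sequence, the snippet below strips the block spacing and translates codons until the first stop codon. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in the set of permitted start codons, which this sketch does not model). The helper names `clean` and `translate` are hypothetical and exist only for this illustration.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Assumption: amino acid assignments follow the standard code,
# which translation table 11 also uses for elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_block: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq_block.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as listed above.
first_line = "ATGATGGGGT CTGGTGTGGA TGCAGCTGAA CGGATCAGTC TGCTCCTCAT CGAGGACGAT"
print(translate(first_line))  # MMGSGVDAAERISLLLIEDD
```

The printed residues match the first two blocks of the protein sequence (MMGSGVDAAE RISLLLIEDD). Passing the full gene sequence through the same function should give the complete protein, since the final codon TGA is a stop codon.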