Gene RoseRS_3771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3771 
Symbol 
ID5210753 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4715921 
End bp4719040 
Gene Length3120 bp 
Protein Length1039 aa 
Translation table11 
GC content60% 
IMG OID640597367 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_001278075 
Protein GI148657870 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000191257 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAATGCTG CTGATCCCCC GCAATCGACG CACGTTGTAC GGCGCATCAT ACTCAGTTTC 
ACCCTGACAA TCGGCCTGAT CGCCGTCATT GCATTGATCT CCTACGGCTG GACACAACTC
CTCTTTGCCG CTGAACGGGA GCGTGTGGAT CTGGTTGTCG CCAGCACGCG ACAACACGCC
CTGTCACAGC GTATCGCACT GTCGGCTGAG CGCCTGATGG ATACGTCTGA CCCGGAAGAT
ACTGCCCGGG CGCGCGTGGC GCTGAGCGAA GCAATCGATG AGATGACGCA GCAGCATCAG
AACCTGATCC GCGCTGTCGC ATCGCTGCCG TCTGATGACC CACATGCACA GGCGATCCGT
CGTCTCTATT TCGATCCGCC GACGGCGCTC GACGCGCGCG TGCAGGAGTA TCTGGAGCAT
GCGCAGCGCC TGCACGACGA TCAGAACCCG CAACGCTCCG ATCTGCTGGC GATCCGGCAG
GCAGCGCTGA ACGATCTGCC GCACCTCTTC GACACGGCAA CCCGTCTGTA CAGCGACGAG
CGTCGCACCC TTTTGTCCAC AATGGATGCC ATGCATGCCG TCGTCTTTGG CATCGTGCTG
ATAGTGCTGA TGCTGGAAGG CGTCTTCATC GTGCGACCGA TGGTGCGACA GACGCAGCAG
TACATCGCGC AGCGCGACGA AAGTGAAGCG CGCCTGCGCG CGCGCGAAAA AGTGACGCGC
GCCCTGTACG ATATAACCTC AACCACGCAG ATGGATCATC TTCAGAAGGT GCAGGCGTTG
CTTGCCATGG GATGCGACTA CTTCCGCATG ACTACCGGCA TGCTCACCCG GATCGACGGT
GAAGAACTGG AGGTCGTTGC AGCGCATCAA CCTCCTGAGC ACCTGGCGCC CGGTCAGCGC
TTTGCGCGCG CCGACAGTTA CTGCACCGCC GTTCTGGAAT CCTGCACGCC GGTTGGCATC
AACCACGCCG GACAATCTGC CTGGCGCGAT CATCGCTGCT ACGCGCTTCA GCGCATGGAA
GCATACATCG GCGCACCGGT GCGGATGCGG GGCGTGACGG TCGGCACCCT TTGCTTTGCC
AGCGCCACAT CCCGTCAGAC GCCATTCACG GATGGAGACT ACGATCTGAT TCGCCTTATG
GCGCAATGGA TCGGTAGTGA ACAAGAACGC TTGCAAACCG AAGCCGCGCT GCGCGAGAGC
GAGGAGCGCT TCGCTCTGCT TGCAAGCGTC ACCACCGAAG GGGTTATTAT CAGTGAGCAG
GGGATCATTG TTGACGCAAA CGCGGCTGCC AGCACGCTGC TCGGTTGCCC GCTCGAACAA
CTGCGCGGCA GATCGGTGTT CGAATTCACA ACTCCTGAAG GGCGAGAAAA AGTCGCTCAC
GCGCTTGCCA CCGGCTATGA TTGTCCCTAT GAGGTGCTTG CCCGCCGCAT TGATGGAACC
CTCTTTCCCG CCGAGGTAAC CGGACGGAAC ATCCCCTACC ATGGACGGAC GGCGCGCGTG
ACGACTATCC GCGACATGTC GCGGCAGCGT CTGGCGGAAG CCGCTCTGCG CGCCAGTGAA
GAACGCTTCC GCCAGTTAGC AGAGAATGTC AACCAGGTCT TCTGGATCTG TACCCCGGCG
CTTGATCAGA TTCTGTATGT CAATCCAGCG TATGAACGCA TCTGGGGACG ATCCTGCGAC
AGTCTGTATG CGCAACCGGC GTCGTTGTTC GAGGCGATTG TTCTGGAGGA TCGCGAGCGC
GTTCTTGCAC TCCACAATGC AGAATATCAC CGTGGATACA GTATCGAATT CCAGATTCGG
CGGGATGATG GTCAGCCACG CTGGATTCTG ACCCGCGCTT TCCCGGTGAT GAACGAAGCA
GGGGTCATCT ATCGCATTGC AGCTATCTCG GAGGATGTGA CCGGGCGCAG GCAGGCGGAG
GAAGAACTGC GTGCGACACT GGCAGCGCTC GAAGCGCAAT ACCAGGCAGC AGATCGCGCC
CAGAGCGAGA TGCGCGCAAT TCTCGACGCT TCCAGCGAAG CGATCGCACT GCTGGCGCCT
GATGGTACGT TCCTGACGGT CAATCGCCGT TTCTTCGACA TGTTCGGCAC GACCGCAGAA
CAGGCGCTCG GACATCGCCT GTCAGATATG CGCGCTGCCA TCCGGTGGAT CTTTGACGAT
GCCGACGAAT TGTACGTTCG CATGTGCCAC GCCCTTCAGG ACACTCAGAC TATCTTCCGC
GAACGGGTGT CGCAGCGCAA ACCACAGCAG CGCGAACTGG CGATCTTTTC TTCGCCGGTG
TGGACGGCCA ACCAGACGCA CCTTGGACGC CTTTTCGTCT TTCGCGATGT CACGCACGAA
CGCGCTGTCG AGCGGATGAA ATCTGAATTC GTGGCGATGG TGTCGCACGA GTTGCGCACA
CCGCTGACCT CGATCAAAGG GTATGTGGAT ATGCTGCTCG ACGGCGATGC CGGACCGCTT
GCCGATGAGC ACCAGGAACT TCTGCGCATT GTCAAATCGA ACGCCGATAG ACTGCTGCTG
CTGATCAACG ATCTGCTCGA CATGTCGCGG ATCGAAGCAG GGAAACTGTC GCTCCACCGC
ATCCCGCTCG ATCTCCGCCC GCTCATCCGA CAGGTCGCCG CCACGATGCG ACCACATCTC
GACGCGAAAC AGCAACGCCT GACCCTCGAT CTGCCGGAGA CTCCACCGGA TGGTTCAGCT
CCGCTCATCG CAGGCGATGC AGCGCGTTTC CATCAGATTC TGACCAATCT TCTCTCCAAC
GCAATCAAGT ATACTCCTCC AAATGGCGAG ATGACGATCC GCCTGACGGC AGAACCACCC
TGGCTCTGCA TCACCGTCCA GGATACCGGC ATCGGCATGA CGCCCGAAGA ACAGGACCAT
ATTTTCGACC GCTTCTACCG CGCACGCAAC CGCGCAACTC GTGAAACCGG CGGAACAGGA
CTCGGTCTGG CAATCACCCG TTCTCTCGTT GAACTGCATG ATGGGCGCAT CACCGTCGAA
AGCCAGCCGG GGAAAGGTTC GACATTCCGT GTCTATGTTC CGCTGTTGGA GTATGCGGAT
CAGCACGACA ACGCCCTGAC GGCAGCGTGG ACCGATGGAG AGGAAGAGCG TGATGGATGA
 
Protein sequence
MNAADPPQST HVVRRIILSF TLTIGLIAVI ALISYGWTQL LFAAERERVD LVVASTRQHA 
LSQRIALSAE RLMDTSDPED TARARVALSE AIDEMTQQHQ NLIRAVASLP SDDPHAQAIR
RLYFDPPTAL DARVQEYLEH AQRLHDDQNP QRSDLLAIRQ AALNDLPHLF DTATRLYSDE
RRTLLSTMDA MHAVVFGIVL IVLMLEGVFI VRPMVRQTQQ YIAQRDESEA RLRAREKVTR
ALYDITSTTQ MDHLQKVQAL LAMGCDYFRM TTGMLTRIDG EELEVVAAHQ PPEHLAPGQR
FARADSYCTA VLESCTPVGI NHAGQSAWRD HRCYALQRME AYIGAPVRMR GVTVGTLCFA
SATSRQTPFT DGDYDLIRLM AQWIGSEQER LQTEAALRES EERFALLASV TTEGVIISEQ
GIIVDANAAA STLLGCPLEQ LRGRSVFEFT TPEGREKVAH ALATGYDCPY EVLARRIDGT
LFPAEVTGRN IPYHGRTARV TTIRDMSRQR LAEAALRASE ERFRQLAENV NQVFWICTPA
LDQILYVNPA YERIWGRSCD SLYAQPASLF EAIVLEDRER VLALHNAEYH RGYSIEFQIR
RDDGQPRWIL TRAFPVMNEA GVIYRIAAIS EDVTGRRQAE EELRATLAAL EAQYQAADRA
QSEMRAILDA SSEAIALLAP DGTFLTVNRR FFDMFGTTAE QALGHRLSDM RAAIRWIFDD
ADELYVRMCH ALQDTQTIFR ERVSQRKPQQ RELAIFSSPV WTANQTHLGR LFVFRDVTHE
RAVERMKSEF VAMVSHELRT PLTSIKGYVD MLLDGDAGPL ADEHQELLRI VKSNADRLLL
LINDLLDMSR IEAGKLSLHR IPLDLRPLIR QVAATMRPHL DAKQQRLTLD LPETPPDGSA
PLIAGDAARF HQILTNLLSN AIKYTPPNGE MTIRLTAEPP WLCITVQDTG IGMTPEEQDH
IFDRFYRARN RATRETGGTG LGLAITRSLV ELHDGRITVE SQPGKGSTFR VYVPLLEYAD
QHDNALTAAW TDGEEERDG