Gene Sare_4266 details

Gene Information

Locus tag: Sare_4266
Symbol:
ID: 5705771
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4839574
End bp: 4842669
Gene Length: 3096 bp
Protein Length: 1031 aa
Translation table: 11
GC content: 71%
IMG OID: 641273685
Product: histidine kinase
Protein accession: YP_001539038
Protein GI: 159039785
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
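
The coordinates, gene length, and protein length listed above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4839574
end_bp = 4842669

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is
# not counted in the reported protein length.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 3096
print(protein_length_aa)  # 1031
```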


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.159285
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCACCG GACCGACGAC CCTGCCCGCA GGCGGCGACA TTGACCAGCC GAAGCGGCAC 
TGGCTGCCCC GGCTTCGCAA CGCACGAATC CGCTCGAAGC TCGCGTTGAT CCTGGTCGTG
CCGGTCGCCG CGGTCATCGC GCTGGCAACT GTTCGTCTGG TCACAGTAGG TGAGGGCGCG
TATGACGCCA CGCGGGCCAA AGCGCTGACC GAACTCTCCA TCGACATCTC CGCACTCGCC
CATGACATGC ACGCCGAACG GATGGCGGCT ACGGTCTATC TCGCCTCGAC GAAAGAGACC
GCCGACGCCT ACAACCTGCG GGTGCGCAGC ACCGACGAGC GGGTGCAGGC GTACCGGGAG
GAACGCGAGC GGATTGGCGA GGTGCCTTCC GCCGTCAGTG ACCGGTTGGT CGCGATTGAC
GAGCACCTGA CGACGCTGGA CGCGGCCCGG CAACAGGTGC TGGACCGACG GCAGATGGCG
GTTTCCGAGG CGGCGTTACG GTACGGCGTC ATCCTGGCTG ACCTCGTGGC GTACGGCGAT
GGCCTCGCTC AACTCCCCGG TGATGAGCAG CTGGCGGACG CCCGGCGGGC GGTCGCGGCC
TTCGGCCGCG CCAAGGCGGC GGTCGCCGAG CAGGAGTCCG TCGCCTACAC CGCGTTGAGC
GTCGGCAGCT TCGACGAGGA GCAGTACTCC TCCTTTGTGG CCACCTTGAC CGGTCAGCAG
GAGGCGTTGC TCGCCTTCTC ACTCGCGGCC AGCCCGAGTC AACGCTCGTT CGTGGACAGC
ACCGTCTCGG GTGACGCGGT CACCCTGGCC GACAAGGTCG CCGCGGACAT CACCCGCTCG
GTCGGGCAAC GGTCCCTGGT GAGCGCGGCG GACGCCAGTG CCGCCATCGG TGCCGTCAAC
GACCTCATGC GGTGGACCGA GGCCCGCCTT CAGGAGCGGT TGCTGGCCGA CACCGAACAG
ACCCGGGCGA ATGTCCTCCG GCAGGCGATC GTCGAGTCGC TGCTGGGGCT GTTGACTCTG
ATCATCGCCA TAGCCCTCGC CGTGGTGCTG GCCCGTTCGC TGAACCACTC GCTGCGCCTG
TTGCGGGAGG GCGCCCTGGC CGTGGCCCAC CGTGACCTGC CGGAGGCGGT GCGTCGGCTG
CAGAGCATGC AGGCCGTTGA CGAGGGCGGG GTCGACGACA TCGTCCGTGA GGTGCGGGAA
CCGATCCGGC TCAACAACCA GGACGAGGTC GGTCAGGTCG CGCTCGCCTT CAACGTGGTG
CACCGGGAGG CTGTCCGGGT AGCGGCTGAA CAGGCAGCCC TGCGGACCAG CGTTTCGGCC
ATGTTCCTCA ACCTGGCCCG ACGGAGTCAG AACCTGGTTG ACCGCATGAT CGGAGAGCTG
GACGCGATCG AGCGTGGCGA GGAGGACCCG AAGCGTCTGG CCCGGCTCTT CGAACTGGAC
CACCTGGCGA CCCGGATGCG CCGCAACGAC GAGAACCTGC TGGTCCTCGC GGGGGCCGAC
TCGACCGTGC CCCGGCGGGA GGACGCTCTG CTGGTGGATG TGTTGCGGGC CGCGCAGTCC
GAGGTGGAGC TCTACAACCG GATCGAGTTC GGCACCGTTG ACACCGATGT CTCGGTGGCC
GCCCACGCGG TCAACGACGT GGTCCGACTC GTCGCCGAAC TACTCGACAA CGCCACCCGG
TTCTCGCCAC CGAACACCAC GGTGGTCGCC GACGGGCGGC GGATCCGCGA CTATGTGCTC
ATTCAGGTGG AGGACCGTGG CCTCGGCCTC TCCGACGAGC AACTCGAATC GCTCAACCGG
CGGTTGGCCG AGCCATCGAG CGTGGATGTC GCGGCATTCC GGCTGATGGG CCTGGCCGTG
GTGAGCCGGC TCGCCGACCG GTACGGCATC CGGGTCGAGC TACGCCGCAA CGTCGAGGGT
GGCACGGTCG CCCAGGTGAC TCTGCCAACG GCCACGGTCG TCCTGCCCGT TGGCCGGGGA
CCGGCCCAGA TCAGCCGGCC CCGTCAGCCG CTCGCGGTGG AGCAGGGTCC GTCCACCCCG
ACCGGCCTGG GCGGTCCGTT GGTCGGTGCC ACCCGGGCTG CCACCCTGCC GGAGCAGCGG
CCGCCCGAGC CGGCTCCATG GCAGGCGCCC GAGCCAGCTC CGTGGCAGGC GCCGGAGCAG
GCCCGCAGCG CAACGGCACC GGTGCAAGCC GGCGGAATGG GCGGAATGGT CGGGGCGTCG
CCAGGCCTCG GTGCTACGGG CCACCCCGGT GATGCGCCAA CCGCGGCCTA TCCGCTTCCC
CAGCGGAATC CGTCACGCGA TTCGTCGGCG GCGACCGCTG GCTTCCCCAC CGTGCCAGGC
AGTACCCCGC CGTTGACCGA CTACGGATCG ACCGGGGGCC TCGGCGCCGA CCTGGCCGCT
ACCGCGTCAT TCGCCTCCAC CCCGCTGGAC GTGCCACCAG CCGCTCCACC AGCCCCTCCG
CAAGCCGCTC CACCGGCAGA GGCGCCGATC TTCCGGGAGA TGGAGGCGGT CTGGTTCCGG
TCGCACGGCA ACGACGCCAC CGCCATCTTC ACTCGGCCGG ACTTCGACGG CGCGGCCCAA
CCACCGGCCC CGGACTGGTC GGCGACGGCA GGTGGCCCGG CCGGGCCACA GCTGCCCACC
CGGGTGCCGG GTGCCACGAC GACTCCGTCG CCAGTTGGCC CGCCGCCGTA CACCGCGCCA
ACCGGGGCTA CCGCCGCGCC CACGAGCCCC ACCACGGCGT CTCCCGTCGG CCCGCCGCCC
GCCGCGACCC CGACCGGCGT CCCGACGGCC GCGTCAGGTG CCCCGGGGGC GTCGGCGACC
AGCGCCGACG CGTGGCGCAC CGTTGCGGAC GACGGCTGGA GCCGGGCCAG TCGGGCCGCC
GAGCCGGCCA GCGGTGGTAC GACCCGTTCC GGCCTGCCGA AGCGGGTGCC GAAGGCGCAG
CTCGTGCCCG GCGGCATCGA GCCGCGGGCC CGGGAACGCA CTCGCCGGAC ACCGGACGAA
GTCCGCGGTC TGCTGTCGGC CTATCACCGC GGTGTGCAAC GCGGCCGAGC GGCCGGCTCG
GACCCCAACA GCACCTCGAG CAAGGAGACG AGCTGA
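
The GC content field can in principle be recomputed from the gene sequence. The sketch below shows one way to do that for a sequence laid out in space-separated blocks of ten bases, as it is here. The function name `gc_content` is hypothetical, and the example runs only on the first ten-base block rather than the full gene, so it does not reproduce the 71% figure.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string.

    Whitespace (spaces, line breaks) is ignored so that the blocked
    layout used in the record can be pasted in directly.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First ten-base block of the gene sequence above (7 of 10 bases are G or C).
print(gc_content("GTGAGCACCG"))  # 70.0
```

Passing the complete gene sequence as a single multi-line string gives the value for the whole CDS.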
 
Protein sequence
MSTGPTTLPA GGDIDQPKRH WLPRLRNARI RSKLALILVV PVAAVIALAT VRLVTVGEGA 
YDATRAKALT ELSIDISALA HDMHAERMAA TVYLASTKET ADAYNLRVRS TDERVQAYRE
ERERIGEVPS AVSDRLVAID EHLTTLDAAR QQVLDRRQMA VSEAALRYGV ILADLVAYGD
GLAQLPGDEQ LADARRAVAA FGRAKAAVAE QESVAYTALS VGSFDEEQYS SFVATLTGQQ
EALLAFSLAA SPSQRSFVDS TVSGDAVTLA DKVAADITRS VGQRSLVSAA DASAAIGAVN
DLMRWTEARL QERLLADTEQ TRANVLRQAI VESLLGLLTL IIAIALAVVL ARSLNHSLRL
LREGALAVAH RDLPEAVRRL QSMQAVDEGG VDDIVREVRE PIRLNNQDEV GQVALAFNVV
HREAVRVAAE QAALRTSVSA MFLNLARRSQ NLVDRMIGEL DAIERGEEDP KRLARLFELD
HLATRMRRND ENLLVLAGAD STVPRREDAL LVDVLRAAQS EVELYNRIEF GTVDTDVSVA
AHAVNDVVRL VAELLDNATR FSPPNTTVVA DGRRIRDYVL IQVEDRGLGL SDEQLESLNR
RLAEPSSVDV AAFRLMGLAV VSRLADRYGI RVELRRNVEG GTVAQVTLPT ATVVLPVGRG
PAQISRPRQP LAVEQGPSTP TGLGGPLVGA TRAATLPEQR PPEPAPWQAP EPAPWQAPEQ
ARSATAPVQA GGMGGMVGAS PGLGATGHPG DAPTAAYPLP QRNPSRDSSA ATAGFPTVPG
STPPLTDYGS TGGLGADLAA TASFASTPLD VPPAAPPAPP QAAPPAEAPI FREMEAVWFR
SHGNDATAIF TRPDFDGAAQ PPAPDWSATA GGPAGPQLPT RVPGATTTPS PVGPPPYTAP
TGATAAPTSP TTASPVGPPP AATPTGVPTA ASGAPGASAT SADAWRTVAD DGWSRASRAA
EPASGGTTRS GLPKRVPKAQ LVPGGIEPRA RERTRRTPDE VRGLLSAYHR GVQRGRAAGS
DPNSTSSKET S
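
The protein sequence corresponds to the gene sequence translated with translation table 11, in which the GTG start codon is read as methionine. The following sketch illustrates that relationship using only the standard library. It assumes the sense codons and stop codons of table 11 match the standard genetic code, and it recognises only three common bacterial start codons (ATG, GTG, TTG); table 11 lists a few more. The names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a simplified subset of the table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, ignoring whitespace in the input."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [bases[i:i + 3] for i in range(0, len(bases), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    # An alternative start codon such as GTG still encodes methionine
    # when it is the first codon of the CDS.
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    # Drop a trailing stop codon, which is not part of the protein.
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAGCACCG GACCGACGAC CCTGCCCGCA"))  # MSTGPTTLPA
```

The output matches the first ten residues of the protein sequence, and the final 36 bases of the gene translate to its last eleven residues followed by the TGA stop codon.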