Gene Sare_4523 details


Gene Information

Locus tag: Sare_4523
Symbol:
ID: 5706013
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5110396
End bp: 5113935
Gene Length: 3540 bp
Protein Length: 1179 aa
Translation table: 11
GC content: 67%
IMG OID: 641273937
Product: GAF sensor hybrid histidine kinase
Protein accession: YP_001539286
Protein GI: 159040033
COG category:
    [K] Transcription
    [T] Signal transduction mechanisms
COG ID:
    [COG0642] Signal transduction histidine kinase
    [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain
TIGRFAM ID:
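
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS is complete, unspliced and ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 5110396
END_BP = 5113935


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes a complete, unspliced CDS (an assumption of this sketch).
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_length = span_length(START_BP, END_BP)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # prints "3540 1179"
```

Both results agree with the Gene Length (3540 bp) and Protein Length (1179 aa) listed in the record.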


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.522992
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00180703
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGACCACGG CGAAGCCGCC GGTTGCGGAG CTGTCCGCAC CGGACCCCGA GGCGGTCCTC 
ATCGAGCTGA CCGAGGCGCT ACGGCGTGTT CGCCGCGGCG ATCTCAAGGT CCGGCTGCCC
CGCCGGCCAG GTCGGGCTGG CGAGGTCGCG GACGCCTTCA ACGACGTGGT GTCGCTCCAG
GAACGGCAGC ACCACGACCT GCGCCGGATC AGCCGGATCG TCGGCCAGGA CGGGCGGCTC
AACGAACGCC TCGACGGCGA GGGCCTGGAC GGTTCCTGGG CGGAGGGGCA GCGGGCGGTC
AACTCGCTCA TCGACGACCT CGGGCGCCCG ACCATCGAGA TCGCCCGAGT CATTGTCGCG
GTCGCCGAGG GCGACCTGTC GCAGCACATG GCGCTGAAGC TGGACGACCG ACCGCTGCGC
GGCGAGTACC TGGGCATCGG GCGCAGGGTG AACACGATGG TGGATCAGTT GTCGTCGTTC
GCGGATGAGG TGACCCGGGT GGCCCGGGAG GTGGGTACCG AGGGGAAGTT GGGTGGTCAG
GCCGATGTTC GGGGCGCCGC CGGCACCTGG AAGGACCTCA TCGACTCGGT GAACACCATG
GGGTCGAACC TGACCGCGCA GGTACGTAAC ATCTCGCAGG TTTCGACGGC GGTGGCGACG
GGTGATCTGG GTCAGAAGAT CACGGTGGAT GCGCAGGGCG AGATTCTGGA GTTGAAGAAC
ACCGTCAATA CGATGGTGGA TCAGTTGTCG TCGTTCGCGG ATGAGGTGAC CCGGGTCGCG
CGGGAGGTGG GTACCGAGGG GAAGTTGGGT GGTCAGGCCC AGGTGAAGGG TGTCTCCGGG
ACGTGGCGGG ACCTCACCGA GAACGTCAAC CAGCTGGCCG GCAACCTGAC CAGCCAGGTA
CGTAACATCT CGCAGGTTTC GACGGCGGTG GCGACGGGTG ATCTGGGTCA GAAGATCACG
GTGGATGCGC AGGGCGAGAT TCTGGAGTTG AAGAACACCG TCAACACGAT GGTGGATCAG
TTGTCGTCGT TCGCGGATGA GGTGACCCGG GTCGCGCGGG AGGTGGGTAC CGAGGGGAAG
TTGGGTGGTC AGGCCCAGGT GAAGGGTGTC TCCGGGACGT GGCGGGACCT CACCGACAAC
GTCAACTCGA TGGCGTCGAA TCTGACCTCG CAGGTGCGCA ACATCGCCTC GGTCACGACC
GCCGTCGCAC GGGGCGATCT GGGTCAGAAG ATCACGGTGG ATGCGCAGGG CGAGATTCTG
GAGTTGAAGA ACACCGTCAA TACGATGGTG GATCAGTTGT CGTCGTTCGC GGATGAGGTG
ACCCGGGTCG CGCGGGAGGT GGGTACCGAG GGGAAGTTGG GTGGTCAGGC CCAGGTGAGG
GGCGTCGCCG GGACGTGGCG GGACCTCACC GAGAACGTCA ACCAGCTCGC CTCCACCCTG
ACCGTGCAGT TACGCACCAT CGCCCAGGTC TCCACGTCGG TGACCCGCGG CGACCTGACC
CAGCGGATCG CGGTCGAGGC TCAGGGTGAG GTCGCCGAAC TGACGGACAA CATCAACCAG
ATGATCGTCA CCCTCCGAGA GACGACCGAG GAGAACGCCG AGCAGGGCTG GCTGGACTCC
AATCTGGCCC GCATCGGCGG GCTGCTCCAG GGCCAGCGGG ACCTCGGTGA GGTCTGCCGC
ATGATCATGA CGGAGGTCAC TCCGCTGGTT GACGCACAGC TCGGCGCGTT TTTCCTGGTG
GACGACTCCG AGGGCAGCGT CCGACTGCGG CTCGCCGCCT CGTATGGGTA CGTTGCCCGC
GGGCAGGACG TGACGTTCGG GCCCGGTGAG GGCCTGGTCG GCCAGACGGC GCTGTCCCGG
CGCACGATTC GGGTCAGCGG AGGGCCGGAC AACCCGCTCC GGCTCCGCTC CGGACTCGCC
GACACGGCAC CGGCCGACCT GGTGGTGCTG CCGGTCATCT TCGAGGGCGA ACTGCTCGGC
GTGATCGAGT TCGCCAGCAT GACGGCCCTC TCCGACCTCT ATCTCTCCTT CCTCGAGCGG
CTGGTCCTCA CCATCGGCGT CGCGGTCAAC ACCATCCAGG CGAACCGGCG TACCGAGGAA
CTGCTGTCCC AGTCGCAGCG GTTGGCCCAC GAGCTGCAGG AGCAGTCGGC CGAACTCCAG
CGCACGAACG CGGAACTGGA GGAGAAGGCG AAGCTGCTCT CCGAGCAGAA GGGCAACATC
GAGACCAAGA ACCGGGAGGT CGAGCTGGCC CGGCTCGGCC TGGAGGAGAC GGCGCAGCAG
CTCACCCGGG CCTCGGCCTA CAAGTCGGAG TTCCTGGCCA ACATGAGCCA CGAACTGCGT
ACGCCGCTGA ACTCGCTGCT CCTGCTGGCC CGACTGCTCG TCGAGAACCC GGAGCAGAAC
CTCAGCCCGA AACAGATCGA GTTCGCCCGG ACGATCCACA GCGCCGGAAC GGACCTGCTC
TCACTGATCG ACGACATCCT CGACCTGTCC AAGATCGAAG CGGGTCGGAT GGATGTCGAG
CCGACCGGGA TCAGGTTTAC CGAGATCCGT AGCTATGTCG AGCAGGCCTT CGTGCCGCAG
GCGCAGGAGA AGGGACTGGA CTTCGAGGTC CGGGTCAGCC CGGACCTGCC GGAGAAGATC
GTCACCGACG CGCAGCGGCT GCAGCAGATC CTGCGGAACC TGATCTCCAA CGCGGTCAAG
TTCACCGACG CTGGGGTGGT GCGACTGGCG ATCGCCCCGG CGCCGGAGTC GACTGTCTTC
GACGTGCCCG CCCTGACCAA CGCGCGGCAG GTGATCGCAT TCACCGTCGC TGACACCGGG
ATCGGTATCT CCGACGACAA GCTGTCGATC ATCTTCGAGG CGTTCCAGCA GGCCGACGGG
ACGACGAGCC GCCGGTACGG CGGGACCGGC CTGGGCCTGT CGATCAGTCG GGATCTGGCC
CGGCTGCTGG GCGGCGCGAT CGGCGTGTCC TCGGCACCCG GGCAGGGCTC GACCTTCACC
CTCTTCGTAC CCGATGTCCT GGCCCCGGAC GCGGTCGTGG GGCCGGCACC ACCCTCACCC
GCCCGGGCTG GTCTCCCGTC GTCGCTGCTG ATGCCCTCGG TGGAGCTGCT GCCCACCGCC
GCCGAGGCGG CGGCGACCCG CCAGCTGGCC GGCGCCACCG TGCTGATCGT GGACGACGAC
GTACGCACGG TCTTCGCCCT GACCAGCGCG TTGGAGCTGC ACGGGATGAC CGTGTTGTAC
GCGGACAACG GGGCTGACGG CGTCCGCCTG CTGGCCGAGC ATCCGGAGGC GGACATCGTA
TTGATGGACG CCATGATCCC CGATCAGGAC GGGTACGAGA CGACTCGGCA GATCCGGCGC
AACCATCGGT TCGCCGACCT CCCGATCGTC TTCCTGACCG CGAAGGCGAT GCCCGGTGAC
CGGGAGTCCG CGCTCGCGGC CGGAGGCAGC GACTACATCA CCAAGCCCGT TGACCTGGAC
GAGTTGATCG AGCTCATGAC GACCTGGATC AGCGGTGGCC GGAGCGAGGA GGGCCTGTGA
 
Protein sequence
MTTAKPPVAE LSAPDPEAVL IELTEALRRV RRGDLKVRLP RRPGRAGEVA DAFNDVVSLQ 
ERQHHDLRRI SRIVGQDGRL NERLDGEGLD GSWAEGQRAV NSLIDDLGRP TIEIARVIVA
VAEGDLSQHM ALKLDDRPLR GEYLGIGRRV NTMVDQLSSF ADEVTRVARE VGTEGKLGGQ
ADVRGAAGTW KDLIDSVNTM GSNLTAQVRN ISQVSTAVAT GDLGQKITVD AQGEILELKN
TVNTMVDQLS SFADEVTRVA REVGTEGKLG GQAQVKGVSG TWRDLTENVN QLAGNLTSQV
RNISQVSTAV ATGDLGQKIT VDAQGEILEL KNTVNTMVDQ LSSFADEVTR VAREVGTEGK
LGGQAQVKGV SGTWRDLTDN VNSMASNLTS QVRNIASVTT AVARGDLGQK ITVDAQGEIL
ELKNTVNTMV DQLSSFADEV TRVAREVGTE GKLGGQAQVR GVAGTWRDLT ENVNQLASTL
TVQLRTIAQV STSVTRGDLT QRIAVEAQGE VAELTDNINQ MIVTLRETTE ENAEQGWLDS
NLARIGGLLQ GQRDLGEVCR MIMTEVTPLV DAQLGAFFLV DDSEGSVRLR LAASYGYVAR
GQDVTFGPGE GLVGQTALSR RTIRVSGGPD NPLRLRSGLA DTAPADLVVL PVIFEGELLG
VIEFASMTAL SDLYLSFLER LVLTIGVAVN TIQANRRTEE LLSQSQRLAH ELQEQSAELQ
RTNAELEEKA KLLSEQKGNI ETKNREVELA RLGLEETAQQ LTRASAYKSE FLANMSHELR
TPLNSLLLLA RLLVENPEQN LSPKQIEFAR TIHSAGTDLL SLIDDILDLS KIEAGRMDVE
PTGIRFTEIR SYVEQAFVPQ AQEKGLDFEV RVSPDLPEKI VTDAQRLQQI LRNLISNAVK
FTDAGVVRLA IAPAPESTVF DVPALTNARQ VIAFTVADTG IGISDDKLSI IFEAFQQADG
TTSRRYGGTG LGLSISRDLA RLLGGAIGVS SAPGQGSTFT LFVPDVLAPD AVVGPAPPSP
ARAGLPSSLL MPSVELLPTA AEAAATRQLA GATVLIVDDD VRTVFALTSA LELHGMTVLY
ADNGADGVRL LAEHPEADIV LMDAMIPDQD GYETTRQIRR NHRFADLPIV FLTAKAMPGD
RESALAAGGS DYITKPVDLD ELIELMTTWI SGGRSEEGL
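
The sequences above are laid out in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The sketch below shows one possible way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and GC content is assumed here to mean the share of G and C bases in the cleaned sequence.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACCACGG CGAAGCCGCC GGTTGCGGAG CTGTCCGCAC CGGACCCCGA GGCGGTCCTC"
seq = clean_sequence(first_line)
print(len(seq), seq[:3])  # prints "60 ATG"
```

Passing the full gene sequence text through `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content field of the record.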