Gene Sare_4087 details


Gene Information

Locus tag: Sare_4087
Symbol:
ID: 5704740
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4646497
End bp: 4648023
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 72%
IMG OID: 641273513
Product: signal transduction histidine kinase
Protein accession: YP_001538868
Protein GI: 159039615
COG category: [T] Signal transduction mechanisms
COG ID: [COG3920] Signal transduction histidine kinase
TIGRFAM ID:
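
The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so explicitly) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4646497
end_bp = 4648023

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1527 508
```

Both results match the Gene Length and Protein Length fields reported in the record.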


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0238732
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0371464
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCACGC TGCGCGACCT CGCCGACGAG CACACCCAGC TGCGTCCCGC CGACATCGAT 
CATCTGCACC GGATCGCGGG CGACTGGCAG TTGCTCTCCG ACATGTCCTT TGCCGACCTG
CTGCTCTGGG TGCCGGTGGG CGAGGACGGA ACGTTTCTCT GCGTGGCGCA GGTCCGGCCG
ACGACCGCCC CCACCGCCTA CCAGGACGAC CAGGTGGGCC GGATCGTCGG AGGGCCCGAG
GTTGACCACC TCGAGGTCGC CTACCGGCAG GGTCGGATCT GGCGGGAGGG CGACCCGGTC
TGGTACGGCG ACGTGCCCGC ACGGCACGAG GCGATCCCGG TGCGGCTGCG GGCCGCCGAC
GGCGAGCACG GTGAGGTGAT CGCCGTGGTC GGCCGGGACA CCAACCTGTC GACCGCGCGG
ACCCCCAGTC AACTGGAGCT GAACTATCTG ACCACGGCGG ACGACCTGGC GCAGATGGTC
GCCGACGGTA CCTTCCCGCC GCCCCGGCAC CCCGGGGAGA CCACCTCGGC GCCCCGGGTC
GGCGACGGGC TGGTCCGGCT GGACGCGAGC GGCAAGGTGA CGTACGCGAG CCCGAACGCG
CAGTCGGCGT ACCGGCGGCT CGGCTACGCC TCGCACCTGG TGGGCGAGGA TCTCGAGGCG
CTGCACCGGC GCCTCGCCGA CGATCCGCTG GAGGGCACCG ACGCGGCCAA CGCGGTGCTC
GCCGCGCTTC GGGGCGAGGC GCCGCCACGC CGTGAGATCG ACGCTCGGGG GGCGACCATG
CTGACCCGGG CACTGCCGTT GATACCCGCC GGGGTGCCGA TCGGCGCACT GGTGCTGGTC
CGCGACGTCA CCGAGGTTCG CCGCCGGGAC CGGGCGCTGA TGACCAAGGA CGCCACCATC
CGGGAGATCC ATCACCGGGT GAAGAACAAT CTCCAGACCG TCGCCGCCCT GTTGCGGTTG
CAGGCCCGCC GGGTGGCCCT GCCGGAGGCC CGGATCGCGC TGGAGGAGTC GGTCCGTCGT
GTCGCGTCGA TCGCGTTGGT GCACGAGACG CTGTCCATGT CCAACGACGA GGTGGTCGAG
TTCGACGGGA TCGTCGATCG GGTGGCGAGC GCGGCGACCG AGGTGGCGGC GACCGAGTCG
ACCATCCGGA TGCGTCGCCG GGGCAGCTTC GGCGTGCTCT CCGCCGAGCT CGCCACATCG
CTGGTGATGG TTCTGAACGA ACTCCTGTTG AACGCCGTCG AGCACGGGTT TCCCGCCGAC
GACGGCACCG GTGAGCAGGC CGGCACCGTC GGACCGCTCC CGGAGGTGGT GGTGTCCGCG
CACCGGTTCC GCAAGATCCT GCACGTTTCG GTGGCCGACA ACGGTGCTGG ACTGCCGCCG
CAATTCGACG CCGAGCGCGG CGGCGGCCTC GGCCTGCAGA TCGTCCGCGC CCTGGTCACC
GGTGAGCTGC GCGGCACGAT CGAGCTTCGG GCGAGTGCCA GCGGAGGCAC CGAGGCCATG
CTCGTTCTCC CACTCACCCG GCTGTGA
 
Protein sequence
MSTLRDLADE HTQLRPADID HLHRIAGDWQ LLSDMSFADL LLWVPVGEDG TFLCVAQVRP 
TTAPTAYQDD QVGRIVGGPE VDHLEVAYRQ GRIWREGDPV WYGDVPARHE AIPVRLRAAD
GEHGEVIAVV GRDTNLSTAR TPSQLELNYL TTADDLAQMV ADGTFPPPRH PGETTSAPRV
GDGLVRLDAS GKVTYASPNA QSAYRRLGYA SHLVGEDLEA LHRRLADDPL EGTDAANAVL
AALRGEAPPR REIDARGATM LTRALPLIPA GVPIGALVLV RDVTEVRRRD RALMTKDATI
REIHHRVKNN LQTVAALLRL QARRVALPEA RIALEESVRR VASIALVHET LSMSNDEVVE
FDGIVDRVAS AATEVAATES TIRMRRRGSF GVLSAELATS LVMVLNELLL NAVEHGFPAD
DGTGEQAGTV GPLPEVVVSA HRFRKILHVS VADNGAGLPP QFDAERGGGL GLQIVRALVT
GELRGTIELR ASASGGTEAM LVLPLTRL
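
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch in plain Python rather than the database's own tooling. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the start-codon set is a simplifying assumption covering only ATG, GTG and TTG. The codon table uses the standard amino acid assignments, which table 11 shares.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments and differs in its start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumption: only these initiators are handled; an initiator codon
# is read as methionine regardless of its usual assignment.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the record's layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = "GTGTCCACGC TGCGCGACCT CGCCGACGAG CACACCCAGC TGCGTCCCGC CGACATCGAT"
print(translate(first_line))  # MSTLRDLADEHTQLRPADID
```

The printed residues match the start of the protein sequence in the record. Passing the full gene sequence to `gc_content` gives a figure to compare against the reported GC content.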