Gene Sare_0220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0220 
Symbol 
ID5706124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp251378 
End bp253030 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content71% 
IMG OID641269749 
Producthistidine kinase 
Protein accessionYP_001535146 
Protein GI159035893 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.122039 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00447833 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCGACGCC GGGCGCCGAG GTGGCGGCTG CGGCGCTGGG CGCACTGGAA CCTGCGGTCC 
CGGCTGGTCG TGGTCGTCGC GGCACTCACC GCACTCGCAC TGGTCGCCGC CAACGCCGCC
GGCGTGGTGC TGCTGCGCCG GACCCTCATC GAGCGGATCG ACGACCAGCT GTCCGCAGTC
AACCGTCCGT TCAACCGGGA CGGCCCCACA CCGTACGACC ATCCTCGGGT GTTCCCGAAC
GTCAAGCTCG GGCCCGAGCA GGTGCTGCTG CTCTATGGTG CCGACGGCCG GCTCAGGGAC
GTGCGCCGGT CGGACGAGTC GAACGACCTG CCGATACTCG AACCGTACGA GCAACTGGTC
GCGGATGCGG CGGCGGGCCG GCCGTACTCT GTCGGGGGTG TCGACGGTAC GTCGCCGTGG
CGAGTGCTGG TGGTTGATCG TGGCGGCGAC CGCGGGGTGG CGGTGCTGGC GGTGTCGCTG
CGTGAGGTCG ACGAAACCGT TGGTCTCCTC CTGCTGATCG ACATCGTGGT CGTCCTGATC
GCGCTGTTCC TGCTGGGCTT TCTCGCGGCG GTGGTGGTCC GGCTCGGGCT GCGTCCACTG
ACCCGAATGG AGTGGGTCAC GGCCGAGATC ACTGCTGGGA ATCTGGACCG TCGGGTTCCG
GATGCCGATC CGCACACCGA GCCAGGTCGC CTTGGGGGCG CGCTCAACCT GATGTTGGAC
CGGATCTCTG CCGAGGTCGC GGCGCGCCGT GACTCGGAGC GGCGGCTGCG GGATTTCGTC
GCCGACGCGT CACATGAGCT GCGTACGCCA CTGACCTCGA TCCGTGGCTT CGCCGAGCTG
TACCGCCGGG GCGGCGCGCC GCCGGGGCCG GACCTGGACG AGACGATGAC CCGGATCGAG
GCCGAGGCGG CTCGGATGGG ACTGCTGGTC GAGGATTTGT TGCTCCTGGC CCAGCTCGAC
CACGATCGCC CGGCGCGGCG GCGTCCGGTG GACCTGCTGG CAGTCGCCGC CGACAGCGTC
CGGGACGCAC ACGCCCGGGC GCCGTTTCGG GACATTCGGC TCACCGCGCT TGACGACGAC
GCCCTGTTCG AGGCGGTCAC GGTCAGCGGC GACGACCATC GACTACGCCA GGTAGCAGCC
AACCTGGTCA GCAACGCGCT CCAGCACACC CCGCCGGACG TGCCGGTCAC GGTGCGGGCT
GGCCGTTCGC ATCGAGTTCC GGCCGGGCCG GCCCCCACGG TGTCAGTCGG CGGCTCGCTG
CCAGCCGAGG AGCCTGTCGC CGTGCTGGAG GTCACCGACA CCGGGCCGGG AATCGCCGCT
GATCAGGCGG TGCGGGTGTT CGAGCGGCTC TTTCGGGCGG AGCGTAGTCG TAGCCGCGGT
AGCGGAGGTT CCGGGTTGGG TCTGTCCATC GTGGCAGCCA TTGTCAGCGC GCATCGGGGG
CGAGTCGAGC TGGTCACCGC CCCCGGCCGG GGCGCGACGT TCCGGGTGCT GCTGCCGGCG
GTCTCTGGCA GCGGCCGGGA CGAGCCGGGT GACCTGCCTC GCGGCGTCGC GCATGACACT
CCCAGCCGGT TCCGAGGTTG CCCTGGGTTG CCTCCGAGCT GCGGGTTCCA AGGTGTTGAC
CATGCGTGTG CGACGACTCC TCCCGGCGCC TAG
 
Protein sequence
MRRRAPRWRL RRWAHWNLRS RLVVVVAALT ALALVAANAA GVVLLRRTLI ERIDDQLSAV 
NRPFNRDGPT PYDHPRVFPN VKLGPEQVLL LYGADGRLRD VRRSDESNDL PILEPYEQLV
ADAAAGRPYS VGGVDGTSPW RVLVVDRGGD RGVAVLAVSL REVDETVGLL LLIDIVVVLI
ALFLLGFLAA VVVRLGLRPL TRMEWVTAEI TAGNLDRRVP DADPHTEPGR LGGALNLMLD
RISAEVAARR DSERRLRDFV ADASHELRTP LTSIRGFAEL YRRGGAPPGP DLDETMTRIE
AEAARMGLLV EDLLLLAQLD HDRPARRRPV DLLAVAADSV RDAHARAPFR DIRLTALDDD
ALFEAVTVSG DDHRLRQVAA NLVSNALQHT PPDVPVTVRA GRSHRVPAGP APTVSVGGSL
PAEEPVAVLE VTDTGPGIAA DQAVRVFERL FRAERSRSRG SGGSGLGLSI VAAIVSAHRG
RVELVTAPGR GATFRVLLPA VSGSGRDEPG DLPRGVAHDT PSRFRGCPGL PPSCGFQGVD
HACATTPPGA