Gene Sare_3510 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3510 
Symbol 
ID5703319 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4049108 
End bp4050787 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content72% 
IMG OID641272937 
Producthistidine kinase 
Protein accessionYP_001538303 
Protein GI159039050 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.026405 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCGCCT CCACTCGCGT CGTGCCTCGA CCGCATCCCC TCGCCGCCGC GGCGCGCGTG 
GTGATGCTGG CGTTGGTGGC GGTGCTGAGC CTGCTCGCCA CGCACGATCC TGCCCAACTG
TGGTGGGTCG CGCTGCTGGC GGCGACGGGC CTGCCCGCAC TGCTCGCGCC GGTGTACCAC
TGGCTCGGGC CGCTGGGCCG AGGCGCCGAG GTGGTGGTGC TGGCTCTCGC CACCAGCCAG
GTCGCCTCGG TCGCCACCAT TGGCGCGCAG AACGGTGGGT TGGGTGCCTC CGCGGTGCTG
CCCTACCTGG CCGTCCCGAT CACCGTCACG GCACTGCGCC GACGCTTCCG CGAAGGCGCC
TGGTTACTGG CGATCACCGG CCTCACCCTG CTGCTGGCCG GCGCGCTGAC CGAGGTCGAC
GGCCAGCGGC AGCTCACCCA GGTCGGCTAC CTGGCAGTCA GCGCCCAGTG GCTGGTCCTG
TCCGGTCTTG GGCTCTACGC GGCGCGGACC CTGCACCGGG TGATCCGAGC TCGCAGCGTC
AGCAAGCCCC AGCCGTACGC GGAGGCAACC CGGCTACTCA CCCAGCTTCG GACGGTGGCC
CGTCAGCTAC CCGGCGCCAC GCTGGACCCG GGTGGTATCT CCGAGCACCT GCTGGAGGAG
CTGCGCACCC TGGCCCGAGC GGACCGAGGG GCCGTTCTCT CGGCCAGCGG CGGTGGACGG
CTGGTGGTGC TGGCCCAGTG CGGAACCGAC CGGGTGGACT GGGAGACGAC GCTGGACGCG
GACTCGGCGA TCGCTGACGC GTGGGCCAGC CAGCAGCCGC ACACCGCCGC GCACTCTCAG
GCCCGCTCGC ACGCTGGCGG GGAGGTGTCC GCGCTGATCG TGCCGCTGGT CGCCGGGGTA
CGTACGGTCG GGCTGGTGGT GCTGGAGGCG GACGTCGCGC ACGCGTACCC GCCCGAGATC
GTGTCCCGGG TGACCGGGCT GACCTCCCCG GCCGCGCTGC GGCTGGAGGC GGCCCTGCTC
TTCGACGAGG TGCGGTCACT GGCCACCAAC GAGGAGCGAC AACGACTCGC CCGGGAAATC
CACGACGGGG TGGCCCAGGA ACTGGTGATG GTCGGCTACG GCATCGACAA CGCGCTGGCC
ACGGTGCACG ACGACACCGA CGAGACCGCC GAGTCGCTAC GACTGCTACG GCAGGAGGTC
ACCCGGGTCA TTACCGAGCT GCGACTCAGC CTCTTCGAGC TGCGCAGCGA GGTGGACCGG
CACGGCGGCC TGGCTGCCGC CATCGCCGAG TACGCGCGCA CGGTCGGCGT CTCCGGCGGC
CTGCGGGTAC ACCTGTCGTT GGACGAGTCG ACCGCCCGGC TGCCCGCCGC CACCGAAGCC
GAGCTGCTAC GGATCGCCCA GGAGGCCGTG GCCAACGCCC GCAAGCATGC CGGTGCGTCG
AACCTCTGGG TCACCTGTGC GGTGGACCCG CCGTACGCGC AGATCGAGGT GTCAGACGAC
GGGCACGGTA TTGCCGACCA GCGCACTGAC GGACACTACG GTCTTGCAAT CATGGCCGAG
AGGGCGGAAC GTATCCGAGG CCGACTGGAG ATCCGGCCGC GGCAACCGAG CGGCACGACC
GTGGCCGTGG TGGTCGGTTC GTCGCCTCGG CGCGATAACG TGCCTGACAG CACCGCATGA
 
Protein sequence
MPASTRVVPR PHPLAAAARV VMLALVAVLS LLATHDPAQL WWVALLAATG LPALLAPVYH 
WLGPLGRGAE VVVLALATSQ VASVATIGAQ NGGLGASAVL PYLAVPITVT ALRRRFREGA
WLLAITGLTL LLAGALTEVD GQRQLTQVGY LAVSAQWLVL SGLGLYAART LHRVIRARSV
SKPQPYAEAT RLLTQLRTVA RQLPGATLDP GGISEHLLEE LRTLARADRG AVLSASGGGR
LVVLAQCGTD RVDWETTLDA DSAIADAWAS QQPHTAAHSQ ARSHAGGEVS ALIVPLVAGV
RTVGLVVLEA DVAHAYPPEI VSRVTGLTSP AALRLEAALL FDEVRSLATN EERQRLAREI
HDGVAQELVM VGYGIDNALA TVHDDTDETA ESLRLLRQEV TRVITELRLS LFELRSEVDR
HGGLAAAIAE YARTVGVSGG LRVHLSLDES TARLPAATEA ELLRIAQEAV ANARKHAGAS
NLWVTCAVDP PYAQIEVSDD GHGIADQRTD GHYGLAIMAE RAERIRGRLE IRPRQPSGTT
VAVVVGSSPR RDNVPDSTA