Gene Sare_1555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1555 
Symbol 
ID5706757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1789335 
End bp1790663 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content74% 
IMG OID641271066 
Producthistidine kinase 
Protein accessionYP_001536442 
Protein GI159037189 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0110946 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGATCG GCGTCCTCGG AGTGTCGACA GGACTCGCCA TAGGCGGCCT GGTGCTGCTG 
GCCGTGCTCA CCTGGACCCT GCACCGCAAC GTCGACACGG AGGCGCTGCG CACCGCCGAC
GCCGTCGCCC TGCTGGCCGC CGAGGACGCC CTGCCCGACC CCCTGCCCGT CGCCGGCGGC
CAGGTACGGG CACAGGTGAT CGACACACAG GGGCGGGTCC GAGCCGCCTC CATCGACGCC
GACCGGCTGG TGCCGATGGT CCGACCGGAA CAGGTCGACC GCACCGCCCG AGCCGGAATC
ACCGTGGCGG GGGAGCGGAT GGGGCTGACC GGGCCGGTGC GGGTCGTGGC CGTCCCGGCC
GGCACCGATG CCGATCCACT CACCGTGCTG GTCGCGCGCT CGCTGGTGGA CGTGCGGCAC
AGCACCAACA CCGTCCAGAC CATCCTGCTG GCCAGCTTCC TACCGCTGGT GGGCCTGCTC
GCCGTCGTGG CCTGGCGGGC AGTCGGCGCG ACGCTGCGAC CGGTGGAGGC GCTGCGCCGC
GGCGCCGAAG AGATCACCGG TCGGGATGGG AGCGGCCGGC TGCCCGTACC CGCATCGCGG
GACGAGATCC ACCGGCTCGC GGTGACCCTC AACGACATGC TGGACCGGCT GCGCGCCAGC
CGCGACCGGG AGCGGGCATT CGTCGCCGAC GCCGCACACG AACTGCGAAG TCCACTCACC
AACCTCCGAA CCGAACTCGA GGTCGCACAG CGACTCGCGG ACCGAACCGA CTGGTCGGCC
GTCTCCACGG ACCTGCTCGC CGACACCGAA CGGCTCGGCC GTCTGGTCGA CGACCTGCTG
CTGCTCGCCC GCCTCGACGA GGACCCGCCC ACCCGTGCGG CCGGACCGGT CGACCTGGGC
GCCCTGCTGG CCGAGGTGGT CGACCGCCAC CCGGCGAGGA CCGTGCACGT GGAAGCGCCG
ACGTCGCCAC TGTGGGTGGT CGGCGACGTG GACGAACTGC GCCGGGTACT CGTGAACCTG
GTCGACAACG CCGTACGGCA CGCCCAGAGC CGGGTGGTGC TCGGCGCCGA GCCGGACGGG
CCCGCGTACC ACCGGGTGAC GGTCACCGAC GACGGGCCCG GCATCCTGGT GGCGGACCGG
GAACGGGTGT TCGGCCGGTT CACCCGGCTG GACGCCGCGC GGGATCGGGA CGCCGGCGGC
GCCGGCCTGG GCCTCGCCAT CGTGCGCGAG CTGGTCCGGC GGGCCGGCGG TCGCGTCGAC
CTCGACGACG CGGACCCGCC ACCGGGCCTG CGGGTCCGGG TAACCCTGCC CGCGCTGCCC
ACCGAGTAG
 
Protein sequence
MVIGVLGVST GLAIGGLVLL AVLTWTLHRN VDTEALRTAD AVALLAAEDA LPDPLPVAGG 
QVRAQVIDTQ GRVRAASIDA DRLVPMVRPE QVDRTARAGI TVAGERMGLT GPVRVVAVPA
GTDADPLTVL VARSLVDVRH STNTVQTILL ASFLPLVGLL AVVAWRAVGA TLRPVEALRR
GAEEITGRDG SGRLPVPASR DEIHRLAVTL NDMLDRLRAS RDRERAFVAD AAHELRSPLT
NLRTELEVAQ RLADRTDWSA VSTDLLADTE RLGRLVDDLL LLARLDEDPP TRAAGPVDLG
ALLAEVVDRH PARTVHVEAP TSPLWVVGDV DELRRVLVNL VDNAVRHAQS RVVLGAEPDG
PAYHRVTVTD DGPGILVADR ERVFGRFTRL DAARDRDAGG AGLGLAIVRE LVRRAGGRVD
LDDADPPPGL RVRVTLPALP TE