Gene Sare_1089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1089 
Symbol 
ID5704080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1226922 
End bp1227983 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content72% 
IMG OID641270604 
Producthistidine kinase 
Protein accessionYP_001535988 
Protein GI159036735 
COG category[T] Signal transduction mechanisms 
COG ID[COG2205] Osmosensitive K+ channel histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0982705 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGAGC TGGCGCTGAT CTTCGCGTTC GCGCTCGGGC CGGCGCTCTG CGTCGGCGCC 
GCCGGCGCGC TCGCCCTGCG CCTGCTCCGC GGACGCTCGG TGACCGTGCA CATCGTCACC
CTGCTGACGG TCGCGGTGAC CGCGGTGGTG GCCGCCGTGG CCGTCGTGGC CGACGCGATG
TTCCTCTCCG CGCACGACCG CAACGTGGTG CTGATCACAG TTGCGGCCGC GGCGGTGGTG
AGCCTCTCGG TCGGCTGGCT CTTCGGGCGT CGCCTGGCCG CCGCTGCCGT GTGGGCGGAC
CAGGCCCGGC AGCGGGAGCG CCGGATCGAG CAGGGCCGAC GGGACCTGGT TGCCTGGGTG
TCACACGACC TGCGGACCCC GCTGGCCGGG CTGCGGGCCA TGGCCGAGGC ACTGGAAGAC
CGGGTGGTCG ACGACCCCGC GACGGTGGGC GAGTACCACC GCCGGATTCG GGTGGAGACC
GACCGGATGA CCCGTCTGGT GGACGACCTG TTCGAGCTGT CCCGGATCAA TGCCGGCGCG
TTGCGCCTGC ACCTGTCGGC GGTACCCCTG GGCGACGTCG TGTCGGACGC CGTCGCCAGC
ACCACACCAC TGGCGACCGC CCGCCGAGTC CGTCTGCTGG CACCCGACTC GGGCTGGCCC
ACCGTCCTGG CCAGCGAGCC CGAGCTGGCC CGGGTGGTGG GGAACCTCCT GCTCAACGCC
GTCCGCTACA CACCGTCCGA GGGAACTGTC CGGGTCGAGG CCGGGGCGGA GACCGACTGG
GCCTGGCTGG CCGTGGCGGA CACCTGCGGC GGCATCCCGG AGGAGGACCT GCCCCGCGTC
TTCGATGTCG CCTTCCGCGG CGAGCGGGCA CGTACCCCCC ACCCCGGCAA CGGTGACCTG
GCCAGCTCGG GGGGTCTGGG GCTGGCGATC GTACGAGGGC TGGTCGAGGC GCACGGCGGC
CGGGTACACG TGCGGAACAC GACCGGCGGA TGTCGGTTCG AAATCCGGCT GCCGCTTCCG
GGAACCATCG AAGCACATCG GCTGTCATAT CTATTTTCAT AG
 
Protein sequence
MRELALIFAF ALGPALCVGA AGALALRLLR GRSVTVHIVT LLTVAVTAVV AAVAVVADAM 
FLSAHDRNVV LITVAAAAVV SLSVGWLFGR RLAAAAVWAD QARQRERRIE QGRRDLVAWV
SHDLRTPLAG LRAMAEALED RVVDDPATVG EYHRRIRVET DRMTRLVDDL FELSRINAGA
LRLHLSAVPL GDVVSDAVAS TTPLATARRV RLLAPDSGWP TVLASEPELA RVVGNLLLNA
VRYTPSEGTV RVEAGAETDW AWLAVADTCG GIPEEDLPRV FDVAFRGERA RTPHPGNGDL
ASSGGLGLAI VRGLVEAHGG RVHVRNTTGG CRFEIRLPLP GTIEAHRLSY LFS