Gene Sare_3071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3071 
Symbol 
ID5706842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3480124 
End bp3481233 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content73% 
IMG OID641272512 
Producthistidine kinase 
Protein accessionYP_001537880 
Protein GI159038627 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000550678 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAACCTCC GGCGCGTCAG CCTCGCAGTG CGGCTCTTCG CCGCGCAGGT GCTCGCGCTC 
GCCGTCGGCG GGCTGACCCT GGCACTGGTC GCCGGCGCGG TGGGCCCACG GATCTTTCAC
GAGCACCTCG GGCGAGTCGG CGGGGAGGTC AGCGCCGAGG CGCGCTGGCA CGTCGAGCAG
GCGTACGCCT CGGCGAACCT GCTCGCGCTC GGTGTCGGTC TGCTCGCCGC GCTGGGCGCG
GCGCTGGCGG CCAGCGCGTA CGCGACACGG CGGATCACCC GCCCGGTCAC TCACTTCGCG
CACGCGTCGG CCAGCCTCGC CGACGGACAC TACGACGTCC GCATGGCCGA CCCCGGGCTC
GGCAACGAGC TCAAGACCCT CGCGGACTCC TTCAACACCA TGGCGGAGGG CCTGGAGACC
GTCGAAGCGA CCCGGCGACG GCTGCTCGCC GACCTCGGCC ACGAACTGCG TACCCCGCTC
GCCACCATCG AGGCCTACCT CGAAGCGGCT GAGGACGGCG TCGCTGTCGA CGACGAAGAC
CTCCAGTCGG TGCTGCGTGC CCAGACCGCG CGGCTGCACC GACTCGCCGA CGACATCGCC
GCGGTCTCCC GCGCCGAGAC GCATCAGCTC GACCTGCACC CGGTACGGAC GGCCCCGGCG
GACCTGGTCC GGGACGCCGT GGCCGCGGTG CGGCCCCGCT ACGCCGGCAA GGGTGTGACG
CTGCGCAGCG ACCTTCGCCC GTCCCCGCAG GTCGATGTCG ACCCACAGCG GATGGGGCAG
GTGCTTGGCA ACCTCCTGGA CAACGCGCTA CGGCACACCC CCGAGGGCGG GACCGTCACG
GTACACGTGA TCGGCGGTGC CAGAAGTGTG GAGCTGGCGG TGGCCGACAC CGGTCCCGGC
ATCCCGGCGC AGCACCTCCC GCACGTCTTC GAACGCTTCT ACCGGGTCGA CACGGCCCGG
GACCGGGACA ACGGCGGGTC GGGGATCGGG CTCGCCATCG TGCGGGCCGT GGTCAGCGCC
CACGGGGGGC GGGTACGGGC GGATAACGTC CCGGGTGGTG GCACGATGGT CAAGGTGGTC
CTGCCACCGT CGGGACAGGG GCAGGTCTGA
 
Protein sequence
MNLRRVSLAV RLFAAQVLAL AVGGLTLALV AGAVGPRIFH EHLGRVGGEV SAEARWHVEQ 
AYASANLLAL GVGLLAALGA ALAASAYATR RITRPVTHFA HASASLADGH YDVRMADPGL
GNELKTLADS FNTMAEGLET VEATRRRLLA DLGHELRTPL ATIEAYLEAA EDGVAVDDED
LQSVLRAQTA RLHRLADDIA AVSRAETHQL DLHPVRTAPA DLVRDAVAAV RPRYAGKGVT
LRSDLRPSPQ VDVDPQRMGQ VLGNLLDNAL RHTPEGGTVT VHVIGGARSV ELAVADTGPG
IPAQHLPHVF ERFYRVDTAR DRDNGGSGIG LAIVRAVVSA HGGRVRADNV PGGGTMVKVV
LPPSGQGQV