Gene Sare_0424 details


Gene Information

Locus tag: Sare_0424
Symbol:
ID: 5708401
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 484122
End bp: 485438
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 73%
IMG OID: 641269949
Product: histidine kinase
Protein accession: YP_001535344
Protein GI: 159036091
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID:
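
The start, end, gene length and protein length listed above are mutually consistent, which a little arithmetic confirms. A minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon excluded from the protein length (neither is stated in the record; the variable names are illustrative):

```python
# Values copied from the Gene Information fields above.
start_bp = 484122
end_bp = 485438

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1317 438"
```

Both results match the Gene Length (1317 bp) and Protein Length (438 aa) fields.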


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.680525
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00719111
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCGCTG CACGCGTGGC CGCCACGCCC GCTCCCAGCC CCAGCCGCAA CCTGTTCCGC 
CAACTTCTGC GCGACTCGGG CTACGTGCTG TCGGGCCTGC CCCTCGCCAT AGTCGGCTTC
GTGGTGGCCG TCACCGGGTT CTCGCTCAGC CTTGGTCTGC TGGTGACCGC CCTGGGCCTG
CCGGTCCTGG CCGGCACGTT GTACGCCGTC CGGGTGCTGG CCGACGTCGA ACGGATCCGG
CTGCCCGCCG TGCTGGGCGT GCCCCGGATC CGCCCGGTCT ACCGCGTACC CGACTCGGGT
GCCAGCTTCT GGCGCCGAAC CCTGACGCCG GCCCGGGACC TGCAGTCGTG GCTCGATCTG
CTGCACGCGT TCTGCAAGAT GCCGGTGGCG ACGGTGACCT TCTCGGTGCT GCTGACCTGG
TGGGCGCTGG CGGTCGCCGG CGTCAGCTAC GGGGCCTACG ACCGGGCGAT CCCGTACGGT
CCGAACGACC AGAGCCTGAG CGCACTTCTC GGGATGGGCA ACGGCTCCGG CGCCCGGATC
TTCCTGAACA CCGCCATCGG GGTGTTCGCC CTGCTCACCC TGCCGCTGGT CGCTCGCGCG
TGTGCCCGGA TCGAGGGCAG CCTGTCCCGC TCGCTCCTGA CCGGCGTGGC CGAGATGCGT
AACCGGATCA CCATCCTCGA GGAGCAGAAG CGCGCGGCGG CCTCCGCCGA GGCGAACGCG
CTGCGCAAGC TGGAACGCGA CATCCACGAC GGACCGCAGC AGCGGCTGGT CCGGCTCGCG
ATGGATCTCA GCCGGGCCCG CGAGCAGCTC GCCGACGACC CGGTGGCGGC CGGGCACACG
CTCGACGAGG CAGTCGGCCA GACCCGGGAG ACCCTGACCG AGCTGCGTGC GCTGTCCCGC
GGCATCGCAC CGCCCGTCCT GGTCGACCGA GGGCTACCGA GCGCGCTAGC GGCGCTGGCC
GGACGCGGAC TGATCCCGAT CGAACTGCGG GTGGACGCCG GGCTCGGCGA GCCGGGCGGT
CGGCCCGACC CGACGGTGGA GAGCACGGCG TACTTCGTGG TCGCCGAGGC GCTCACGAAC
GTCGCGAAGC ACAGCCGAGC CACCGAGTGC CGAGTCACCG TGGAGCGGGC CGGGGAGCGG
CTGCGAGTCG GCATCGACGA CGACGGCCAG GGCGGCGCGC ACCTGGCCAA GGGGCACGGG
CTGGTCGGCA TCGCGGACCG GGTCCGGGCG GTCGGCGGGC AGCTCTCCGT GACCAGCCCG
GCCGGCGGAC CGACCGAGGT GTGCGCCGAC CTCCCCGCGA CGCCCGGCCC GTGGTAG
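
The GC content reported under Gene Information is presumably the share of G and C bases in this gene sequence; the record does not define it, so treat that as an assumption. A small sketch of the calculation in Python, where `gc_fraction` is a hypothetical helper that ignores the spaces and line breaks used to group the bases:

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above, pasted with its 10-base grouping.
first_line = "ATGACCGCTG CACGCGTGGC CGCCACGCCC GCTCCCAGCC CCAGCCGCAA CCTGTTCCGC"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full sequence block instead of `first_line` gives the figure for the whole gene.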
 
Protein sequence
MTAARVAATP APSPSRNLFR QLLRDSGYVL SGLPLAIVGF VVAVTGFSLS LGLLVTALGL 
PVLAGTLYAV RVLADVERIR LPAVLGVPRI RPVYRVPDSG ASFWRRTLTP ARDLQSWLDL
LHAFCKMPVA TVTFSVLLTW WALAVAGVSY GAYDRAIPYG PNDQSLSALL GMGNGSGARI
FLNTAIGVFA LLTLPLVARA CARIEGSLSR SLLTGVAEMR NRITILEEQK RAAASAEANA
LRKLERDIHD GPQQRLVRLA MDLSRAREQL ADDPVAAGHT LDEAVGQTRE TLTELRALSR
GIAPPVLVDR GLPSALAALA GRGLIPIELR VDAGLGEPGG RPDPTVESTA YFVVAEALTN
VAKHSRATEC RVTVERAGER LRVGIDDDGQ GGAHLAKGHG LVGIADRVRA VGGQLSVTSP
AGGPTEVCAD LPATPGPW
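
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a plain Python illustration, not the database's own pipeline. It builds the codon table from the compact NCBI ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (third base varies fastest); "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 15 bases of the gene sequence above.
print(translate("ATGACCGCTG CACGCGTGGC C"))  # prints "MTAARVA"
print(translate("CCCGGCCCGTGGTAG"))          # prints "PGPW"
```

The two fragments match the start (MTAARVA) and end (PGPW) of the protein sequence listed above.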