Gene Sare_0905 details


Gene Information

Locus tag: Sare_0905
Symbol:
ID: 5706051
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1017351
End bp: 1019024
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 70%
IMG OID: 641270423
Product: histidine kinase
Protein accession: YP_001535813
Protein GI: 159036560
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
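
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a short Python sketch. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


gene_len = feature_length(1017351, 1019024)
print(gene_len)                  # 1674
print(protein_length(gene_len))  # 557
```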


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.474957
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.117481
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGCGGTT GGGGCGCACG GCTGGTCGCC GGGCTGCACC AGACTTGGCG GCGCTCGTTG 
CAGGTACGTG TGGTGACCAT CACGCTCCTG GCGTCCAGCC TGCTGGTCGG CGGGTTCGCT
TACCTCATCG CCGACAAGAT CACCAATATT CTGCTGGAGA ATGCCGAGGC CGATGTGCAG
GCCCGGCTGA ACAGTGGGGC GGTGTACGCG GCCAAACAGC TCGGGTTCTA CCACAAGCCG
CAGGAGGCGC AGTTCCAGGA GACCGTCGAC GGGACGGTGA ACTACCTGGC CGGCGGTGAT
CCTCAGCAGG CGACCGGCGT GGTGGTTGCC GTGATTGCCG ACGAGTTCGA CGGGGTGATC
CAGCCCCGGA CCGCCCCGGC GGTGGACAAT CTTGATGCGT TGATCAGTTC TGAGCTGCGC
GCGACGGTCG CCGGGGGTGA GGTCGCCAGC CAGATCGGCA CCGGCCGGCT CGGCGAGGCC
CACACGAAGT ACCTCGCCTA CGGGTCGCCG GTACCGACCC AGTTCGGTCA GGTCGAGCTC
TATTATCTCG TCCCGTTGGC TCGGCAGGAC GCCACCGCCG CCGATGCCCG GGCGACCGTG
GTCGCCACCG GCCTCGCCCT GGTGGTGCTG CTGGGGCTGC TCGCCGCTCT GGTGACCCGG
CTGGTGGTGA CGCCGGTCCG GGTAGCAGCC CGAACGGCGC AACGCCTCTC CGCCGGCCTG
CTCGACCAGC GAATGGCGGT CAACGGCGAG GACGACCTGG CGCTGCTCGC GGAGTCCTTC
AACCAGATGG CGACCAACCT GCAGCGGCAG ATCCTCCGGT TGGAGGAGAT GTCTCGCCTG
CAACGCCGAT TCACCTCGGA CGTCTCGCAC GAGCTGCGGA CCCCGTTGAC CACGGTTCGG
ATGGCCGCGG ACCTGATCTT CGCTGAGCGG GATGAATTCG ACCCGGCGGT GGCTCGCAGC
GCCGAGTTGC TCCAGACGGA GCTGGACCGG TTCGAGGAAC TGCTGACCGA CCTGCTGGAA
ATCAGCCGCT TCGACGCCGG TTTCGCTGCG CTCGACGCCG AGCCCACCGA TCTGGTGCCG
GTCGTGCACC GGGTCGCGGG CCGGCTGGCC GGCCTCGCCG AGCGGGTCGG TGTCACGGTC
GAACTGGATG TGCCCGGCGG TCCGGTGATC GCCGAAGTCG ACCAGCGTCG CGTCGAGCGT
GTCCTGCGGA ATCTGGTCGG CAACGCGGTC GAGCACGGCG AGGGCAAACC GGTGCGGATC
ACCCTGGGTG TGGACGAGGC CGCGGTGGCG GTCACCGTGC GCGATCACGG TTTGGGACTC
AAGCCCGGGG AGGAGAAGCT GATTTTCAAT CGGTTCTGGC GGGCCGACCC ATCACGTGCC
CGACAGACCG GGGGGACCGG GTTGGGCCTG TCGATCAGCC TTGAGGACGC CCGGCTGCAC
CGCGGATGGC TGGAGGCGTG GGGTGCGCCG GGGCAGGGCG CCCAGTTCCG GCTCACCCTG
CCGGCGCGGG CCGGCGACCG GCTGACCGCC TCGCCACTGC GATTGGTGCC CGCCGACGCG
AGACTGCCCT GTGGTGGTCC GCGCGAGGGC GGTCTGCTCG CCATCGGTCA GGGCAGCGCG
GGCGCGTTGG CAGTTGATCC CGCGCCGGCC GCGCCTGCCG AGGCCCAGCG GTGA
 
Protein sequence
MGGWGARLVA GLHQTWRRSL QVRVVTITLL ASSLLVGGFA YLIADKITNI LLENAEADVQ 
ARLNSGAVYA AKQLGFYHKP QEAQFQETVD GTVNYLAGGD PQQATGVVVA VIADEFDGVI
QPRTAPAVDN LDALISSELR ATVAGGEVAS QIGTGRLGEA HTKYLAYGSP VPTQFGQVEL
YYLVPLARQD ATAADARATV VATGLALVVL LGLLAALVTR LVVTPVRVAA RTAQRLSAGL
LDQRMAVNGE DDLALLAESF NQMATNLQRQ ILRLEEMSRL QRRFTSDVSH ELRTPLTTVR
MAADLIFAER DEFDPAVARS AELLQTELDR FEELLTDLLE ISRFDAGFAA LDAEPTDLVP
VVHRVAGRLA GLAERVGVTV ELDVPGGPVI AEVDQRRVER VLRNLVGNAV EHGEGKPVRI
TLGVDEAAVA VTVRDHGLGL KPGEEKLIFN RFWRADPSRA RQTGGTGLGL SISLEDARLH
RGWLEAWGAP GQGAQFRLTL PARAGDRLTA SPLRLVPADA RLPCGGPREG GLLAIGQGSA
GALAVDPAPA APAEAQR
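
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code), in which alternative start codons such as GTG are read as methionine at the first position. The following is a minimal Python sketch of that translation, not the method used to produce this record; `translate_cds` and the constant names are hypothetical, and the input is assumed to be an in-frame coding sequence on the coding strand.

```python
from itertools import product

# Codon table in TCAG order. Table 11 assigns the same amino acids as the
# standard code; it differs in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[c] for c in codons]
    # An alternative start codon is translated as methionine.
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    # Drop a terminal stop codon, if present.
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGGCGGTT GGGGCGCACG GCTGGTCGCC"))  # MGGWGARLVA
```

Passing the full 1674 bp gene sequence to the same function is expected to give the 557-residue protein listed above.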