Gene Sare_1767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1767 
Symbol 
ID5705094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2037066 
End bp2038475 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content73% 
IMG OID641271270 
Producthistidine kinase 
Protein accessionYP_001536645 
Protein GI159037392 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.422503 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00341518 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCCGGTG TGTGCACGCT GACGGCGCTC ACGTTCCTGG TACCGCTCTC CGCCAGTCTC 
GGCGACGGCG TCCGCGAGGA GGCGATCGCC GATGCGGCCC GCCGCGGATC CTCGGTCGCC
GGCGCACTCG CCGTCAGCAC CAGCCCGGAG GTGGTCGAAC GGGCGATCGC GGGCAGCGGC
GACGGTCCGG CCGATCCACC GATCGTGCAC GGGCTGGCCG ACGTTCCGTC CGGTGGTGGC
CGAGTCTCGG CAGCCGACCT GGAACAGGCT CGGGCGACGG GAAAGTCGAT GCTTGTCGAC
GTCGACGGCG GCGTGGCCTG GCTCGACCCC GTGGTGCTGG CCGACCGGAC GGCTGTGGTC
GAGGTCTTCG TGCCGGACTC GGCACTCGCC ACGGGCACCG GTAAGCGGTG GCTGCTCCTG
GCTGGTCTGG CGACAGGGTT GGTCGCCGCG GCGGTCCTGC TGGTCGACCG GGTGGCCGTG
CGCACGGTTA ACGCGGCCCG CGGCCTGGTA CGCGGGGCGC TCGCCCTCGG CGCTGGCGAC
CTGAACGTCC GGGTGGAGCC GAGCGGTTCA CGGGAGTTGG TCGAGGCCGG GTACGCGTTC
AACCGGATGG CCGAGCATCT CGTGGCGACC CGCGCCGACG AGCGGGAACT GGTGGCCGAC
CTGTCGCACC GGCTGCGAAC GCCGCTGACC GCGCTCCGGC TGGACGCCGA GGCGCTGGAC
CCGGACGACA CCAGCATCGG AACCTTCACC GAGGCGGAGC TGGATCGCCG GCGTGGGATC
CGGCGGATCC GACAGGCGAT CGCCACCCTG GAGGGCGAGG TCGATCAGTT GATCAAGACC
ACCCGGAAGG CGGCCACCCA GGAGATCGCG CCGGGCAGCT GCGATGTGAG TGAGGTGGTC
CGGGAGCGGA TGGTGTTCTG GTCCGCCCTG GCTGGCGATC AGAACCGGCC GCACCAGGTC
ATCGGGGCAC AGCTACGCAT CCCGGCGCCG GTGCCGCGGG CGGAACTGGC CGCCGCACTG
GACGCGGTGA TCGGCAACGT GTTCCGGTAC ACGTCCCAGG GAACGGGGTT CGAGGTGGCG
GTCTCCCGGC GGGACGGGTA TGTGGCCATC CGGATCGACG ACGCCGGACC GGGCGTCGCG
GACCCGGACC GGGCGCTGCG CCGGGGCACC AGCGACCGCG GCTCGACCGG TCTCGGCCTG
GACATCGCGA AGCGGGTCGC GTTACAGGCC AACGGTTCGG TCAGCATCGA CCGCGCCCGG
TTGGGGGGAG CGAGTGTGGT GATGCTGCTG GCCGACCCGG AGGCGGCACC GCGGCAGGTC
AGCCGGTTCG GCCTGGTGGG CCGGAGGGCC CGGGACGCCC GCGACCAGAA GGGCAGCGTC
CGACGCTGGT ACCGCCGGGC TGACCGCTGA
 
Protein sequence
MAGVCTLTAL TFLVPLSASL GDGVREEAIA DAARRGSSVA GALAVSTSPE VVERAIAGSG 
DGPADPPIVH GLADVPSGGG RVSAADLEQA RATGKSMLVD VDGGVAWLDP VVLADRTAVV
EVFVPDSALA TGTGKRWLLL AGLATGLVAA AVLLVDRVAV RTVNAARGLV RGALALGAGD
LNVRVEPSGS RELVEAGYAF NRMAEHLVAT RADERELVAD LSHRLRTPLT ALRLDAEALD
PDDTSIGTFT EAELDRRRGI RRIRQAIATL EGEVDQLIKT TRKAATQEIA PGSCDVSEVV
RERMVFWSAL AGDQNRPHQV IGAQLRIPAP VPRAELAAAL DAVIGNVFRY TSQGTGFEVA
VSRRDGYVAI RIDDAGPGVA DPDRALRRGT SDRGSTGLGL DIAKRVALQA NGSVSIDRAR
LGGASVVMLL ADPEAAPRQV SRFGLVGRRA RDARDQKGSV RRWYRRADR