Gene Sare_3937 details


Gene Information

Locus tag: Sare_3937
Symbol:
ID: 5703674
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4480149
End bp: 4481339
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 73%
IMG OID: 641273362
Product: putative signal transduction histidine kinase
Protein accession: YP_001538718
Protein GI: 159039465
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID:
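
The coordinate and length fields in this record agree with each other if the start and end positions are read as 1-based, inclusive coordinates and the protein length is taken to exclude the stop codon. Both readings are assumptions, not something the record states. A minimal Python sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(4480149, 4481339)
print(length)                  # 1191
print(protein_length(length))  # 396
```

Both values match the Gene Length and Protein Length fields listed for Sare_3937.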


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.24552
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0307354
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGCCG TTGCCCCCAC GACCGCACTA CCCGGCCTAC CCGAGCCCAA GGGGCCGCCG 
GAGGCGGCCA CTCCTGCGGG CGACGCCTTC ACCCTCATCT TCACCACCAC TCCGGCCCTG
CTCCGACTGA CCGTCGGGCT GGTCGGCGCC GTGGTGGCGG TGTCGGTCCG GACGCCGCCC
GTGGTGCCCC CGCTACTGTT CCCCGCCACC GTGATACTGG CCTCGTGGTC GGTCTGGTAC
GCACGTCGGG CGCTCCGACG TGGCTTCACC ACGCCGCTGG TATCCGGTGA CGTCGCTCTG
ACCTCCGCAG CCTGTCTGGC CACTCCCGTG CTGGTCGCCC CGGAGGTGTT GCCCGGTGAG
GTCAGCTGGA TCGCGGTGTT GGCCAGCACC ACCGTGATCA ACGCCCAGGC GACCGCGCCG
GCCCGCTGGT CGATCCCGGC CGGCGTGCTG GTCACCGCCG CCTACGCCGT CGGCTCGCAC
ACCGCCGGCA ACCCACGAGA GGCCGTCGCG CATACGGCCA CCCTGCTCGT CCAGACCGGC
ATCGCTGCGG CGATAACCGC GGTGATGCGT CGTCGGATCA CGCGCGCCGA CCACGCCTTC
GCCAAGGACC AGCGGCTGGC CCGCCAGCAC CTGATCGCCC GCACCGCGCG GGATGCCGAA
CGCCGGCAGA ACCGGAACCT GCACGACACC GTGCTGGCGA CACTGACCGT GGTCGGGCTG
GGGGCGGGAG CCGGTCCAGC GCTGCGGGAG AGGTGCTCCG CCGACCTGTC CACCCTCTCC
GCGCTGGTGG ACCGCCCCCC GGCGAACGGC CCGGTCGCCT TGGACACACG GCTACGGACG
GTACTTTCCC GACTGCCGGG CCTGGCGGTC ACCGCGGACC TGGCACCCTG CACCGTGCCC
GTGGCAGTGG CCGAGGCGGT AGCGGAGAGC GTCGCTGCCG CGCTGTCCAA CGTGGCCCGG
CATGCCCCGA CCGCGGCGAC CGTGCTGCGT CTCACCCGGG CCGGCGGCGC CGTCGTGGTG
GAGGTCGTCG ACGACGGTCC CGGTTTCGAG CCGGCCACGG TACCGACCCA TCGGTACGGG
ATTCGCGAGT CGATCTGCGG ACGGATGGTC AGCGTGGGCG GGCGGGCCCA GGTCCACTCC
CGGCCCGGCG CTGGCACTCG GATCCGGCTG GAGTGGTCGG ATGTCTGCTG A
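
GC content is the fraction of G and C bases in a nucleotide sequence. A minimal sketch of how such a figure could be computed from the sequence as printed above, grouped in blocks of ten separated by spaces and line breaks. `gc_content` is a hypothetical helper, and the database may compute or round its reported value differently.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for layout."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First ten-base block of the gene sequence: 8 of its 10 bases are G or C.
print(gc_content("ATGCCGGCCG"))  # 0.8
```

Passing the full gene sequence as one multi-line string works the same way, since all whitespace is dropped before counting.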
 
Protein sequence
MPAVAPTTAL PGLPEPKGPP EAATPAGDAF TLIFTTTPAL LRLTVGLVGA VVAVSVRTPP 
VVPPLLFPAT VILASWSVWY ARRALRRGFT TPLVSGDVAL TSAACLATPV LVAPEVLPGE
VSWIAVLAST TVINAQATAP ARWSIPAGVL VTAAYAVGSH TAGNPREAVA HTATLLVQTG
IAAAITAVMR RRITRADHAF AKDQRLARQH LIARTARDAE RRQNRNLHDT VLATLTVVGL
GAGAGPALRE RCSADLSTLS ALVDRPPANG PVALDTRLRT VLSRLPGLAV TADLAPCTVP
VAVAEAVAES VAAALSNVAR HAPTAATVLR LTRAGGAVVV EVVDDGPGFE PATVPTHRYG
IRESICGRMV SVGGRAQVHS RPGAGTRIRL EWSDVC
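
The protein sequence can be checked against the gene sequence by translating it codon by codon. Translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in which codons may act as start codons), so a standard codon table is enough for a gene that begins with ATG. The sketch below is an illustration with hypothetical names, not the database's own pipeline.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) and their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop layout whitespace
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGCCGGCCG TTGCCCCCAC GACCGCACTA"))  # MPAVAPTTAL
```

The last 21 bases of the gene, `GAGTGGTCGGATGTCTGCTGA`, translate to `EWSDVC` followed by a TGA stop, which matches the end of the protein sequence.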