Gene Sare_0052 details

Gene Information

Locus tag: Sare_0052
Symbol:
ID: 5707253
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 61221
End bp: 62648
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 72%
IMG OID: 641269578
Product: serine/threonine protein kinase
Protein accession: YP_001534979
Protein GI: 159035726
COG category: [K] Transcription
              [L] Replication, recombination and repair
              [R] General function prediction only
              [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
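
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp, end_bp = 61221, 62648
gene_length_bp = 1428
protein_length_aa = 475

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one terminal stop codon.
codons = gene_length_bp // 3

print(span_bp == gene_length_bp)         # True
print(gene_length_bp % 3 == 0)           # True
print(codons - 1 == protein_length_aa)   # True
```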


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.302171
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.296682
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCAGCC CCGGGGTGAA GCTCGGTAAC CGCTATCGTC TCGACGAGCG GATCGCCAGT 
GGCGGCATGG GCGACGTCTG GCGCGGCACC GATCAGGTGC TCGGTCGAAC CGTGGCCGTG
AAGAGCCTGC TCCCGGCCCT GCTCGACGAC CCCGACTTCG CCGAACGGTT CCGGGGCGAG
GCACGCACGA TGGCCACCAT CAACCATCCC GGCGTGGTGG ACATCTACGA CTTCGGCAGC
GACCAGCAGA TCGCCTTCCT GGTGATGGAG TACGTGGAGG GCGACGCGCT CTCGTCCACC
CTGAAACGGG TCGGCCGGCT CACCCCCGCC CGCACGATGG CTCTCGTCGC CCAGGCCGCT
GACGCGTTGC ACGCCGCCCA CCTGGAGGGC ATCGTGCACC GGGACGTGAA GCCCGGGAAC
CTGCTCGTCC GGCCGAACGG CACCCTGGTG CTCACCGACT TCGGTATCGC CCGATCCGAC
CTGGTGGCCC AGCTCACCGC CGCCGGCTCG GTGCTCGGCA CCGCCTCGTA CATCTCACCG
GAACAGGCCA CCGGTGCGGT GGCCACCCCC GTCTCGGACG TCTACGCCCT CGGTGTGGTC
GCGTACCAGT GCCTTTCCGG GAGGCGGCCC TTCGAGGGCG ACAATCCGCT CGAGATCGCG
ATGCAGCACG TCCGGGAGAC TCCGCGGTCG TTGCCCGCTG ACATTCCACC GCAGGTGCAG
GTGGTGGTCG AGCGGGCCAT GGCGAAGGAC CCGGCGGACC GCTGGCAGAG CGCGGCGGCG
TTGGCCGGGG TGACCCGACA GCTGAAGGTC CAGCTCTCCC AGATCGCCCG GGAGACCGGC
CGGGCCCGCC CGGTTTCCGC GGCGCCGGCC TCACCGGCGC CGGGCCGGGC CAAGGTGCCG
CCACCGGCCG TTGCCCGCCC ACCGCGAGCA TCCCGCCCGC CGATCGTGGC GCCGCAACCG
TCCGGCCCGC CGATGGTGGC GCCGCAGCCC TCCCGGCCGT CGGTTGCGGT ACCGCAGCCG
TCCCGGCCGC CCGTGGTGGC GCCGCAGGCA TCCCGACCGC CAGCCCCGGC GCTGGCTCCG
CGTCCCCAGC CAGGGTGGCC ACGGGTCGCA GCCGCCGCGC CGGCATCGGC CCGACCCGGA
TACGCCCCGG TTCCTGCTCC GCCACCACGC CGATCCCGCC TCGGCACGGT GTTCCTCGCC
ATCCTCTTGG CCGTGCTGGT CCTGGCCTGC TTCGGCATGG TTTCCTTCAT CCTGCGACAG
TCGAACCAGG CCGGGCCCTC CGGCGTGCCG GCGCGGCGGA CGGGGACGTC CAGCGAGCTC
CGGCTGGACG GCCGTGACGA TCCAGTTGGA ACGTCGTACC GTCGACAGCA ACTGCCCCGG
ACGGGCGGCG ATGAGACGAC GACGAGCGAA GGACGACAGA CGCGATGA
 
Protein sequence
MLSPGVKLGN RYRLDERIAS GGMGDVWRGT DQVLGRTVAV KSLLPALLDD PDFAERFRGE 
ARTMATINHP GVVDIYDFGS DQQIAFLVME YVEGDALSST LKRVGRLTPA RTMALVAQAA
DALHAAHLEG IVHRDVKPGN LLVRPNGTLV LTDFGIARSD LVAQLTAAGS VLGTASYISP
EQATGAVATP VSDVYALGVV AYQCLSGRRP FEGDNPLEIA MQHVRETPRS LPADIPPQVQ
VVVERAMAKD PADRWQSAAA LAGVTRQLKV QLSQIARETG RARPVSAAPA SPAPGRAKVP
PPAVARPPRA SRPPIVAPQP SGPPMVAPQP SRPSVAVPQP SRPPVVAPQA SRPPAPALAP
RPQPGWPRVA AAAPASARPG YAPVPAPPPR RSRLGTVFLA ILLAVLVLAC FGMVSFILRQ
SNQAGPSGVP ARRTGTSSEL RLDGRDDPVG TSYRRQQLPR TGGDETTTSE GRQTR
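
The sequences above are printed in space-separated blocks of 10, so they need cleaning before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation checked against the protein sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, which translation table 11 shares for codon-to-amino-acid assignments; table 11 differs in its permitted alternative start codons, which this sketch does not handle (the gene here starts with ATG, so that does not matter for this record).

```python
from itertools import product

# Standard codon table in NCBI "TCAG" order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGCTCAGCC CCGGGGTGAA GCTCGGTAAC CGCTATCGTC TCGACGAGCG GATCGCCAGT"
print(translate(first_line))  # MLSPGVKLGNRYRLDERIAS
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same functions should reproduce the 475-residue protein and a GC fraction close to the 72% listed in the record.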