Gene Sare_2236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2236 
Symbol 
ID5704299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2572489 
End bp2573460 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content71% 
IMG OID641271716 
ProductDNA primase small subunit 
Protein accessionYP_001537087 
Protein GI159037834 
COG category[L] Replication, recombination and repair 
COG ID[COG3285] Predicted eukaryotic-type DNA primase 
TIGRFAM ID[TIGR02778] DNA polymerase LigD, polymerase domain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.848931 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.182306 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGACGC CGGTCGAGGA GATCCGGGTG GGGCGGCGGC TGGTCCGCGT CTCCAGCCCA 
GACAAACCGT ACTTTCCGGA GCGTGGGCTG ACCAAACTCG ACGTGGTGCG CTACTTCCTT
GCCGTCGGCG ATGGCATCCT GCGCGCCCTG CGGGACCGGC CGACGATGCT CGAACGGTGG
CCGCGTGGGG TCTTCGCGGG TGCCAAGATC GCGACTCGGG CGGACAACCG GGGCGACGCC
TTCTATCAGA AGCGGCTTCC GGCGGGAGCC CCCAGCTGGG TCCGTACCGC ACACATCACG
TTCCCCAGTG GCCGCAGTGC GGACGAGGTC GCACCGAGCG AACTCGCCGT GGTGGCCTGG
GCGGTCAACC TCGGCACGCT CCGCTTCCAT CCGTGGCCGG TGTCCCGGCG GGACGTCGAG
CGACCGGACC AACTGCGCGT CGACCTGGAT CCGCTGCCCG GAGTCGGGTT CGACCAGGTG
GTTTCGGTGG CACACGAGGT CCGCGCGTTC CTCGACGAGC TCGGGCTGGT GGGCTACCCG
AAGACCACCG GGGGTCGGGG GCTGCACGTC TACCTCACCA TCGAGCCGCG GTGGAGCTTC
GGTGACTGCC GCCGGGCGGT GCTGGCGCTG GGCCGGGAGA TGCAGCGTCG CCGGCCCGAT
CTGGTCACCA CCACCTGGTG GCGGGACCAG CGGGACCGAC CGGTCTTCGT CGACTACAAC
CAAATGGCCC GCGACCACAC GATGTCCTCG GCGTACTCGA TCCGGCCCAC CCCGGCGGCG
CTGGTCTCCG CGCCGGTGGG CTGGGGCGAG CTGGACGATG CCCAGCCGGA GGACTTCGAC
GTCACCACGA TGCCGACCCG CTTCGCCGAG CGCGGCGACC CGCACGCGGG CCTGGACGAC
CGGGCGTACT CGCTGGAGCC CCTGCTGGAG CTGGCCGACC GGGAGGACCT GACGGTCCCG
CCGGAGCGTT GA
 
Protein sequence
MATPVEEIRV GRRLVRVSSP DKPYFPERGL TKLDVVRYFL AVGDGILRAL RDRPTMLERW 
PRGVFAGAKI ATRADNRGDA FYQKRLPAGA PSWVRTAHIT FPSGRSADEV APSELAVVAW
AVNLGTLRFH PWPVSRRDVE RPDQLRVDLD PLPGVGFDQV VSVAHEVRAF LDELGLVGYP
KTTGGRGLHV YLTIEPRWSF GDCRRAVLAL GREMQRRRPD LVTTTWWRDQ RDRPVFVDYN
QMARDHTMSS AYSIRPTPAA LVSAPVGWGE LDDAQPEDFD VTTMPTRFAE RGDPHAGLDD
RAYSLEPLLE LADREDLTVP PER