Gene Sare_3802 details

Gene Information

Locus tag: Sare_3802
Symbol:
ID: 5704553
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4330495
End bp: 4332378
Gene Length: 1884 bp
Protein Length: 627 aa
Translation table: 11
GC content: 70%
IMG OID: 641273224
Product: DNA primase
Protein accession: YP_001538586
Protein GI: 159039333
COG category: [L] Replication, recombination and repair
COG ID: [COG0358] DNA primase (bacterial type)
TIGRFAM ID: [TIGR01391] DNA primase, catalytic core
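
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 4330495, 4332378

length = gene_length(start_bp, end_bp)
print(length)                  # 1884
print(protein_length(length))  # 627
```

Under these assumptions the computed values agree with the listed gene length (1884 bp) and protein length (627 aa).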


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0345876
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGGAGA TGACGGGGCG GATCCGGGAC GAGGACATCG CGCTGGTCCG CGAGCGCACC 
TCGATCGCGG AGGTCATCTC CGAGACGGTC ACTCTGCGGT CTGCCGGTGG TGGCAACCTG
AAGGGGCTCT GCCCGTTCCA TGACGAGAAG AGCCCGTCGT TCAACGTGTC ACCGGCCCGT
AACGTCTGGT ACTGCTTCGG ATGTGGTGCC GGTGGCGACG CCATCAAGTT CCTGATGGAC
GCCGAGCATC TCAGCTTCGT CGAGTCCGTC GAGCGGCTGG CCGCCCGCGC CGGCCTGCAG
CTTCGCTACG TGGCGAACGA CCACACCGCT CCCCGCTCCC GGCCGCAGCA GGGGCAGAGG
CAGCGGCTGG TCGCCGCGCA CGCCGCTGCC GTCGAGTTCT ACCGGGCCCA GCTCACCACC
CCTGGTGCCC GCCCGGCCCG CGAGTTCCTC GCCCACCGTG GCTTCGACCG GGCCGCCGCC
GAGCGGTATG CCTGTGGCTT CGCGCCCGAC GGGTGGGATC TGTTGACCCG TCACCTGCGC
CAGCAGGGCT TCAGCCACGA CGAGTTGGTC ACGGCCGGGC TGTCTCGACC AGCCCGGTCG
GGCAGCCTCA TCGACCGGTT CCGGCGCCGG CTGCTCTGGC CCATCCGGGA CCTGACCGGC
GACGTCGTCG GCTTCGGCGC GCGCAAGCTG TTCGACGACG ACGACAGCCC GAAATACCTC
AACACCCCCG AGACACCGAT CTACAAGAAG TCCCACGTCC TCTATGGCAT CGACCAGGCC
AAGCGGGAGA TCGCCAAGCA GGGCAAGGTG GTCGTGGTCG AGGGCTACAC CGACGTGATG
GCCTGCCACC TGGCCGGGGT ACCGACCGCC GTGGCGACTT GTGGCACCGC CTTCGGTGCC
GATCACATCG GGGTGCTGCG CCGGCTGCTG CTGGACACCG ACGCCGTCGC GGGGGAGATC
ATTTTCACCT TCGACGGGGA CGCTGCCGGG CAGAAGGCGG CGTTGCGCGC GTTCGACGAC
GATCAGCGCT TCGTCGGGCG TACCTTCATC GCGGTCAGCC CGGACGGCAT GGATCCCTGC
GAGCTGCGCC TGGCCAAGGG TGAGCTGGCG GTCCGCGACC TGGTCGCGCG CCGCGAACCG
CTGGTCGACT TCGCGTTGCG ACACGTGATC AACCGGCACG ACCTCGACAC CGTCGACGGC
CGGGTGGAGG CGATGCGCCG GGCGGCCCCG TTGGTCGCCA AGCTCAAGGA CCGGGAAAAG
CGCCCGGAGT ACGTCCGCAA GCTCGCCGGG GACCTCGGCA TGGAGATCGA GCCGGTGCAG
CGGGCCGTGC TGGCCGCCGC GCACGCCGCA CCGTCCGGCG GGGCGCTGGG CAACCCCGCA
CCACGCGCAG CCGCGGCGGA GCCACAGGCA GACAGCCCGC AGTTGGCGGT CGAGCGGGAG
GCGCTGAAAC TGGCTCTCCA GGCGCCGGTG CTCGCCGGGC CGATGTTCGA CGCCGTGGAG
GCCGCCGAAT ACCGCCATCA GGTGCACGTC GCGGTCCGGG CAGCGGTGGC GGCGGCCGGC
GGAGCGGCCA CGGCCACGGG AGGCGCGGTG TGGATCGAGT CGGTCCGCAA CGCGTGCGAG
GACCTCACCG CCCAGGCGCT GGTCGGTGAA TTGGCCGTGG AACCGCTGCG CATCGACGGG
GAGCTCGATC CGCGCTACGT GTCGGTGACG ATGGCTCGTC TCCAGTGGGG GGCGGTGACC
GGACGCATCC GGGAGCTCAA GTCCAGAATC CAGCGGATCA ACCCGGTCAG CAACAAGGAC
GATTACTTCG CGGCCTTCGG TGAACTGCTG TCGCTCGAGC AACACGCCCG GGCGCTGCGC
GAGCAGGCCG CGGGCGGGCT GTGA
 
Protein sequence
MAEMTGRIRD EDIALVRERT SIAEVISETV TLRSAGGGNL KGLCPFHDEK SPSFNVSPAR 
NVWYCFGCGA GGDAIKFLMD AEHLSFVESV ERLAARAGLQ LRYVANDHTA PRSRPQQGQR
QRLVAAHAAA VEFYRAQLTT PGARPAREFL AHRGFDRAAA ERYACGFAPD GWDLLTRHLR
QQGFSHDELV TAGLSRPARS GSLIDRFRRR LLWPIRDLTG DVVGFGARKL FDDDDSPKYL
NTPETPIYKK SHVLYGIDQA KREIAKQGKV VVVEGYTDVM ACHLAGVPTA VATCGTAFGA
DHIGVLRRLL LDTDAVAGEI IFTFDGDAAG QKAALRAFDD DQRFVGRTFI AVSPDGMDPC
ELRLAKGELA VRDLVARREP LVDFALRHVI NRHDLDTVDG RVEAMRRAAP LVAKLKDREK
RPEYVRKLAG DLGMEIEPVQ RAVLAAAHAA PSGGALGNPA PRAAAAEPQA DSPQLAVERE
ALKLALQAPV LAGPMFDAVE AAEYRHQVHV AVRAAVAAAG GAATATGGAV WIESVRNACE
DLTAQALVGE LAVEPLRIDG ELDPRYVSVT MARLQWGAVT GRIRELKSRI QRINPVSNKD
DYFAAFGELL SLEQHARALR EQAAGGL
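
The relationship between the gene sequence, translation table 11, and the protein sequence can be illustrated with a short script. This is a minimal sketch using only the Python standard library, not the tool that produced this record. It assumes the standard NCBI layout of genetic code table 11 (same codon assignments as the standard code, with TTG, CTG, ATT, ATC, ATA, ATG and GTG accepted as start codons) and that an initiating start codon is translated as methionine. The helpers `clean`, `translate_cds` and `gc_fraction` are hypothetical names introduced here.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11 (assumption stated above).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS; an initial start codon becomes M, a stop codon ends it."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First display line of the gene sequence above.
first_line = "GTGGCGGAGA TGACGGGGCG GATCCGGGAC GAGGACATCG CGCTGGTCCG CGAGCGCACC"
print(translate_cds(first_line))  # MAEMTGRIRDEDIALVRERT
```

The gene begins with GTG, which the standard code reads as valine but which serves as a start codon under table 11, so the first residue is M, matching the first 20 residues of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the listed protein sequence and GC content to be checked in the same way.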