Gene Sare_0001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0001 
Symbol 
ID5707373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp148 
End bp1923 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content69% 
IMG OID641269524 
Productchromosomal replication initiator protein DnaA 
Protein accessionYP_001534928 
Protein GI159035675 
COG category[L] Replication, recombination and repair 
COG ID[COG0593] ATPase involved in DNA replication initiation 
TIGRFAM ID[TIGR00362] chromosomal replication initiator protein DnaA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000633656 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000292755 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
GTGGCCGGTA CGACCGACCT TGCCGCAGTG TGGACGGCGA CGACTGACGA ACTCGCCGAC 
GAGATCATAT CTGCGCAGCA ACGGGCGTAC CTGCGACTGA CCCGGCTCCG GGCCATCGTC
GAGGACACCG CGCTCCTGTC CGTCCCGGAC GCGTTCACCC GGGACGTCAT CGAGTCCCGG
CTACGCCCAG CGATCACCGA GGCGCTCACC CGCCGACTGG GCCGGCCCAT CCAGGTGGCC
GTTACGGTGC GGGTGACGGA GGACGGCGCC AGTCGCCCGA CAGGCACGCT TTACCACAGC
GCCCCGAACC CGGAGACCGG CGCGTTACCC CTCGACACCG TCGACGGCCT CGCGGCGACA
GGTCCGGACG ACGGCGCGGA CCACCACTTT CCGACGATCC CCGGGCAGAC CCAGCCGGAG
CACCGGACAG CGACGCCCGT CGGACCGTAC GGCGCCGAGC AGACCACGCA GGTGCCCCGG
GACGGGCAGG AACCGCTGTT CGGTAACGCC TTCGCCGGGC CGCCCCACCC CGAGCGACGC
GATGGTGGCG AGCAGGCCCT GCTGGTCCCG CCGGCGACCG ACCTTCCGTT CGACGGCCGT
TACCGGACCG ACGGGGTCGC ACCGCGCGAC CAGCACGGGA TCCGGGCACT GCCCCGCGAC
CACGGCACCG ACAGCGGGCC GGGGCGGGTG GACCACCGGC CCGGCGGCCG GGAGGATCGG
CGCTTGTCCG GGCCCGCCGA CGGTGGCGGC AACCGGCTCA ACCCCAAGTA CATGTTCGAG
ACGTTCGTCA TCGGCTCGTC GAACCGGTTC GCCCACGCGG CTTCGGTGGC GGTAGCCGAG
TCGCCGGCGA AGGCGTACAA CCCGCTGTTC ATCTACGGCA GCTCGGGACT GGGTAAGACG
CACCTGCTGC ACGCGATCGG CCATTACGCC ACGACGCTGG GCAACGCCCA CTCGGTCCGG
TACGTCTCGA CCGAGGAGTT CACCAACGAC TTCATCAACT CGCTACGCGA CGACAAGACG
AGCGCCTTCC AGCGCCGGTA CCGGGACGTG GACATCCTCC TGATCGACGA CATCCAGTTC
CTGGAGAACC GGGAGCGAAC GCAGGAGGAG TTCTTCCACA CGTTCAACAC GCTGCACAAC
GCCAACAAGC AGATCGTGAT CACCTCCGAC CGGTCGCCCA AGCAGCTGGC GACCCTGGAG
GACCGGTTGC GGACCCGGTT CGAATGGGGC CTCCTCGCCG ACATCCAGCC GCCAGACCTG
GAGACCCGGA TCGCGATCCT GCAGAAGAAG GCCGCCCAGG AGCGGCTGTT CGCCCCGCCG
GACGTACTCG AGTTCATCGC CTCCCGGGTG TCGAACTCGA TCCGGGAACT CGAGGGCGCG
TTGATCCGGG TCACCGCGTT CGCCAGCCTC ACCCGGTCGT CGGTGGAGTT GTCACTGGCC
GAGGAGGTGC TGCGGGACTT CATTCCGGAC GGCGCCGGGC CAGAGATCAC CGCCGACCAG
ATCATGGTGG CCACCGCGGA CTACTTCGGG GTGAGCCTGG AAGACCTGCG CGGGCACTCG
CGGTCACGGG TTCTCGTCAA CGCCCGCCAA GTCGCCATGT ACCTGTGCCG GGAGCTGACC
GACCTGTCGC TGCCCCGAAT CGGGCAGGCG TTCGGGGGCC GCGACCACAC CACGGTCATG
CACGCCGACC GCAAGATCCG TCAGCAGATG GCAGAACGCC GGTCGCTCTA CAACCAGATC
GCCGAGCTGA CCAACCGGAT CAAGCAGAAC ACCTGA
 
Protein sequence
MAGTTDLAAV WTATTDELAD EIISAQQRAY LRLTRLRAIV EDTALLSVPD AFTRDVIESR 
LRPAITEALT RRLGRPIQVA VTVRVTEDGA SRPTGTLYHS APNPETGALP LDTVDGLAAT
GPDDGADHHF PTIPGQTQPE HRTATPVGPY GAEQTTQVPR DGQEPLFGNA FAGPPHPERR
DGGEQALLVP PATDLPFDGR YRTDGVAPRD QHGIRALPRD HGTDSGPGRV DHRPGGREDR
RLSGPADGGG NRLNPKYMFE TFVIGSSNRF AHAASVAVAE SPAKAYNPLF IYGSSGLGKT
HLLHAIGHYA TTLGNAHSVR YVSTEEFTND FINSLRDDKT SAFQRRYRDV DILLIDDIQF
LENRERTQEE FFHTFNTLHN ANKQIVITSD RSPKQLATLE DRLRTRFEWG LLADIQPPDL
ETRIAILQKK AAQERLFAPP DVLEFIASRV SNSIRELEGA LIRVTAFASL TRSSVELSLA
EEVLRDFIPD GAGPEITADQ IMVATADYFG VSLEDLRGHS RSRVLVNARQ VAMYLCRELT
DLSLPRIGQA FGGRDHTTVM HADRKIRQQM AERRSLYNQI AELTNRIKQN T