Gene Sare_4408 details


Gene Information

Locus tag: Sare_4408
Symbol:
ID: 5705510
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4981377
End bp: 4984226
Gene Length: 2850 bp
Protein Length: 949 aa
Translation table: 11
GC content: 69%
IMG OID: 641273827
Product: DNA topoisomerase I
Protein accession: YP_001539176
Protein GI: 159039923
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01051] DNA topoisomerase I, bacterial
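
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4981377, 4984226)
print(length)                     # 2850
print(protein_length_aa(length))  # 949
```

Both results agree with the Gene Length and Protein Length fields of this record.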


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0610462
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCGAGCA ATGCTGGAAC CACCCGTCTG GTCATCGTCG AGTCCCCGGC GAAGGCCAAG 
ACGATCTCGG GCTACCTCGG CCCGGGGTAC GTCGTGGAGG CCAGTTTCGG GCACGTCCGC
GACCTGCCGC GCAACGCCGC CGACGTGCCG GCGAAGTACA AGGGCGAGGC GTGGGCCCGG
CTCGGCGTCG ATGTCGACAA CGGCTTCCAC GCCCTCTACG TCGTCTCGTC GGACCGCCGG
CAGCAGATCA GCAAGTTGAC CAAGCTGGCC CGGGAGGTTG ACGAGATTCT CCTGGCCACG
GATGAGGACC GGGAGGGCGA GGCGATCGCC TGGCACCTGG TGGAGACGCT CAAGCCGAAG
GTGCCGGTCA AGCGGATGGT CTTCCACGAG ATCACCAAGC CGGCGATCCA GGCCGCGGTG
GCGAACCCGC GTGAGATCGA CCGGGACCTG GTCGACGCGC AGGAGGCCCG CCGCATCCTC
GACCGGTTGT ACGGCTACGA GGTGTCACCG GTGCTGTGGA AGAAGGTCAT GCCGAAGCTT
TCGGCGGGCC GGGTGCAGTC GGTGGCGACC CGCACCGTGG TCGAGCGGGA GCGCCAGCGG
ATGGCGTTCC GTACTGCCGA GTACTGGGAC ATCCTGGCCA CGCTGGCCGT CGAACAGGCC
GGCGAGGGCC CCCGCACCTT CAACGCCACC CTGGTGGCGC TCAACGGTGA CCGGATCGCC
GCCGGCAAGG ACTTCGAGCC GACCACCGGG CGGGTCCGTC CCGGCGCGGG TGTCGTCCAC
CTCGACGAGA GCGGGGCCCA CGGGCTGGCC GCCCGGTTGG CAGGTCGGCC ATTCACCGTC
ACCCGGGTCG AGGAGAAGCC CTACCGTCGC CGCCCCTACG CGCCGTTCAT CACCTCCACC
CTGCAACAGG AGTCGGCCCG CAAACTGCGC TTCTCCTCCC AGCAGACGAT GCGCACCGCG
CAGCGACTCT ATGAGAACGG CTACATCACC TATATGCGAA CCGACTCGGT GAACCTGTCG
GAGACCGCCA TCGCGGCGGC CCGCCGGCAG ATCGTCGAGT TGTACGGCGA GCGCAGTGTC
CCGCCGGAGC CGCGCCGCTA CACCGGCAAG GTGAAGAACG CGCAGGAGGC GCATGAGGCG
ATCCGCCCCG CGGGGGACAC CTTCCGCACC CCCGGTGAGC TGCGCAACGA ACTCTCGGTT
GAGGAGTACA AGCTCTACGA GTTGATCTGG CGGCGCACCA TCGCCTCCCA GATGACCGAC
GCGGTGGGTT TCAGTGTCTC GGTGCGTATC CGCGCCGTCA CCTCCGCCGG TGAGGAGGCC
GACTTCGGCG CCACCGGCAA GACCATCACC GACCCGGGTT TTCTCCGTGC CTATGTGGAG
TCCAGCGACG ACGAGAACGC CGAGGCGGAG GACGCCGAGC GGCGTCTGCC GACGCTGGTG
AAGGACCAGC CGCTTACCGG CGAGCAGCTG GTGGCGCAGG GTCACCACAC CCAGCCGCCC
GCCCGCTACA CCGAGGCGTC CCTGGTCAAA GCGCTGGAGG AACTGGGCAT CGGTCGCCCG
TCGACGTACG CGTCGATCAT GCAGACGATC CAGGACCGGG GGTACGTCGT CAAGCGTGGC
CAGGCGATGA TCCCGTCCTT CCTGGCGTTC GCCGTGGTCG GGCTGATGGA GCGGCACTAC
CCACGCCTGA TCGACTATGG CTTCACCGCC AGCATGGAGA ACGAGCTGGA CGAGATCGCC
GGTGGCGACC ACGCGGCGGT CGATTTCCTC ACCGCGTTCT ACTTCGGCAC CACCAACGGC
GCCGGTGACC AGGACATCGC CCGTTCCGGT GGGCTGAAGA AGCTGGTCAC CGAGAACCTC
AGCGACATCG ACGCACGGAG CGTCAACTCG ATACCCCTGT TCACCGACGA ACAGGGACGG
GTCGTCGTCG TCCGGGTGGG CCGCTACGGG CCGTACCTGC AGCGGGAACT ACCCGGCGAG
GCGGCGACGC CCGCCGACGG TGAGGAGGGC GGCGGCCAGG GCGACCGGGC ACCGATTCCA
GACGGTCTGG CACCCGACGA GCTGACCCCG GAGAAGGTGC ACGAGCTGTT CCTCGGCGGT
GGGGGCGAGC GCAAGCTCGG CGAGGACCCG GCCACCGGAG AGCCGATCCT GCTCAAGTCG
GGCCGGTTCG GCCCGTACGT GGCCAGCGGG GAGCGTAAGT CGTCACTGCT GCGCTCCCAG
TCGCCGGACG CGCTCACCCT CGAGGAGGCG TTGCGACTGC TGAGCCTGCC TCGCCTGGTC
GGTGTCGACC CGGAGGGCAA CGAGGTCTTC GCCAACAACG GCCGCTACGG ACCGTATGTC
AAGCGGGGCG AGGAGTTCCG GTCGCTGGAC TCCGAGGAGA AGATGTTCAC GGTCACGCTG
GATGAGGCGC TGGCCCTGCT GGCCGCCCCG AAGACCCGGC AGCGCCGGGC GCCCGCGCCT
CCACTGCGGG AACTGGGCGC CGACCCGCTG ACCGAGAAAC CGCTGGTCAT CAAGGATGGG
CGGTTCGGGC CGTACGTGAC CGACGGTGAG ACCAACGCGT CATTGCGGCG CGGGCAGACG
CCGGAGGCGT TGAGCCTGGA GCAGGCGTCG GAGATGCTCG CCGAGAAGCG GGCGAAGGGT
CCGGCGCCGC GGAAACGGGC GGCCAAGAAG GCCACGCCGG CCAAGAGGTC CACGCCGGCG
AAGAAGGCGG CGGCGACGAA GACCACCGCG ACGGGCAAAG CCACGAAGAA GACCGCCGCG
GCGACCAAGG TGGCGAAGAA GACGGCGACG GCGAAGACCA CAACGGCCAA GCAGGCGAAA
CCGAAGCGGG CGAGCAGCAC GACCGAGTGA
 
Protein sequence
MPSNAGTTRL VIVESPAKAK TISGYLGPGY VVEASFGHVR DLPRNAADVP AKYKGEAWAR 
LGVDVDNGFH ALYVVSSDRR QQISKLTKLA REVDEILLAT DEDREGEAIA WHLVETLKPK
VPVKRMVFHE ITKPAIQAAV ANPREIDRDL VDAQEARRIL DRLYGYEVSP VLWKKVMPKL
SAGRVQSVAT RTVVERERQR MAFRTAEYWD ILATLAVEQA GEGPRTFNAT LVALNGDRIA
AGKDFEPTTG RVRPGAGVVH LDESGAHGLA ARLAGRPFTV TRVEEKPYRR RPYAPFITST
LQQESARKLR FSSQQTMRTA QRLYENGYIT YMRTDSVNLS ETAIAAARRQ IVELYGERSV
PPEPRRYTGK VKNAQEAHEA IRPAGDTFRT PGELRNELSV EEYKLYELIW RRTIASQMTD
AVGFSVSVRI RAVTSAGEEA DFGATGKTIT DPGFLRAYVE SSDDENAEAE DAERRLPTLV
KDQPLTGEQL VAQGHHTQPP ARYTEASLVK ALEELGIGRP STYASIMQTI QDRGYVVKRG
QAMIPSFLAF AVVGLMERHY PRLIDYGFTA SMENELDEIA GGDHAAVDFL TAFYFGTTNG
AGDQDIARSG GLKKLVTENL SDIDARSVNS IPLFTDEQGR VVVVRVGRYG PYLQRELPGE
AATPADGEEG GGQGDRAPIP DGLAPDELTP EKVHELFLGG GGERKLGEDP ATGEPILLKS
GRFGPYVASG ERKSSLLRSQ SPDALTLEEA LRLLSLPRLV GVDPEGNEVF ANNGRYGPYV
KRGEEFRSLD SEEKMFTVTL DEALALLAAP KTRQRRAPAP PLRELGADPL TEKPLVIKDG
RFGPYVTDGE TNASLRRGQT PEALSLEQAS EMLAEKRAKG PAPRKRAAKK ATPAKRSTPA
KKAAATKTTA TGKATKKTAA ATKVAKKTAT AKTTTAKQAK PKRASSTTE
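
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the tool that produced this record. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may start translation, so a permitted start codon such as GTG in the first position is read as M. The start codon set and all function names here are assumptions made for illustration.

```python
# Codon table in NCBI order (bases T, C, A, G). Assumption: table 11 shares
# these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: initiation codons accepted under translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # an initiating codon is read as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "GTGCCGAGCA ATGCTGGAAC CACCCGTCTG GTCATCGTCG AGTCCCCGGC GAAGGCCAAG"
print(translate(first_line))  # MPSNAGTTRLVIVESPAKAK
print(gc_content(first_line))
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the complete gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.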