Gene Strop_4027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_4027 
Symbol 
ID5060509 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4576864 
End bp4579695 
Gene Length2832 bp 
Protein Length943 aa 
Translation table11 
GC content69% 
IMG OID640476288 
ProductDNA topoisomerase I 
Protein accessionYP_001160835 
Protein GI145596538 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.563129 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGAGCA ATACTGGAAC CACCCGTCTG GTCATCGTCG AGTCCCCGGC GAAGGCCAAG 
ACGATCTCGG GCTACCTCGG CCCGGGGTAC GTCGTGGAGG CCAGTTTCGG GCACGTCCGC
GACCTGCCGC GCAATGCCGC CGATGTGCCG GCGAAGTACA AGGGCGAGGC GTGGGCCCGG
CTCGGCGTCG ATGTCGACAA CGGCTTCCAC GCCCTCTACG TCGTCTCGTC GGACCGCCGG
CAGCAGATCA GCAAGTTGAC CAAGCTGGCC CGGGAGGTTG ACGAGATCCT CCTGGCCACG
GATGAGGACC GGGAGGGCGA GGCGATCGCC TGGCACCTGG TGGAGACGCT CAAGCCGAAG
GTTCCGGTCA AGCGGATGGT CTTCCACGAG ATCACCAAGC CGGCGATCCA GGCCGCGGTG
GCGAACCCGC GCGAGATCGA CCGCGATCTG GTCGATGCCC AGGAGGCCCG CCGCATCCTC
GACCGGTTGT ACGGCTACGA GGTGTCCCCG GTGCTGTGGA AGAAGGTCAT GCCGAAGCTC
TCGGCGGGCC GGGTGCAGTC GGTGGCGACC CGCATCGTGG TCGAGCGGGA GCGGCAGCGG
ATGGCGTTCC GCACCGCCGA GTACTGGGAC ATCCTCGCCA CGCTCGCCGG CGAACAGCCC
GGCGAGGGCC CCCGTACCTT CAACGCCACC CTGGTCGCGC TCAACGGTGA CCGGATCGCC
ACCGGCAAGG ACTTCGAGCC GACCACCGGG CGGGTCCGTC CCGGTGCCGG CGTGGTGCAC
CTCGACGAGG GCGGGGCCCG GGGGCTGGCC GCCCGGCTGG CGGATCGGCC GTTCACCGTC
ACCCGGGTCG AGGAGAAGCC GTACCGTCGC CGTCCGTACG CGCCGTTCAT CACCTCGACG
CTGCAACAGG AGTCGGCCCG CAAGCTGCGC TTCTCCTCCC AGCAGACGAT GCGGACCGCG
CAGCGGCTCT ACGAGAACGG CTACATCACC TATATGCGTA CCGACTCGGT GAACCTGTCG
GAGACCGCCA TCGCGGCGGC TCGCCGGCAG ATCGTCGAGC TGTACGGGGA ACGTAGCGTT
CCACCGGAGC CGCGTCGCTA CACCGGCAAG GTGAAGAACG CGCAGGAGGC GCATGAGGCG
ATCCGCCCCG CGGGCGACAC CTTCCGTACC CCTGGTGAGC TGCGCAACGA GCTGTCGGTT
GAGGAGTACA AGCTTTACGA ACTGATCTGG CGGCGCACCA TCGCCTCCCA GATGACCGAT
GCTGTGGGCT CCAGCGTTTC GGTGCGGATT CGTGCCGTCA CGGCCGCCGG TGAGGAGGCG
GACTTCGGCG CCACCGGTAA GACCATCACC GACCCGGGTT TCCTCCGTGC CTACGTGGAG
TCCAGCGACG ACGAGAACGC CGAGGCGGAG GATGCCGAGC GGCGTCTGCC ACATCTGGTG
AAGGACCAGC CGCTGACCGG CGAGCAGCTC GCCGCGCAGG GGCACCACAC CCAGCCGCCC
GCCCGTTACA CCGAGGCGTC CCTGGTCAAG GCGCTGGAGG AGCTGGGTAT CGGCCGCCCC
TCCACCTATG CGTCGATTAT GCAGACCATC CAGGACCGGG GGTATGTCGC CAAGCGTGGC
CAGGCGATGA TCCCGTCGTT CCTGGCCTTC GCCGTGGTCG GGCTGATGGA GCGGCACTAC
CCGCGCCTGA TCGACTACGG CTTCACCGCC AGCATGGAGA ACGAGCTGGA CGAGATCGCC
GGTGGCGACC ACGCGGCGGT GGACTTCCTG ACCGCGTTCT ACTTCGGCAT CACCAACGGC
GCCGGTGACC AGGACATTGC CCGCTCCGGT GGGCTGAAGA AGCTGGTCAC GGAGAATCTG
AGTGAGATCG ACGCGCGGAG CGTCAACTCG ATTCCGCTCT TCAGCGACGA CGAGGGGCGG
CCAGTCGTCG TCCGGGTGGG CCGCTACGGG CCGTACCTAC AGCGGGAGTT GCCCGGCGAG
GCGGCCGCCG CCGACGGTGA GGAGGGCGGC GGCCAGGGCG ACCGGGCTCC GATTCCGGAG
GGGCTGGCCC CGGATGAGCT GACCCCGGAG AAGGTGCACG AGCTGTTCCT CGGTGGCGGT
GGCGAGCGCA AGCTCGGGGA GGACCCGGCC ACCGGGGAGC CGATTCTGCT CAAGTCGGGC
CGGTTCGGCC CCTACGTGGC CAGTGGGGAG CGTAAGTCGT CCCTGCTGCG CTCCCAGTCG
CCGGACGCCC TCACCCTCGA CGAGGCACTG CGGCTGTTGA GCCTGCCTCG GCTGGTCGGT
ACTGACCCGG AGGGCAACGA GGTCTTCGCC AACAACGGCC GCTACGGCCC GTACGTCAAG
CGGGGCGAGG AGTTCCGGTC GCTGGAGTCC GAAGAGAAGA TGTTCACGGT CACGCTGGAC
GAGGCGTTGG CTCTGCTGGC CGCTCCGAAG ACCCGGCAGC GTCGGGCCCC CGCGCCTCCG
CTGCGGGAGT TGGGCGCCGA CCCGTTGACC GAGAAGCCGC TGGTCATCAA GGACGGCCGG
TTCGGGCCGT ACGTGACCGA CGGGGAGACC AACGCGTCGT TGCGGCGTGG TCAGACGCCG
GAGGCACTGA GCCTGGAGCA GGCGTCGGAG ATGCTGGCCG AGAAGCGGGC GAAGGGCCCG
GCGCCGCGGA AGAAGGCGGC CAAGAAGGCC GCGCCGGCCA AGAAGGCGAC GCCGGCCAAG
AAGGCGACGA AGAAGGCGGC AGCGACGAAG AAGACCGCCG CGGCGACCAA GTCAGCGAAG
AAGACCGCGA CGGTGAACAG CACCGCGGCC AAGCAGGCGA AGCCGAAGCG GGCGGGCAGC
GCGGCCGAGT GA
 
Protein sequence
MPSNTGTTRL VIVESPAKAK TISGYLGPGY VVEASFGHVR DLPRNAADVP AKYKGEAWAR 
LGVDVDNGFH ALYVVSSDRR QQISKLTKLA REVDEILLAT DEDREGEAIA WHLVETLKPK
VPVKRMVFHE ITKPAIQAAV ANPREIDRDL VDAQEARRIL DRLYGYEVSP VLWKKVMPKL
SAGRVQSVAT RIVVERERQR MAFRTAEYWD ILATLAGEQP GEGPRTFNAT LVALNGDRIA
TGKDFEPTTG RVRPGAGVVH LDEGGARGLA ARLADRPFTV TRVEEKPYRR RPYAPFITST
LQQESARKLR FSSQQTMRTA QRLYENGYIT YMRTDSVNLS ETAIAAARRQ IVELYGERSV
PPEPRRYTGK VKNAQEAHEA IRPAGDTFRT PGELRNELSV EEYKLYELIW RRTIASQMTD
AVGSSVSVRI RAVTAAGEEA DFGATGKTIT DPGFLRAYVE SSDDENAEAE DAERRLPHLV
KDQPLTGEQL AAQGHHTQPP ARYTEASLVK ALEELGIGRP STYASIMQTI QDRGYVAKRG
QAMIPSFLAF AVVGLMERHY PRLIDYGFTA SMENELDEIA GGDHAAVDFL TAFYFGITNG
AGDQDIARSG GLKKLVTENL SEIDARSVNS IPLFSDDEGR PVVVRVGRYG PYLQRELPGE
AAAADGEEGG GQGDRAPIPE GLAPDELTPE KVHELFLGGG GERKLGEDPA TGEPILLKSG
RFGPYVASGE RKSSLLRSQS PDALTLDEAL RLLSLPRLVG TDPEGNEVFA NNGRYGPYVK
RGEEFRSLES EEKMFTVTLD EALALLAAPK TRQRRAPAPP LRELGADPLT EKPLVIKDGR
FGPYVTDGET NASLRRGQTP EALSLEQASE MLAEKRAKGP APRKKAAKKA APAKKATPAK
KATKKAAATK KTAAATKSAK KTATVNSTAA KQAKPKRAGS AAE