Gene Ksed_25040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagKsed_25040 
Symbol 
ID8374007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameKytococcus sedentarius DSM 20547 
KingdomBacteria 
Replicon accessionNC_013169 
Strand
Start bp2581524 
End bp2584415 
Gene Length2892 bp 
Protein Length963 aa 
Translation table11 
GC content71% 
IMG OID644992730 
ProductDNA topoisomerase I 
Protein accessionYP_003150233 
Protein GI256826273 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones59 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAGCCA AGGGCAGCAA GCTCGTGATC GTGGAGTCGC CGACCAAGGT GAAGTCCATC 
GGCCAGTACC TGGGGGACGA GTACCGCGTG GAGGCCTCCG TCGGCCACAT TCGCGACCTG
CCCACGCCCA GCGAGATGCC CGCAGACATG AAGACGGGGC CCTACGGCAA GTTCGCCGTG
GACGTGGACA ACGGCTTCGA CCCCTACTAC GTCATCGACG CGGACAAGAA GAAAAAGGTC
ACCGAGCTGC GCAAGGCCCT CAAGGAGGCC GACGAGCTCT ACCTCGCCAC CGACGAGGAC
CGCGAGGGCG AGGCCATCGC CTGGCACCTG CTGCAGGCGC TCAAGCCCAA AGTGCCGGTC
AAGCGCATGG TCTTCCACGA GATCACCAAG GACGCCATCC AGCACGCGGT GAACACCACC
CGCGACCTCG ACGACCGCAT GGTCGACGCC CAGGAGTCCC GCCGCATCCT CGACCGCCTC
TACGGCTACG AGGTCTCCCC GGTGCTCTGG CGCAAGGTCA AGCAGGGCCT GTCCGCCGGC
CGTGTGCAGT CCGTCGCCAC CCGCCTGGTG GTGGAGCGCG AGCGCGCCCG CATGGCCTTC
CGCATCGCGT CCTACTGGGA CGTGGAGGCC GACCTCGCCC CCGCCGACGA TCAGCAGTTC
ACCGCGCGCC TCACCAGCCT CGATGGGGTG CGCGTGGCCA CCGGCCGCGA CTTCACCGAT
GCGGGGGAGC TCAAGTCCTC CGCCCAGGTC ACCCACCTGG CCGAGGCCGA CGCCCGGGCC
GTCGCCTCCG GCATCGAGGC CGCGCAGGTG ACCGTCACCG ATGTCAGCGA GAAGCCGTAC
ACGCGCAAGC CGGCCGCGCC GTTCATCACC TCCACCCTGC AGCAGGAGGC CAGTCGCAAG
CTGCGCCTGG GCTCCAAGGA CGCCATGCGG GTGGCGCAGC GGCTGTACGA GAACGGCTAC
ATCACCTACA TGCGTACCGA CTCGACCACC TTGTCGCAGT CGGCCATGAC CGCGGCCCGT
CAGCAGGCTC GTGACCTCTA CGGCGCGGAC TACGTGCCGG ACTCCCCACG CTTCTACGGC
AAGAAGGCCA AGGGCGCGCA GGAGGCCCAC GAGGCCATCC GTCCCGCCGG CGACACCTTC
CGCACCCCGG CCCAGGTGGC CGGCGAGCTG CGCGGCAGCG AGTACGCCAT GTACGAGCTC
ATCTGGAAGC GCACGGTCGC CTCCCAGATG GCTGACGCCA AGGGCTCCAC GGCCACCGTC
CGCATCGGCG CGCCGCTGCA GGGCGTGCAG GTCGGGGGCA AGCCGGCGCG CCAGGCCGAG
CTCACCGCCT CGGGCACGGT CATCACCTTC CGGGGCTTCC TGGCCGCCTA CGAGGAGGGG
CGCGACGCCG ACCGTTACGG CGGGGGCGAG CAGGGCGGCT CGGGCGACGC CGGCGCGAAG
TCCGGGAAGG GTGCCAAGGA GGTGCGCCTG CCGAAGATGG CCGCAGGGCA GGAGCTGGCC
ACCTCCCGGG TGGAGGCGCT GGGGCACGAG ACCTCCCCGC CGCCGCGGTA CACCGAGGCC
ACGTTGGTGA AGGCGCTGGA GGAGAAGGGC ATCGGCCGCC CGTCCACCTA CGCGGCGACC
GTCGGCACCA TCCAGGACCG CGGGTACGTG CGCACCAAGG GCAATGCGTT GGTGCCCACC
TGGCTGGCGT TCGCCGTGAC CACCCTGTTG GAGAAGCACT TCCCCACGTT GGTGGACTAC
GACTTCACCG CATCGATGGA GGAGGGGCTC GACGCGATCG CCGCTGGCGA GGAGGACCGC
GTCGCCTGGC TGCAACGGTT CTACTTCGGT GACGAGGCGT CCTCGGCCAC CGGGTTGCGT
GAGCTGGTGG AGGACCTCGG TGAGATCGAC GCCAAGGGGG TTTCCACGAT CGACATCGGC
GACGGGATCG TCGTGCGCGT GGGTCGTTAC GGGCCGTACG TGGAGGAGAT CGCGCCGGCG
GGCACCGACC TGACCACCGG AGAGGTGCCG GACGATGCGG GGAGCGCCGC CCGTGCCGCG
GACGGCACCG CTGCCTCGGG CGAGGAGGAG GCCAAGCCGC TGCGCGCCAG CATCCCGGAG
GACATCGCGC CGGACGAGCT GACCCCGGCG ATGGCGCGGG AGTACCTGGC CGAGGCCGCC
TCCGACGGGC GAGTGCTGGG TCAGGACCCC GAGACCGGCC GCGACATCGT GGCCAAGACC
GGGCGGTACG GGCCGTACGT CTCCGAGGTG TTCACCGACG AGGACGTCGC CCGCTTCGAG
GCCGAGGGCA AGAAGACCAC CGGCCGCGGC AAGGACCTGG TGAAGCCGCG GACCGGCTCG
CTGCTGGCCT CCATGGACGT GCGGACGGTG ACGCTCGAGG ACGCCCTGAA GATCCTCTCG
CTGCCGCGGG TGGTGGGTGC GGACCCGGAG AGCGGCGCCG AGATCACCGC GCAGAACGGG
CGTTACGGGC CGTACCTGAA GAAGGGTACG GACTCGCGCT CGCTGGCCAG TGAGGAGCAG
ATCTTCGAGA TCACCCTCGA GGAGGCCCTG AAGATCTACG CCGAGCCCAA GCGGCGCGGG
CGCTCGGCGG CCAACCCGCC GCTGGCGGAG TTCGCCGAGG ACCCGGTCTC GGGCAAGAAG
GTGGTCATCA AGGACGGCCG CTTCGGGCCC TACATCACCG ATGGCGAGAC GAACGTGACC
GTGCCGCGCG CCATGCGGCC GGAGGACGTC TCGGAGCTGC AGGCCTTCGA GCTGCTCGCG
CAGAAGCGGG CCGAGGGGCC GAAGAAGAAG CCGGTGCGCA AGAGCTCGGG CACCACCCGC
AAGGCGCCGG CGAAGAAGCC GGCCGCCGGG ACTGCGCGCA CGGTGAAGGC CGGGAGCCGC
AAGAAGAGCT GA
 
Protein sequence
MAAKGSKLVI VESPTKVKSI GQYLGDEYRV EASVGHIRDL PTPSEMPADM KTGPYGKFAV 
DVDNGFDPYY VIDADKKKKV TELRKALKEA DELYLATDED REGEAIAWHL LQALKPKVPV
KRMVFHEITK DAIQHAVNTT RDLDDRMVDA QESRRILDRL YGYEVSPVLW RKVKQGLSAG
RVQSVATRLV VERERARMAF RIASYWDVEA DLAPADDQQF TARLTSLDGV RVATGRDFTD
AGELKSSAQV THLAEADARA VASGIEAAQV TVTDVSEKPY TRKPAAPFIT STLQQEASRK
LRLGSKDAMR VAQRLYENGY ITYMRTDSTT LSQSAMTAAR QQARDLYGAD YVPDSPRFYG
KKAKGAQEAH EAIRPAGDTF RTPAQVAGEL RGSEYAMYEL IWKRTVASQM ADAKGSTATV
RIGAPLQGVQ VGGKPARQAE LTASGTVITF RGFLAAYEEG RDADRYGGGE QGGSGDAGAK
SGKGAKEVRL PKMAAGQELA TSRVEALGHE TSPPPRYTEA TLVKALEEKG IGRPSTYAAT
VGTIQDRGYV RTKGNALVPT WLAFAVTTLL EKHFPTLVDY DFTASMEEGL DAIAAGEEDR
VAWLQRFYFG DEASSATGLR ELVEDLGEID AKGVSTIDIG DGIVVRVGRY GPYVEEIAPA
GTDLTTGEVP DDAGSAARAA DGTAASGEEE AKPLRASIPE DIAPDELTPA MAREYLAEAA
SDGRVLGQDP ETGRDIVAKT GRYGPYVSEV FTDEDVARFE AEGKKTTGRG KDLVKPRTGS
LLASMDVRTV TLEDALKILS LPRVVGADPE SGAEITAQNG RYGPYLKKGT DSRSLASEEQ
IFEITLEEAL KIYAEPKRRG RSAANPPLAE FAEDPVSGKK VVIKDGRFGP YITDGETNVT
VPRAMRPEDV SELQAFELLA QKRAEGPKKK PVRKSSGTTR KAPAKKPAAG TARTVKAGSR
KKS