Gene CPR_2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2039 
Symbol 
ID4206374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2252257 
End bp2255196 
Gene Length2940 bp 
Protein Length979 aa 
Translation table11 
GC content30% 
IMG OID642566589 
ProductDNA topoisomerase IV subunit A 
Protein accessionYP_699348 
Protein GI110803503 
COG category[L] Replication, recombination and repair 
COG ID[COG0188] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTAAGA AGAATATTGA AATACCTAAG GATAATAATA TTATCAGAAT GCCTCTTGAA 
GAGGTTATGC CTGATAACTA TTTACCTTAT GCAGTAGAAG TTGCTAAGGA TAGAGCACTT
CCAGATGTAA GAGATGGATT AAAGCCTGTA CACAGAAGAA TTTTATATGG GGCGTATATG
TTAAAGGCTT TTCCAGATAA ACCTTATTAC AAATCAGCTA GAATAGTTGG GGATATACTA
GGTAAATATC ACCCACATGG AGATAGTTCA GTTTATGATG CTATGGTAAT TTTAGCTCAA
AATTTTAGTA CAAGAGCTCC TCTTATTGAT GGACACGGAA ACTGGGGAAG TATAGATGGA
GATGGAGCAG CGGCTATGAG ATATACAGAG GCTAGACTTT CTAGTATATC TATGGAGATG
CTTAGAGATA TAGAAAAAAA TGTAGTAGAC ATGGTTCCAA ACTACTCAGA TTCTGAAATG
GAACCTAAGG TATTACCAGC TAGATATCCA AATCTCTTAG TAAATGGTAC TTTTGGTATA
GCAGTTGGAC TTTCAACTAA TATACCACCA CATAATTTAA GAGAAGTTAT AGATGGTACT
TTAGCTTATA TAGATAATAA TGAAATTACT ACTAGAGAAT TAATGAACTA TATAAAAGGA
CCAGACCTTC CAACAGGAGG AGTTTTAATT GGTGAGAAAA CTTTATTATC TGCTTATGAA
ACAGGAGAAG GAAAAGTAAC TTTAAGAGCT AAGGCTAAAA TTGAAACTTT AGAAAATGGA
AGACTTGGAA TAGTAATAAC TGAATTCCCT TATAGAAGAA ATAAGGCTAG AATACTTCAA
ACAATATCAG ATATGACTGG TGACAAGAGA CATGCTAAAG CTTTAGATGG AATAGTGGAT
ATTAGAGATG AGTCAGATAG AACTGGTATA AGAGCCGTAA TAGAATTTAA GAAGGCTGTA
GATCACGATA TGGCTGATAA GGTTCTTAAG TATCTTTATA AGAAAACTGA TCTTCAAGGT
AACATAAGCT TTAATATGGT TGCCTTAGCA GATGGAAAAC CAGAGACTAT GGGATTAAAA
ACAATAATAT CTCATTATGT AAACCATCAA AAGGATGTTG TTACAAGAAG AACAAAAAGA
GAGTTAGAAG TAGCAGAAAA GAGATTTCAC ATAGTTGAAG GTTTCATAAA GGCTATAGGA
ATAATGGATG AGGTTATAGC TACAATTAGA GCTTCAAAAT CAAAGAAAGA TGCTCATGAG
AATTTAGTTT TAAAATTTGG ATTTACTGAC TTACAGGCAG AGGCAATTCT AGAATTAATG
CTTTATAGAT TAACTGGATT AGAGATAAAA GTATTCCAAA AAGAGCATAA AGAATTATCA
AAGAAAATAA AGGCACTTAG AAAAATTTTA GAAAATGAAT CAGTTCTTTT AGGAGTTATA
AAAGATGAAT TAAAAGAAGT GGCTGAAGTT TATGGTGATG AAAGAAGAAC TGCCTTAATA
GAAGATGAAA GCGAAGCTAA GATAGATCTA GAAGAGTTAA TAGTGGCAGA GGATGTTATG
GTTACTCTAT CAAACGAAGG ATTTATAAAG AAGATTCCTC TTAAGACTTA TAATCGTTCA
AATGTTGATG AAAATGAAAT TGAATATAGA GAAGGAGATT ATTTAAAATT CCTTATTAAA
TCAAATACTA AGGATACCTT AGCTATATTC ACTGATAAAG GAACTGTTTA TCAAATAAAA
TGTAATTCTG TAGCAGATAA GAAATGGAAA GATAAAGGAG AAAGACTTGA GGATTTAATA
AGAGGATTAA GTTTAGAAGA TGAAAAAATT ATTGCTCTTG AATCAATTGA AAATTTCCTT
CCAAACAAAT GCTTTAAATT TATAACTGCT AATGGATTAA TAAAGAAGAC TACTTTAGAT
AAATTTGTTA CTGCTTACTC AAAACTTATG GCTATAAAGC TTAAAAATGA CGATTTATTA
GCAAGTGTAT CTTTAATAGA TTCACAGGAT GAAGAAAGAT TTGTTGAAAT TGAAACAACT
AATGGATTAA ATTTTGTTGT TTCAGAACCA GAATTAGAGT TTACAGATAG AAATATACTA
GGAGTTCAAT TAGTACCATT AAAGAGTGGT AACCAAATTA AGAGTATAAG ATTTGTAGAT
AACTATGAAT ATAAAGAATT TATCATAGGA ATAAATAAAA AGGGAAATAT AAAAACTTTC
AGCAATATGA ATAGCAATTC TTATGAAAAG GTAAAGGTTA ATTCCTTTAG AAATATAATT
GCTTTCTCAA ATAAAGGAAA GGTATTTAAA TTCCCAGCAT ACTTACTTCA AAATACAGAA
GAAAGTAATA TTTCAGACTT AGTAGATGGA TTTGAAAAGG ATGAACTTAT AATAAAGGTA
GCTCCTATAA ATGAATTTGG AAAAATAGGT GAAGATTTAT TTGTTTACTT CTTCTCAAGA
GAAGGATTAG TTAAAAAGAC ATCTTTAAGA GAGTTCTTAG GAGAATTTAA TAATCAAATA
GCTTATAAGT TTAAAACTCC AAAGGATGAA TTAGTAAATG TAGATATAAA TTTTGAAAAT
GCTACAGTAA TCTTAGTAAC TAAGAATGGT ATGGGAATTA AGTTCTTAGC TACAGCTATT
AATCCAATGG GAAGAATAGC TTCAGGGGTA ACAGGAATAA GCTTAAAAGA TGATAATAAA
GTTATATTTG GTAAGGTTAT ACCACCATCT GAAGGTATTG ATGATAAAAC TTTAGAGGCG
TATAATGACT ATAAAAAAGA ATTAACTAGT AATTATGAAA AACTTGTTTT AGAGTCTAAG
CAGAAGGAAA AAGCCGAAGT CAATATTGAA GATATTAAAC TACAAAATAG AGCGGGAAGA
GGAAGTAGTT TAATGATTTT AGTATTGGAA GACTATATAA GGGACGTAAT TATTAAGTAG
 
Protein sequence
MAKKNIEIPK DNNIIRMPLE EVMPDNYLPY AVEVAKDRAL PDVRDGLKPV HRRILYGAYM 
LKAFPDKPYY KSARIVGDIL GKYHPHGDSS VYDAMVILAQ NFSTRAPLID GHGNWGSIDG
DGAAAMRYTE ARLSSISMEM LRDIEKNVVD MVPNYSDSEM EPKVLPARYP NLLVNGTFGI
AVGLSTNIPP HNLREVIDGT LAYIDNNEIT TRELMNYIKG PDLPTGGVLI GEKTLLSAYE
TGEGKVTLRA KAKIETLENG RLGIVITEFP YRRNKARILQ TISDMTGDKR HAKALDGIVD
IRDESDRTGI RAVIEFKKAV DHDMADKVLK YLYKKTDLQG NISFNMVALA DGKPETMGLK
TIISHYVNHQ KDVVTRRTKR ELEVAEKRFH IVEGFIKAIG IMDEVIATIR ASKSKKDAHE
NLVLKFGFTD LQAEAILELM LYRLTGLEIK VFQKEHKELS KKIKALRKIL ENESVLLGVI
KDELKEVAEV YGDERRTALI EDESEAKIDL EELIVAEDVM VTLSNEGFIK KIPLKTYNRS
NVDENEIEYR EGDYLKFLIK SNTKDTLAIF TDKGTVYQIK CNSVADKKWK DKGERLEDLI
RGLSLEDEKI IALESIENFL PNKCFKFITA NGLIKKTTLD KFVTAYSKLM AIKLKNDDLL
ASVSLIDSQD EERFVEIETT NGLNFVVSEP ELEFTDRNIL GVQLVPLKSG NQIKSIRFVD
NYEYKEFIIG INKKGNIKTF SNMNSNSYEK VKVNSFRNII AFSNKGKVFK FPAYLLQNTE
ESNISDLVDG FEKDELIIKV APINEFGKIG EDLFVYFFSR EGLVKKTSLR EFLGEFNNQI
AYKFKTPKDE LVNVDINFEN ATVILVTKNG MGIKFLATAI NPMGRIASGV TGISLKDDNK
VIFGKVIPPS EGIDDKTLEA YNDYKKELTS NYEKLVLESK QKEKAEVNIE DIKLQNRAGR
GSSLMILVLE DYIRDVIIK