Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_2039 |
Symbol | |
ID | 4206374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 2252257 |
End bp | 2255196 |
Gene Length | 2940 bp |
Protein Length | 979 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 642566589 |
Product | DNA topoisomerase IV subunit A |
Protein accession | YP_699348 |
Protein GI | 110803503 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0188] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAAGA AGAATATTGA AATACCTAAG GATAATAATA TTATCAGAAT GCCTCTTGAA GAGGTTATGC CTGATAACTA TTTACCTTAT GCAGTAGAAG TTGCTAAGGA TAGAGCACTT CCAGATGTAA GAGATGGATT AAAGCCTGTA CACAGAAGAA TTTTATATGG GGCGTATATG TTAAAGGCTT TTCCAGATAA ACCTTATTAC AAATCAGCTA GAATAGTTGG GGATATACTA GGTAAATATC ACCCACATGG AGATAGTTCA GTTTATGATG CTATGGTAAT TTTAGCTCAA AATTTTAGTA CAAGAGCTCC TCTTATTGAT GGACACGGAA ACTGGGGAAG TATAGATGGA GATGGAGCAG CGGCTATGAG ATATACAGAG GCTAGACTTT CTAGTATATC TATGGAGATG CTTAGAGATA TAGAAAAAAA TGTAGTAGAC ATGGTTCCAA ACTACTCAGA TTCTGAAATG GAACCTAAGG TATTACCAGC TAGATATCCA AATCTCTTAG TAAATGGTAC TTTTGGTATA GCAGTTGGAC TTTCAACTAA TATACCACCA CATAATTTAA GAGAAGTTAT AGATGGTACT TTAGCTTATA TAGATAATAA TGAAATTACT ACTAGAGAAT TAATGAACTA TATAAAAGGA CCAGACCTTC CAACAGGAGG AGTTTTAATT GGTGAGAAAA CTTTATTATC TGCTTATGAA ACAGGAGAAG GAAAAGTAAC TTTAAGAGCT AAGGCTAAAA TTGAAACTTT AGAAAATGGA AGACTTGGAA TAGTAATAAC TGAATTCCCT TATAGAAGAA ATAAGGCTAG AATACTTCAA ACAATATCAG ATATGACTGG TGACAAGAGA CATGCTAAAG CTTTAGATGG AATAGTGGAT ATTAGAGATG AGTCAGATAG AACTGGTATA AGAGCCGTAA TAGAATTTAA GAAGGCTGTA GATCACGATA TGGCTGATAA GGTTCTTAAG TATCTTTATA AGAAAACTGA TCTTCAAGGT AACATAAGCT TTAATATGGT TGCCTTAGCA GATGGAAAAC CAGAGACTAT GGGATTAAAA ACAATAATAT CTCATTATGT AAACCATCAA AAGGATGTTG TTACAAGAAG AACAAAAAGA GAGTTAGAAG TAGCAGAAAA GAGATTTCAC ATAGTTGAAG GTTTCATAAA GGCTATAGGA ATAATGGATG AGGTTATAGC TACAATTAGA GCTTCAAAAT CAAAGAAAGA TGCTCATGAG AATTTAGTTT TAAAATTTGG ATTTACTGAC TTACAGGCAG AGGCAATTCT AGAATTAATG CTTTATAGAT TAACTGGATT AGAGATAAAA GTATTCCAAA AAGAGCATAA AGAATTATCA AAGAAAATAA AGGCACTTAG AAAAATTTTA GAAAATGAAT CAGTTCTTTT AGGAGTTATA AAAGATGAAT TAAAAGAAGT GGCTGAAGTT TATGGTGATG AAAGAAGAAC TGCCTTAATA GAAGATGAAA GCGAAGCTAA GATAGATCTA GAAGAGTTAA TAGTGGCAGA GGATGTTATG GTTACTCTAT CAAACGAAGG ATTTATAAAG AAGATTCCTC TTAAGACTTA TAATCGTTCA AATGTTGATG AAAATGAAAT TGAATATAGA GAAGGAGATT ATTTAAAATT CCTTATTAAA TCAAATACTA AGGATACCTT AGCTATATTC ACTGATAAAG GAACTGTTTA TCAAATAAAA TGTAATTCTG TAGCAGATAA GAAATGGAAA GATAAAGGAG AAAGACTTGA GGATTTAATA AGAGGATTAA GTTTAGAAGA TGAAAAAATT ATTGCTCTTG AATCAATTGA AAATTTCCTT CCAAACAAAT GCTTTAAATT TATAACTGCT AATGGATTAA TAAAGAAGAC TACTTTAGAT AAATTTGTTA CTGCTTACTC AAAACTTATG GCTATAAAGC TTAAAAATGA CGATTTATTA GCAAGTGTAT CTTTAATAGA TTCACAGGAT GAAGAAAGAT TTGTTGAAAT TGAAACAACT AATGGATTAA ATTTTGTTGT TTCAGAACCA GAATTAGAGT TTACAGATAG AAATATACTA GGAGTTCAAT TAGTACCATT AAAGAGTGGT AACCAAATTA AGAGTATAAG ATTTGTAGAT AACTATGAAT ATAAAGAATT TATCATAGGA ATAAATAAAA AGGGAAATAT AAAAACTTTC AGCAATATGA ATAGCAATTC TTATGAAAAG GTAAAGGTTA ATTCCTTTAG AAATATAATT GCTTTCTCAA ATAAAGGAAA GGTATTTAAA TTCCCAGCAT ACTTACTTCA AAATACAGAA GAAAGTAATA TTTCAGACTT AGTAGATGGA TTTGAAAAGG ATGAACTTAT AATAAAGGTA GCTCCTATAA ATGAATTTGG AAAAATAGGT GAAGATTTAT TTGTTTACTT CTTCTCAAGA GAAGGATTAG TTAAAAAGAC ATCTTTAAGA GAGTTCTTAG GAGAATTTAA TAATCAAATA GCTTATAAGT TTAAAACTCC AAAGGATGAA TTAGTAAATG TAGATATAAA TTTTGAAAAT GCTACAGTAA TCTTAGTAAC TAAGAATGGT ATGGGAATTA AGTTCTTAGC TACAGCTATT AATCCAATGG GAAGAATAGC TTCAGGGGTA ACAGGAATAA GCTTAAAAGA TGATAATAAA GTTATATTTG GTAAGGTTAT ACCACCATCT GAAGGTATTG ATGATAAAAC TTTAGAGGCG TATAATGACT ATAAAAAAGA ATTAACTAGT AATTATGAAA AACTTGTTTT AGAGTCTAAG CAGAAGGAAA AAGCCGAAGT CAATATTGAA GATATTAAAC TACAAAATAG AGCGGGAAGA GGAAGTAGTT TAATGATTTT AGTATTGGAA GACTATATAA GGGACGTAAT TATTAAGTAG
|
Protein sequence | MAKKNIEIPK DNNIIRMPLE EVMPDNYLPY AVEVAKDRAL PDVRDGLKPV HRRILYGAYM LKAFPDKPYY KSARIVGDIL GKYHPHGDSS VYDAMVILAQ NFSTRAPLID GHGNWGSIDG DGAAAMRYTE ARLSSISMEM LRDIEKNVVD MVPNYSDSEM EPKVLPARYP NLLVNGTFGI AVGLSTNIPP HNLREVIDGT LAYIDNNEIT TRELMNYIKG PDLPTGGVLI GEKTLLSAYE TGEGKVTLRA KAKIETLENG RLGIVITEFP YRRNKARILQ TISDMTGDKR HAKALDGIVD IRDESDRTGI RAVIEFKKAV DHDMADKVLK YLYKKTDLQG NISFNMVALA DGKPETMGLK TIISHYVNHQ KDVVTRRTKR ELEVAEKRFH IVEGFIKAIG IMDEVIATIR ASKSKKDAHE NLVLKFGFTD LQAEAILELM LYRLTGLEIK VFQKEHKELS KKIKALRKIL ENESVLLGVI KDELKEVAEV YGDERRTALI EDESEAKIDL EELIVAEDVM VTLSNEGFIK KIPLKTYNRS NVDENEIEYR EGDYLKFLIK SNTKDTLAIF TDKGTVYQIK CNSVADKKWK DKGERLEDLI RGLSLEDEKI IALESIENFL PNKCFKFITA NGLIKKTTLD KFVTAYSKLM AIKLKNDDLL ASVSLIDSQD EERFVEIETT NGLNFVVSEP ELEFTDRNIL GVQLVPLKSG NQIKSIRFVD NYEYKEFIIG INKKGNIKTF SNMNSNSYEK VKVNSFRNII AFSNKGKVFK FPAYLLQNTE ESNISDLVDG FEKDELIIKV APINEFGKIG EDLFVYFFSR EGLVKKTSLR EFLGEFNNQI AYKFKTPKDE LVNVDINFEN ATVILVTKNG MGIKFLATAI NPMGRIASGV TGISLKDDNK VIFGKVIPPS EGIDDKTLEA YNDYKKELTS NYEKLVLESK QKEKAEVNIE DIKLQNRAGR GSSLMILVLE DYIRDVIIK
|
| |