Gene CPR_2040 details

Gene Information

Locus tag: CPR_2040
Symbol:
ID: 4204901
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 2255210
End bp: 2257156
Gene Length: 1947 bp
Protein Length: 648 aa
Translation table: 11
GC content: 31%
IMG OID: 642566590
Product: DNA topoisomerase IV subunit B
Protein accession: YP_699349
Protein GI: 110801950
COG category: [L] Replication, recombination and repair
COG ID: [COG0187] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit
TIGRFAM ID:
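
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, although it reproduces the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2255210
END_BP = 2257156


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming exactly one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))  # 1947
print(protein_length(1947))           # 648
```

Both results match the Gene Length and Protein Length fields of the record.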


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAATA AAAATAACAA TTATGATGTT ACCTCTTTAA CTTCTTTAGA AAAGTTAGAA 
CCAGTAAGAG TAAGACCAGG AATGTACATA GGTTCTACAG GAAGCAAGGG ACTTCATCAT
TGTATATGGG AAATATTAGA TAACTCTATA GACGAAATAT CAAATGGATA TGGAAATAAG
GCTAAGGTTA TTTTAAATAA AGATAAAAGT GTAACTGTAA TAGATAATGG AAGAGGAATT
CCTACGGGAA TGCATCCAAT AAAGAAAAAA ACTGGTGTTG AAATGGTATT TACAGAGCTT
CACACTGGGG GTAAGTTTAA TAACCAAAAC TATAAGACTT CAGGAGGACT TCATGGTGTT
GGAGCAGCCG TTGTTAATGC CTTATCAAGA TGGTTAGAAG TTGAAGTAAA ACAAAATGGA
AAGATATATA GACAAAGATT TGAATATGCT TATGACAAAG AGCTTAAAAA AGATATGCCA
GGAACACCAG TAACTCCATT AGAGGTAGTA GGAGAATCTA AGGAAACTGG AACAAAGGTT
ACTTTTTTAC CAGATAGAGA AGTATTTTCA ACATCAGATT TTAAATTTGA CATAATAGAT
GAAAGATTAC AAGAATTAGC TTTCCAAAAT AAAGGAATAA GATTAGAACT TATTGATAAT
AGAAAAGAAG AAAGTGTAAG TAAAGAATAT TACTCTGAAA GAGGACTTTT AGATTTTATA
GATTATTTAA ATGAAAGTAA AACCCCTCTT CATCCAACTC CTGTTTTATT TGAAGGAGAA
AAAGAAGTAA ATAACATACA AATGTATGGA GAAATATGTC TTCAATTTAC TGATTCAACA
ACAGAAAATA TAGTTAGTTA CGTAAATAAT ATTCCTACTA CAGAGGCTGG TACTCATGAG
ACTGGTTTTA AAACAGGAAT GACTCGTGCT TTTAAGGAAT GGGCTAAAAA GCTTAACCTT
ATAAAAGAAA AAGATAAGGA ATTTGAAGGC GATGATTTAA GAGAAGGTAT GACAGCTATT
GTCAGAATAA AAATCACAAA TCCAGTCTTT GAAGGACAAA CAAAAACTAA GCTTGGGAAT
AATGAAGCTT ACACAATGAT GAATGATTTA GCTTACACTA AATTTGGTCA TTGGATAGAG
GATAATAAAG AAGTTGCAAC TATGATTATA AATAATGCCT TAGAAGCAGC TGCTAGAAGA
GAAAAAATTA AAAAGATTAA TGATGCAGAG AGAAAGAAAA TAGGAAAAGG AACAGCACCT
TTAGCAGGAA AGGTTGCAGT ATGCACATTA AAAAATAAAG AAGCTAATGA ATTCATAGTA
GTTGAGGGAG ATTCTGCAGG TGGATCAGCT AAGCAAGCTA GAGATAGAAG ATTCCAAACA
ATAATGCCTT CAAAAGGTAA AATAATGAAT ACAGAAAAAC AAAAGTTAGA AAATGTTATA
GCTTCTGAGG AATTAAAAAT CTTTAATACT GCCATAGGTA CAGGAATTTT AGACAATTAT
AATGAAGAAG ATTTAAAATA TGACAAAATA ATAATAATGA GTGATGCTGA CGTAGATGGA
TATCATATAA GAACTCTTTG GATGACATAT ATCTATAGAT ACATGAGACC TCTTATAGCT
AATGGTCATC TTTATTTAGC ACAACCACCT CTTTATAAAG TTGCTAAATC AGGTAAGGGT
AAAGATAAGA TTCTTTATGC TTATAGTGAT GACGAATTAG AAGAAGTTAA AAAGAAGGTA
GGAAAGGGAG CAACTATTCA AAGATTTAAA GGACTTGGAG AAATGAATCC AGACCAATTA
TGGGAGACAA CTTTAAATCC AGAAACAAGA ACTTTAGTAC AAGTTACTAT AGAAGATGCC
GCTAAGGCTG AAAAAATGGT TTCTTTATTA ATGGGTGATG TTGTTCAGCC TAGAAAGAAC
TATATGTATA AGTATGCTGA GTTCTAA
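
The GC content field of the record can be recomputed from a gene sequence laid out in space-separated blocks of ten bases like the one above. This is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over all bases; `gc_content` is a hypothetical helper name, and how the database rounds the value is not known.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces between 10-base blocks, line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy input: 4 of the 8 bases are G or C.
print(gc_content("ATGC GGTA"))  # 50.0
```

Passing the full gene sequence as one string (line breaks included) gives the value to compare against the GC content field.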
 
Protein sequence
MSNKNNNYDV TSLTSLEKLE PVRVRPGMYI GSTGSKGLHH CIWEILDNSI DEISNGYGNK 
AKVILNKDKS VTVIDNGRGI PTGMHPIKKK TGVEMVFTEL HTGGKFNNQN YKTSGGLHGV
GAAVVNALSR WLEVEVKQNG KIYRQRFEYA YDKELKKDMP GTPVTPLEVV GESKETGTKV
TFLPDREVFS TSDFKFDIID ERLQELAFQN KGIRLELIDN RKEESVSKEY YSERGLLDFI
DYLNESKTPL HPTPVLFEGE KEVNNIQMYG EICLQFTDST TENIVSYVNN IPTTEAGTHE
TGFKTGMTRA FKEWAKKLNL IKEKDKEFEG DDLREGMTAI VRIKITNPVF EGQTKTKLGN
NEAYTMMNDL AYTKFGHWIE DNKEVATMII NNALEAAARR EKIKKINDAE RKKIGKGTAP
LAGKVAVCTL KNKEANEFIV VEGDSAGGSA KQARDRRFQT IMPSKGKIMN TEKQKLENVI
ASEELKIFNT AIGTGILDNY NEEDLKYDKI IIMSDADVDG YHIRTLWMTY IYRYMRPLIA
NGHLYLAQPP LYKVAKSGKG KDKILYAYSD DELEEVKKKV GKGATIQRFK GLGEMNPDQL
WETTLNPETR TLVQVTIEDA AKAEKMVSLL MGDVVQPRKN YMYKYAEF
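
The protein sequence can be reproduced from the gene sequence by codon-wise translation. This is a minimal sketch, assuming the gene sequence is given 5' to 3' on the coding strand. Translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), so the standard codon table is used here. `translate` and the constant names are illustrative.

```python
from itertools import product

# Codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...) paired with the
# amino acids of the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAGCAATA AAAATAACAA TTATGATGTT"))  # MSNKNNNYDV
```

The output matches the first ten residues of the protein sequence above. The last four codons of the gene, `GCT GAG TTC TAA`, translate to `AEF` followed by a stop, which matches the end of the protein.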