Gene CPF_1760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1760 
SymboltopB 
ID4203225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1981288 
End bp1983483 
Gene Length2196 bp 
Protein Length731 aa 
Translation table11 
GC content30% 
IMG OID638082632 
ProductDNA topoisomerase III 
Protein accessionYP_696196 
Protein GI110801342 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTTAT TGAAGAAATT AGTACTAGCA GAAAAACCTA GTGTAGGTAG AGAAATTGCC 
AAGGTTTTAA AATGCAATAA TAATAAAGGA AGTTATATTG AAGGAAAGGA TTATGTAATA
ACTTGGGCTT TAGGCCATTT AGTAGAATTA CAATCTCCAG AAGATTATGA TAACAAGCTT
AAAACATGGT CTATGGAAAC TCTTCCAATG CTTCCTAAGC ATATGAAACT TAAGGTTATT
AAAAAAGCCA CTAAGCAATT TTATGAAGTT AAAAAGCAAA TGGAAAGAAA AGATATAGAT
GAAATAATAA TAGCCACAGA TGCTGGAAGA GAAGGGGAGC TCGTAGCACG TTGGATAATA
GAAAAAGCCC ATGTTAAAAA GCATATAAAG AGACTTTGGA TATCCTCTCA AACAGATAAA
GCTATCTTAG ATGGATTTAA AAATCTTAAA CCAGGAGAAA ATTACGATAA TTTGTATAAG
GCTGCCCAGT GTAGAGCAGA GGCTGACTGG TTAGTTGGTT TAAATGTTAC TAGGGCTTTA
ACTTGTAAAT ATAATGCTCA GCTTTCAGCA GGAAGGGTTC AATCACCTAC CTTAGCAATG
ATAGTTCAAA GAGAAGAGGA TATTAAAAAT TTTAAACCAA AAGAGTATAG TACTATTTTA
ATAGAAACAG ATAAATGTAA TTTCACATGG ATAAATAAGG ATAATAATTC AAGAATATTT
AAAAATGATT TTAAAGAAAA AGTTGTTGCC CATTTAAATG AGAAGAAAAT AGGAAAAATA
GTTAATATAA ATGAAAGTAA TAAAAAGAAA TTTTCTCCTC AACTATATGA TTTAACTGAG
CTTCAAAGAG ATTGTAATAA AATTTTTGGA TATTCAGCAA AACAGACCCT TAATATAATG
CAAAGATTAT ATGAAAACCA TAAACTTCTT ACATATCCAA GAACTGATTC AAGATATATT
TCAAAGGACA TAGTTTCTAC TATTCCAGAA AGATTAAAAG CTGTAGCTAC AGGAGAATTT
AGAATTATAG CAAATGATTT ATTAAAAAAT CCTATAAAAG CAAACAAAAG TTTTGTAGAT
GACAATAAGG TTTCTGATCA CCATGCTATA ATTCCAACGG AGGAAAAAGG AAACCTAGCT
AATTTAAGTT CTGATGAGAG AAGAGTTTAT GAGCTTGTAG TTAAGAGATT TCTAAGTGTA
TTAATGCCTC CATTTGAATA TATTCAAACA ACTTTAACTG GTGAAGTTAA TGGAGAAAAA
TTAATTGCAA AGGGAAAAGT AGTTAAAGCT AAAGGATGGA AGAAGCTTTA TGATAGAGAA
GTTGATGATG AGGGAGAAGA GGATATAAAA GAACAAGAAC TTCCTAAATT AAAAGAGGGC
GATGAATTTA AAGTTCTTAA GGTTAATGTT AAAAAAGGAG AAACAAAGCC ACCAGCAAGA
TTTAATGAAG GAACTTTATT ATCTGCTATG GAAAATCCTC AAAAATTTAT ATCAGTAGAT
AAATCTTCAG CAAAGACTTT AGGAGAAACT GGAGGACTTG GTACTGTTGC AACTAGAGCA
GATATTATAG AAAAATTATT TAATTCCTTT GTAATAGAAA AGAGAGGAAA GGAAATAATT
CCAACTTCTA AGGGAAAACA ACTTATAGAT TTAGTACCAA AGGATTTAAA GTCACCACTT
TTAACAGCTA ATTGGGAAGA GATGTTAGAG AGAATAAGTA AAGGAAAAGG TGATTCTAAG
AAATTTATAA AAGATATAAG AAACTATACT GTTGCTTTAG TTGAGGATGT TAAGCTTGGA
GAAAGTAAAT TTCATCATGA TAACTTAACA GGGAAAAAAT GTCCTCAGTG TGGTAAGTAT
ATGCTTGAGG TAAAAGGCAA AAATGGAACA ATGAATGTTT GCCAAGATAG AGAGTGTGGA
TATAGAGAAA ATATATCTAG AATAACTAAT GCTAGATGTC CTGAGTGTAA GAAAAAGCTA
GAGCTTAGGG GACATGGAGA AGGTAAAATA TATGTTTGCC CGGGAGTAAA TTGTAACTTT
AGAGAAAAAG AATCTCAATT TAAAAAGAGA TTTGAGAAAA ATAACAAAAC AAATAAAAGA
GAAGTAAATA AAATAATGCA AAAGATGAAA AAGGAAGCAA ATGAAGATGT AAATAATCCT
TTTGCTGATC TTTTAAGTGG TCTTAAATTT GACTAA
 
Protein sequence
MILLKKLVLA EKPSVGREIA KVLKCNNNKG SYIEGKDYVI TWALGHLVEL QSPEDYDNKL 
KTWSMETLPM LPKHMKLKVI KKATKQFYEV KKQMERKDID EIIIATDAGR EGELVARWII
EKAHVKKHIK RLWISSQTDK AILDGFKNLK PGENYDNLYK AAQCRAEADW LVGLNVTRAL
TCKYNAQLSA GRVQSPTLAM IVQREEDIKN FKPKEYSTIL IETDKCNFTW INKDNNSRIF
KNDFKEKVVA HLNEKKIGKI VNINESNKKK FSPQLYDLTE LQRDCNKIFG YSAKQTLNIM
QRLYENHKLL TYPRTDSRYI SKDIVSTIPE RLKAVATGEF RIIANDLLKN PIKANKSFVD
DNKVSDHHAI IPTEEKGNLA NLSSDERRVY ELVVKRFLSV LMPPFEYIQT TLTGEVNGEK
LIAKGKVVKA KGWKKLYDRE VDDEGEEDIK EQELPKLKEG DEFKVLKVNV KKGETKPPAR
FNEGTLLSAM ENPQKFISVD KSSAKTLGET GGLGTVATRA DIIEKLFNSF VIEKRGKEII
PTSKGKQLID LVPKDLKSPL LTANWEEMLE RISKGKGDSK KFIKDIRNYT VALVEDVKLG
ESKFHHDNLT GKKCPQCGKY MLEVKGKNGT MNVCQDRECG YRENISRITN ARCPECKKKL
ELRGHGEGKI YVCPGVNCNF REKESQFKKR FEKNNKTNKR EVNKIMQKMK KEANEDVNNP
FADLLSGLKF D