Gene CPF_2359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2359 
Symbol 
ID4203788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2623715 
End bp2625472 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content28% 
IMG OID638083224 
Producthelicase domain-containing protein 
Protein accessionYP_696782 
Protein GI110799948 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA ATGCGGCTCA AAGGGAGTAT AAAAAATTAT CAGGTCAAAT AAATCAAATA 
GAAAACATAG TAAGACATTC AAAAGCAGGA GCACTTTTTG AACATGAATC AACCTTAAGA
AAAAAAATAT TGCAATTAAA AGAGATGAAA GATCAAGGGT TAAAAGGATA TGAATCTCTT
TATGATAGAT ATGAAGAATT ATTAGAAGAG GTAGGAAAGA GAATTCTAGA AAATTATAAT
AAGAAAAATG ATACTAATTT TGATTTTTAT GAAGTTTTAA GAAACAATTA CAATGTATTT
CTTAATTCAG GAATTATGAC TCTTCTTGTA AAACATCATA TTCCAGAGTT AATATCTAAG
GAATTTGATG AGAAGTTTCC AGCAAATCCA AAGGATGAAT ATCTTCATAC AAGAAGGCTT
AAAAGAAAAT TCTATCTTCA CTTAGGAGAA ACGAATACTG GAAAAACCTA TACTGCTATG
CAGAGACTTA AAGAGGTTAG AAAAGGTGTA TATTTATCTC CTCTTAGAAT TTTAGCTTTA
GAGAACTTTG AAAGATTGAA TAATGAGGGT GTTAAATGTA ATCTTCTTAC TGGAGAAGAA
GAAATATTAT TTGAAGATGC AACACATGTA TCATGCACCA TAGAAAAAGC TAATATACAT
GAAAGATATG ATGTGGCAGT TATAGATGAA ATTCAAATGA TAGATGATTC ACAAAGAGGG
TATGCTTGGA CTAGAGCTTT ACTTGGTTTA TATTGTACTG AAATACATAT ATGTGGAGCC
TTTAATGCTA AGAATATATT AAAAGAAATT ATAGAAGATT GTGGAGATGA CTATGAGATT
ATAGAATATC ATAGGGATAT TCCACTTATT GTAGAAGATG AAAGTTTTCA TCCTAAAAAT
GTTCAAGAGG GAGATGCCTT AGTTTTATTT TCAAAGAAAA AAGTTCTTCA AATGGCTGAG
CAATATTCAC AGATGGGAAT TAAATGCTCC ATAATCTATG GAGATTTACC ACCAGAGGTT
AGAAAGAAGC AGTATGAAGA ATTTATAACT GGAAAAAATA AAATTTTAAT AACAACTGAT
GCCATTGGAA TGGGAGTCAA TCTTCCTATA AAGAGAATTA TTTTCTTATC AATAAGTAAA
TTTGATGGAG AACAAATGAG AGAGTTAACT TCGCAGGAGG TTAAACAAAT TGCAGGAAGA
GCAGGAAGAA AAGGCATATA TGATACTGGA TATGTAGCTA CTTATAGAGA TAATAAAGAA
TTTATTGAAG AGAGATTAGA GGAAGAGGAT ATTAGTATAA AAAGAGCAGT TTTAGGACCT
TCAGATGCAA TATTAGAAAT TGATAATCTT CCTTTAAATG AAAAATTAGC TTTATGGAGT
ACAAAGAAAT GTGAAGTTCC ATACTATAGA AAAATGGATA TAAGTGAATA TTTAATAATA
TTAGAAAGAT TAAAATCATA TAAACTTCTT GAAGAAATTC AATGGGAGCT TTTAAAAATT
CCTTTTGATA TATCTAAAGA TGACTTAATG AATCAATTTT TAAATTTTGT TGATCAGCTA
TTTATAAATG ATCAAGAAGA ACTATTTAAA CCTCAATGTT ATTCAGGAAC TTTATATGAC
TTAGAAACTT ATTATCAAAT GGTAAATATG TATTATTCTT TTAGCAAGAG ATTTAATTTA
AATTTTGACT TAGAGTGGAT TGAAAATGAA AGGCTTACTG TAAGTGAAGA AATAAACAAT
ATTCTTATGA GAATTTAA
 
Protein sequence
MKKNAAQREY KKLSGQINQI ENIVRHSKAG ALFEHESTLR KKILQLKEMK DQGLKGYESL 
YDRYEELLEE VGKRILENYN KKNDTNFDFY EVLRNNYNVF LNSGIMTLLV KHHIPELISK
EFDEKFPANP KDEYLHTRRL KRKFYLHLGE TNTGKTYTAM QRLKEVRKGV YLSPLRILAL
ENFERLNNEG VKCNLLTGEE EILFEDATHV SCTIEKANIH ERYDVAVIDE IQMIDDSQRG
YAWTRALLGL YCTEIHICGA FNAKNILKEI IEDCGDDYEI IEYHRDIPLI VEDESFHPKN
VQEGDALVLF SKKKVLQMAE QYSQMGIKCS IIYGDLPPEV RKKQYEEFIT GKNKILITTD
AIGMGVNLPI KRIIFLSISK FDGEQMRELT SQEVKQIAGR AGRKGIYDTG YVATYRDNKE
FIEERLEEED ISIKRAVLGP SDAILEIDNL PLNEKLALWS TKKCEVPYYR KMDISEYLII
LERLKSYKLL EEIQWELLKI PFDISKDDLM NQFLNFVDQL FINDQEELFK PQCYSGTLYD
LETYYQMVNM YYSFSKRFNL NFDLEWIENE RLTVSEEINN ILMRI