Gene CPF_1148 details

Gene Information

Locus tag: CPF_1148
Symbol:
ID: 4203780
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1312094
End bp: 1315303
Gene Length: 3210 bp
Protein Length: 1069 aa
Translation table: 11
GC content: 28%
IMG OID: 638082029
Product: putative helicase
Protein accession: YP_695594
Protein GI: 110800073
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
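
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record does not state either convention. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1312094
END_BP = 1315303


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 3210 1069
```

Under these assumptions the result agrees with the listed Gene Length (3210 bp) and Protein Length (1069 aa).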


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0135217
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTTTGA ACTTAGATAT TTTAATGAAG AACTTAGTAA GGCGAACTTC CAGTTTTACT 
AGAGAACAAG GTAAAAAGTT AATTAAGGAA GCATATGTGA AAGATGTTAG GGGCAAGAGC
ATAGATGGAA TTTATCATAT TTATGGGAGT GTATTAAATG ATGATAAAAA CTGGGATTAT
AATACACACA TAAAAATAAA TATGAAAAAT AGTGATATTG TTGGTACTAA TTGTAGTTGT
GAAACCTTTA AAGAGAATAG TAAACATATA AAAAATTATG TGTGTAAACA CATTTCAGCA
ACTAATGATG TTTTTTATTC CTTAGCAAAG AAGAAGATGC AAAATAATAA GTTAAAAAGT
AATAATAAAA CTCAATTAGT TAAAGAGAAA AATGAGGAAC ATAAAGGGCA AGAAAAGAGA
TTTTTATCTT TAGATATAAA TATTAAGCAC ATGGTAAAAG AGGGGATTAC TTTATTTAAC
TGTGAGTTTA GAATTGGTGC AGGAAATTTA AATTTAATAC TGGATTTAAA AGATTTCTTA
TATAAAAATA GTTTAAAAAA GCCTTTAAAA TTTAATGATG GATTTACTTA TAATCCATTA
AAAGATGAGT TTTTAGAAGA AGATAAAAGA GTTCTTCAAT TTGTAGCAAG CCATAAGGAC
ATGATAAGTG GAAGATATTT AAGGATTAAA CAAAATAATT TAAAGGATTT TCTAAAATTA
GTTGATGAAA AGAAAAAAAT AAATTTTAAT TTTAACTCAA TTAATTATGA GGTTAAGGTG
AAAAAGGAAA ATGTTCCAGT AGCCCTGACT TTAAAGGAAG GCAAAGAGGG ATTTGTATTA
AGTCACCATA AGAAATTCCC TGTTATTTTA AATAACTTAG GAGATGTTAT GTTTTTTGAT
AGAAACTTAT ATCTTCCAAG GAAGAGACAG TTAGAATATT ACAGTCCAAT TCATAAGTTG
TTTCTAAAAA ATAATACAAT CACATATAAG AAGAGCTTAG AGAGTCTAAG AAACCTATTA
GAAGAACTGA AGAACATAAG CAAGAATATA GTTTTAGATG AAAATATTAG AGTTTTTAAA
GAAAAGCTTA TGAAAACAAC CTTTAATTTA TATAAAAATA AGGGAAAAAT TTATTGTAAT
GTAAAAATTG ATTATTGTGG ATACATAATT GATTTAATAA GAGATGAAAA AGATAATAGC
TTTTTAAGAG ATTTAAAAAG TGAGAAATAT ATAGAATTTC AACTTGAAAG ATTTAAGTTC
ATAAAGAGAG AAGAAGATTT TTGTTTTATA GGAAATGAAG AGGAAATATA TGAGCTTTTC
TCTAAGGGAA TTAAGAGGCT TAGAGAACTT GGAGAAGTTT TATTATCAGA GGAACTTAAG
GAGTTTAAGG TTTTAGATTC ATCCCTTATT TCATCAGAAC TTAAAGAGTT TAGTAATTTC
TATAAGCTTA AGTTTGACTT TGGAGATTTT GAATTAAGAG AACTTAGAGA AAGCATAGAG
GCTATGAAAA GGGGAGACCG CTTTTATAGA ACTAAGAAGG TTTATTTAGA TTTAGAGGAT
CCTGGTATAG TAAACTTTTT AAATCTTCTA GAGGATTTAG GATTAGAAAA TATAAAAGAT
AATGAAGTCT ATATAGATAA GAGTAAGGTT TTATATATCC AAGAAAAATT AAAGGATAGA
AATCTCTCTT TTATAAAAGG GGGAAATGTT CTTCAAGAAA TAGTAGGAAA GCTTCTAAAT
AAAGAATTTA AGAGAAAGCT TCTTCCAAAG GCTTTAAATG CAGAGCTTAG ACCTTATCAA
AAAGAGGGGT TTAAATGGAT TAATGAAATT ACTGATTTAG GTTTTGGTGG AGTTTTAGCA
GATGATATGG GACTTGGTAA AACCCTTCAA ATAATAGCTT TTTTATTATC TCAGAAAAAA
AGCAAAAGCA TAGTAGTTGT TCCAACCTCT GTTATATATA ACTGGATGGA TGAGTTTGAA
AAGTTTGCTC CAAGTATAAG GGTTGGATTA GTTCACGGTA GTAAATCTAA GAGGGATAAG
GTTTTAAGAG ATTTTAAAAG AGGACTTGGT ATTAAGATAG AAGAGGAGAA TTTAAAAGAA
AAATCCTATG AAAAATATGA TGTTTTATTA ACTACCTATG GAACCTTAAA GAATGATGAA
AAAGCTTATG AAAACTTAAG TTTTGATTAT TGCATAATAG ATGAGGCTCA GAACATAAAG
AATCCATCAG CACAAGCAAC TTTATCTGTT AAAAATATTA AATCAAGATG CAATATTGCC
TTAACAGGAA CTCCTATAGA GAATAATTTA ATGGAGCTTT GGTCAATATT TGATTTTGTT
ATGCCTGGAT ATTTATTTAC TAAGGAGAGA TTTAGAGAAA GATTTATTTT AGATGAAAGC
AATTTAAGTG AATTAAAATC TTTAATAACT CCTTTTATTT TAAGAAGACT TAAGGAAGAT
GTTTTAAGTG AACTTCCAGA AAAACTTGAG AAAAAATACT TAGTAGAAAT GAAAGGAAAA
CAAAAACAGT TATATAGTTT CTATGTAAAG GCAATAAAGA ATCAATTAAA TGAAAATAAA
AGTTCTGAAA AGAGTGGAAG AGATAAAATT AATCTATTTG CCTATTTGAC AAAATTAAGA
GAAATTTGTT TAGATCCTTC ACTAGTTGTA CCAGATTATA CTGGAGAAAG TAGTAAGCTT
ACTGTGGTAA AAGAAATTGT AAAAGATGCT AGTGAATCAG GAAAGAAGAT TTTACTATTT
TCTCAATTTA CTTCAGTACT ACAAAAAATA GAAGAGGACT TTAAAAAAGA GGATATTTCT
TATTTATACT TAGATGGAGG AACTTCTGCT AAGGATAGAG TAGAGAGAGT TAAAAAATTC
AATGAGGATA GTAATATTAA AGTGTTTTTA ATATCTCTAA AGGCTGGGGG AGTTGGACTA
AATTTAACCT CTGCCAGTGT GGTTATACAT TTTGATCCTT GGTGGAATCC AGCCGTAGAA
GATCAAGCTA CAGATAGAGC ACATAGATTT GGACAAGAAA ATAAAGTTGA AGTTATAAAG
TTAGTTGCAA AGGATACTAT TGAGGAAAAA ATAGTATTAA TGCAGGAGGA TAAAAGAGAA
CTTATTCAAA GTTTAATGGA TGGAAAAACT ATGGATGGTA AGGGATTAAA ACGCCTTACG
GAAGAAGAAA TTAGTAAATT ATTTGAGTAA
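
The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The following is a minimal sketch of how a GC content figure like the one in the record could be recomputed from such a block-formatted sequence. The helper names are hypothetical, and the database's own rounding rule is not documented here, so the sketch returns an unrounded percentage.

```python
def clean_sequence(text: str) -> str:
    """Strip the display spaces and newlines and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence above, used only as a small demo.
first_line = "ATGCCTTTGA ACTTAGATAT TTTAATGAAG AACTTAGTAA GGCGAACTTC CAGTTTTACT"
print(len(clean_sequence(first_line)))  # prints: 60
print(round(gc_content(first_line), 1))
```

Passing the complete gene sequence to `gc_content` gives the value for the whole gene, which can then be compared with the GC content field.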
 
Protein sequence
MPLNLDILMK NLVRRTSSFT REQGKKLIKE AYVKDVRGKS IDGIYHIYGS VLNDDKNWDY 
NTHIKINMKN SDIVGTNCSC ETFKENSKHI KNYVCKHISA TNDVFYSLAK KKMQNNKLKS
NNKTQLVKEK NEEHKGQEKR FLSLDINIKH MVKEGITLFN CEFRIGAGNL NLILDLKDFL
YKNSLKKPLK FNDGFTYNPL KDEFLEEDKR VLQFVASHKD MISGRYLRIK QNNLKDFLKL
VDEKKKINFN FNSINYEVKV KKENVPVALT LKEGKEGFVL SHHKKFPVIL NNLGDVMFFD
RNLYLPRKRQ LEYYSPIHKL FLKNNTITYK KSLESLRNLL EELKNISKNI VLDENIRVFK
EKLMKTTFNL YKNKGKIYCN VKIDYCGYII DLIRDEKDNS FLRDLKSEKY IEFQLERFKF
IKREEDFCFI GNEEEIYELF SKGIKRLREL GEVLLSEELK EFKVLDSSLI SSELKEFSNF
YKLKFDFGDF ELRELRESIE AMKRGDRFYR TKKVYLDLED PGIVNFLNLL EDLGLENIKD
NEVYIDKSKV LYIQEKLKDR NLSFIKGGNV LQEIVGKLLN KEFKRKLLPK ALNAELRPYQ
KEGFKWINEI TDLGFGGVLA DDMGLGKTLQ IIAFLLSQKK SKSIVVVPTS VIYNWMDEFE
KFAPSIRVGL VHGSKSKRDK VLRDFKRGLG IKIEEENLKE KSYEKYDVLL TTYGTLKNDE
KAYENLSFDY CIIDEAQNIK NPSAQATLSV KNIKSRCNIA LTGTPIENNL MELWSIFDFV
MPGYLFTKER FRERFILDES NLSELKSLIT PFILRRLKED VLSELPEKLE KKYLVEMKGK
QKQLYSFYVK AIKNQLNENK SSEKSGRDKI NLFAYLTKLR EICLDPSLVV PDYTGESSKL
TVVKEIVKDA SESGKKILLF SQFTSVLQKI EEDFKKEDIS YLYLDGGTSA KDRVERVKKF
NEDSNIKVFL ISLKAGGVGL NLTSASVVIH FDPWWNPAVE DQATDRAHRF GQENKVEVIK
LVAKDTIEEK IVLMQEDKRE LIQSLMDGKT MDGKGLKRLT EEEISKLFE
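
The protein sequence can be cross-checked against the gene sequence by translating it. Translation table 11 (bacterial, archaeal and plant plastid code) assigns the same amino acid to each codon as the standard code and differs only in which codons may serve as start codons. Since this gene starts with ATG, a plain codon lookup is enough. The sketch below builds the table by hand instead of relying on a library; `translate` is an illustrative name, not part of any database tooling.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# These assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG), which table 11 reads as Met at
    the initiation position, are not special-cased in this sketch.
    """
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First display line of the gene sequence from this record.
first_line = "ATGCCTTTGA ACTTAGATAT TTTAATGAAG AACTTAGTAA GGCGAACTTC CAGTTTTACT"
print(translate(first_line))  # prints: MPLNLDILMKNLVRRTSSFT
```

The output matches the first twenty residues of the protein sequence. Running the same function on the full gene sequence should reproduce the whole protein if the record is internally consistent.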