Gene CPF_1542 details


Gene Information

Locus tag: CPF_1542
Symbol: recQ
ID: 4203020
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1759672
End bp: 1761450
Gene Length: 1779 bp
Protein Length: 592 aa
Translation table: 11
GC content: 28%
IMG OID: 638082420
Product: ATP-dependent DNA helicase RecQ
Protein accession: YP_695985
Protein GI: 110801424
COG category: [L] Replication, recombination and repair
COG ID: [COG0514] Superfamily II DNA helicase
TIGRFAM ID: [TIGR00614] ATP-dependent DNA helicase, RecQ family
            [TIGR01389] ATP-dependent DNA helicase RecQ
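
The coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so, but this convention reproduces the listed 1779 bp). The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # 593 codons minus the stop codon


length = gene_length(1759672, 1761450)
print(length, protein_length(length))  # 1779 592
```

The 593rd codon is the stop codon, which is why the protein is one residue shorter than the codon count.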


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.260757
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAGAG AGTTTGAGGT ATTAAATAAT ATTTTTGGAT ATAGGGACTT TAGAAAAGGG 
CAACAGGAGG TAATAGATAA GATTTTAAAT GGAAAGGATG TATTTTGTAT ATTACCTACT
GGAGGAGGAA AATCTTTATG TTATGAAATA CCTGCTTATA TTTTTAAAGG AACAACACTT
GTTATTTCTC CCTTAATATC TCTTATGAAG GATCAAGTTG ATAACTTAAA TTCCTTAGGA
ATTAATGCTG CTTATATAAG TGGAGGAAAT GACTTTGAAG AGGTTAAAAA TATAATTAGG
AAATTCATAA AAGGAGAATT TAAGCTTCTT TATGTATCAC CTGAGAGATT AGAAAATAGA
TATTTTTTAG ATAATATAGA AAAAGCTAAT ATTGCACAAA TTGTAGTTGA TGAAGCTCAC
TGCGTATCAA TGTGGGGCCA TGACTTTAGA AAAAGTTATA GCAAGATAAA ACCATTTATT
TGTAATTTAA AGAAAAGACC AATTATTACA GCCTTTACAG CTACGGCTAC AGAGATAGTT
ATGAAGGATT CCATTGAACT TTTAGGTCTT TATAATCCCT TTATTTATAA GGGTAGCTTT
AGTAGAGATA ATTTGGAGAT AAATGTATTA AAAGAAGTTG ATAAATTAGA AATTATAAGT
GAAATTATAA GTGAACATGA GGAAGAGTCA GGAATAATTT ATTGCTCAAC TAAAAATGAA
GTTGAAGAAC TTTATAAACA TATGCTTTAT AGAGGGAAAA GTGTAGGGAA ATATCATGGA
TCCTTAAAAG ATAAGGAAAA AAATTATTAC CAAGAAGAGT TTTTAAATGA TAATTTTAAG
GTTATGATTG CCACAAATGC TTTTGGTATG GGAATTGATA AGCCTGATGT TAAGTTTATA
ATACATTCAA CATTTCCAAA AAGTATAGAA AATTATTATC AAGAAATAGG CCGTGGAGGA
AGAGATGGAA GCTTAGCTAA GTGTTATCTC CTTTATTCAG AGCAAGATAT AAGGGTTATG
GATTATTTGA TAAGTTCAAC TACAGAAATA TCTAGAAGAA CTATAGAACT TAAAAAACTA
GAAAAAATAA TAGAGTTTTG TAATTATGAT AAGTGTCTTA GAAAATATAT TTTAGATTAT
TTTGGAGAGG AAAATTCCAT AAAGTATTGC AATAATTGTA CTAACTGCTT AAAAAATAGT
GATTTAATAG ATATGACATT AGAGGCACAA AAAATATTAT CATGTATATA TAGAACAAAG
GAAGCCTTTG GAGAAAGTGT TTTAATAGAT ATTTTAAGGG GGATTCATGG GCCAAAGATT
GAAAAATATA AGCTATATGA GCTTTCAACC TTTGGAATAA TGAAAGAATA TACAAGTAAG
TATATAAAGG ACATAATAAA GGAGCTTTTA TCTATTAAGG CTTTAGAGAG AAAAGAAGGA
ACATATTCTA TGCTAAAGTT AAATAGAAAA TCTATAGGAA TATTAAAGGG AGAAGAAAAG
GTACTTTTAG AGGTTAATAA TAATGAAGAA AGCATGTGCA TGGATTTAGA ACTTTTCAAG
AAGCTTCGTA TTTTAAGAAA AGATATATCA AGAAGAGAAG GAGTAAAACC ATACATAGTT
TTTACAGATT CAATGATTAT GGAGATAATA AATAAAAATC CTAAAAGTAA AGAGGATTTA
AAAAATATTA GGGGTTTTGG AGAGCAGAAA ATAACAAAGT ATGGCCCTTT TATTCTTACA
ACTTTAAGAG ATTATGAAAA GTATGGAAAA AGAAATTAG
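
The sequence above is laid out in 10-base blocks, so the whitespace has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC percentage that can be compared with the GC content value listed under Gene Information. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example runs on the first row of the gene sequence only.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100 * gc / len(seq)


# First row of the gene sequence only; paste all rows for the full gene.
first_row = "ATGAGAAGAG AGTTTGAGGT ATTAAATAAT ATTTTTGGAT ATAGGGACTT TAGAAAAGGG"
print(len(clean_sequence(first_row)), gc_percent(first_row))  # 60 30.0
```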
 
Protein sequence
MRREFEVLNN IFGYRDFRKG QQEVIDKILN GKDVFCILPT GGGKSLCYEI PAYIFKGTTL 
VISPLISLMK DQVDNLNSLG INAAYISGGN DFEEVKNIIR KFIKGEFKLL YVSPERLENR
YFLDNIEKAN IAQIVVDEAH CVSMWGHDFR KSYSKIKPFI CNLKKRPIIT AFTATATEIV
MKDSIELLGL YNPFIYKGSF SRDNLEINVL KEVDKLEIIS EIISEHEEES GIIYCSTKNE
VEELYKHMLY RGKSVGKYHG SLKDKEKNYY QEEFLNDNFK VMIATNAFGM GIDKPDVKFI
IHSTFPKSIE NYYQEIGRGG RDGSLAKCYL LYSEQDIRVM DYLISSTTEI SRRTIELKKL
EKIIEFCNYD KCLRKYILDY FGEENSIKYC NNCTNCLKNS DLIDMTLEAQ KILSCIYRTK
EAFGESVLID ILRGIHGPKI EKYKLYELST FGIMKEYTSK YIKDIIKELL SIKALERKEG
TYSMLKLNRK SIGILKGEEK VLLEVNNNEE SMCMDLELFK KLRILRKDIS RREGVKPYIV
FTDSMIMEII NKNPKSKEDL KNIRGFGEQK ITKYGPFILT TLRDYEKYGK RN
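
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. Translation table 11 assigns the same amino acid to every codon as the standard code (the two differ only in which codons may act as start codons), so the sketch below builds the standard codon table. It is a minimal illustration, not the database's own pipeline: the `translate` helper is hypothetical, and it raises `KeyError` on any base other than A, C, G, or T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TTT, TTC, TTA, TTG, TCT, ... order; '*' marks a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGAGAAGAG AGTTTGAGGT ATTAAATAAT"))  # MRREFEVLNN
print(translate("AGAAATTAG"))  # RN
```

The two outputs match the first ten and last two residues of the protein sequence. The final codon, TAG, is a stop codon and contributes no residue.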