Gene CPF_2604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2604 
Symbol 
ID4203956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2871743 
End bp2873401 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content26% 
IMG OID638083471 
Productresolvase family site-specific recombinase 
Protein accessionYP_696994 
Protein GI110800035 
COG category[L] Replication, recombination and repair 
COG ID[COG1961] Site-specific recombinases, DNA invertase Pin homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0107725 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCAC TCCCTAAAAA TATAGCTATA TATACTAGAG TATCTACTGA AAAGCAAGAT 
AGCAATAACT CTAAAGCTAA TCAACTTGAA AGTATAAAAC AGTATATAAA CAATCAAGGA
TGGAGTGATG TTCCCTATGA AATATATTCT GATACTCAAT CTGCTTCAAT AACATCTAAT
TCTCTTAATA TGTATAATGA TTCAGATACA ATGAATCCTA GCATATTTTT AAGAGATGGA
TTAAGACGAT TAATTTATGA TGCTAACTAT AAAAAATTTG ATAAATTAAT AGTTTATTCT
CATGATAGGT TATCTCGTGA TATATATGAG GGCTTATTAA TTAAACATAC TTTAAAGAAA
CTAAACATCG ATATATTATA TTCTAAAGCT GGTGAACAGA TAAATTCTGA AAACCAATCT
ATTAATAACT TTTTTGAAAA TATGCTAAGT AATATAGCAG CTCTTGAATC TAGTATTATT
GGTGGAAGGG TCCTTATTGG AAATAGACAT AATATATTAA ATAACATATG GGCTGGAGGT
CCAGCTCCCT ATGGTTATAA ATTAATTGCT CTGCCATCTA ACCGTAAAAA AAGTAAGCTT
TCAATATATG CTCCCGAAGC TAGAGTCGTT AAGAAAATTT TTGAATTATA TACAACAGGG
TACACCCCTA AAGATATAGT TCAATTTATT AAATCAGAAT ATAGCTACAA TAAAGATAGA
CTTTGGACAA TAAATAGTAT TAAAAGTATA TTAAATAATC CAGTATATAC AGGAACTATG
GTTTGGAATA AAAAAGGTGG TAAAAGAAAT CCAAGAAAAC ATCCAATTGA TGAATATGTT
TATTCTAAGT TTGATGAAAA CATAAGAATA ATAGATGAAA AGTTGTGGAA AAAATCTTTA
AACATACGAA AGCTTCAAGA TGAGAACCCT AAATTTTTAT CAACAGCCTT TTTACTTCAA
GGAATGCTTG TGTGTAAAAG CTGTGGAAAT TATTTAACTA GTAAAAATCA TGGAAATTCA
TCTGGTAGAG TCTACTTTTG CTCTCATAAT AATAAAGATA AATTTTCTAA AAAAATAAAT
GCTCCAACAA TTACAATGAA AGCATCACTA ATTCACAATA TTGTTTTTAA AGAACTACTA
AATCTTTATA AATCAATATT AAGTATTCCT GCATTTTTTG ATGAGTTTTA TTCTAACTAT
TTAAATAAAT TAAAAGAAAA AAATAACAAC CTATTAGTTC AGAAAGATGA AATTGAAAGT
AATATAAACC ATTGTGATGA TATATTAATG AAATGCTCAT CTGAAATTTC AAGATTGCTA
GAACTAACTA ATGTTATGAG TAAAGATATT GCTGATATTG ACTATCATGA AAAAAATTTA
TTGTTACTTG ATTCAATTAA AGAATTTAAA ACTAGCCTTA TTTTATCTAA ATCTCAATTA
AACGATGATT TAAATAAAAT AAATAGAAAG CTTTCAAAAT CTATACCTAG TAAAGCTTGT
TTTAAAGACT ATGTAGTAAA TACTTTAAAT CCTATTGAAG ATATACTTAG CCAAGAAAAT
TCAAAGATTA AAAACAGATG TTTAAGGCTA TTGCTACATG AAATTATTGA TAAAATAATA
ATTTCTGAAA CATTTGAAAT AGAAATAATA TTTAAATAG
 
Protein sequence
MNSLPKNIAI YTRVSTEKQD SNNSKANQLE SIKQYINNQG WSDVPYEIYS DTQSASITSN 
SLNMYNDSDT MNPSIFLRDG LRRLIYDANY KKFDKLIVYS HDRLSRDIYE GLLIKHTLKK
LNIDILYSKA GEQINSENQS INNFFENMLS NIAALESSII GGRVLIGNRH NILNNIWAGG
PAPYGYKLIA LPSNRKKSKL SIYAPEARVV KKIFELYTTG YTPKDIVQFI KSEYSYNKDR
LWTINSIKSI LNNPVYTGTM VWNKKGGKRN PRKHPIDEYV YSKFDENIRI IDEKLWKKSL
NIRKLQDENP KFLSTAFLLQ GMLVCKSCGN YLTSKNHGNS SGRVYFCSHN NKDKFSKKIN
APTITMKASL IHNIVFKELL NLYKSILSIP AFFDEFYSNY LNKLKEKNNN LLVQKDEIES
NINHCDDILM KCSSEISRLL ELTNVMSKDI ADIDYHEKNL LLLDSIKEFK TSLILSKSQL
NDDLNKINRK LSKSIPSKAC FKDYVVNTLN PIEDILSQEN SKIKNRCLRL LLHEIIDKII
ISETFEIEII FK