Gene CPF_2494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2494 
Symbol 
ID4203306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2765439 
End bp2766818 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content31% 
IMG OID638083359 
Productserine protease 
Protein accessionYP_696908 
Protein GI110801229 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGATT TCAATAAAAA AGATGAAAAC TTAAATGACT ACTTTGGTTT TGATGATAAA 
GAAAATAAAA CTTTAGATTC AGATAAGGAC ATAAAAGACA AAGAAAATAT AGAGTCAAAT
AATGATACGA AGCAAACTAA CATAGATGAA ACTCAGCAAT TAAATATAGA TAATGAAATT
AATTCAAAAG ATGAAGTTAA AAAAGAGGAT GATAAAAACT TCTCTGACGT AAAATCAAAA
AGCTCAAAAG AATCTAATGA TAATAGTAAA AACAAAAAAG TAAAGAAAAA GAGTGGATTT
AAAAGAGGAA TAGCTCTAGT AGCAGGTGCT GTCATAGTTG CTATAATAGG AGGAGCTATA
GGTGCTGGCG GAGTTTATTA TGCTTTTAAA AATAGCATAC CAGTAAGTAC ACTAGAGAAT
AATAGTAATA CCTCAGTTAA TCCACCAGCC TTTAAAGGGG AAGATGGAGC ATTAACTGTT
CCACAAGTAG TTGAAAAAGT TACACCTGCC GTTGTAGGAG TATCCACAAA GAGCTTAGTA
AGAGACCAAT TCTTTAATGT AAAAGAACAA GAAGGATTAG GATCTGGATT TATAATAAAT
GAAGATGGAT ATGTAGTTAC AAATTACCAT GTTATAAATG GAGCTCAAGA AGTTAAAGTA
ATATTCTCTG ATGGAAAAGA AGTAAATGCT AAGGTTGTAA ATTACGATGC AGAAAGAGAT
ATCGCAGTAA TAAAAATAAC AGACGATGTT AAAATGCCTG GAATAGCACA ATTAGGAGAT
TCATCTACAG TTAAAGCTGG TGAAGAAGTA ATTGCTATAG GAAATCCTCT AGGAAAAGAA
TTTAGTTCAA CAGTAACTAA GGGTATAGTA AGTTCACCAA ATAGAAAGAT GAAGACTGAA
AACGGAAATG TATTAGATTA TATACAAACA GATGCAGCTA TTAACCCTGG TAATAGTGGT
GGTCCATTAA TAAACTCTAA GGGAGAAGTT ATTGGAATAA ACACTGCTAA AAAAGTTGGT
GAAGATATTG AAGGTATAGG ATTTGCAATT CCTATAAATG AAGTTAAAAC AAGATTAGGT
TCATTATCAA AGCCAATATT AAAACTTGGT ATTACAGCTA GAACTGTTAC TCCAGAATTA
GCAAAAGAAA ATAAGCTAGA AGAAGGAGTT TATGTTGTAG GAGTACAAGA GTTTAGTCCA
GCAGAAAAAG CAGGATTAAA AATAGGTGAC TTAATAGTTG AATTTGGTGG AAAAAGAGTA
AAAACTTTAG AAGAATTAAA TCAAGTTAAA AGTCAATATA ATGATGGAGA TTCAGTACCA
GTTGAAATAA TTAGAGATGG TAAGAAAGTA AACTTAAATT TAACATTAGT TGCTAATTAG
 
Protein sequence
MSDFNKKDEN LNDYFGFDDK ENKTLDSDKD IKDKENIESN NDTKQTNIDE TQQLNIDNEI 
NSKDEVKKED DKNFSDVKSK SSKESNDNSK NKKVKKKSGF KRGIALVAGA VIVAIIGGAI
GAGGVYYAFK NSIPVSTLEN NSNTSVNPPA FKGEDGALTV PQVVEKVTPA VVGVSTKSLV
RDQFFNVKEQ EGLGSGFIIN EDGYVVTNYH VINGAQEVKV IFSDGKEVNA KVVNYDAERD
IAVIKITDDV KMPGIAQLGD SSTVKAGEEV IAIGNPLGKE FSSTVTKGIV SSPNRKMKTE
NGNVLDYIQT DAAINPGNSG GPLINSKGEV IGINTAKKVG EDIEGIGFAI PINEVKTRLG
SLSKPILKLG ITARTVTPEL AKENKLEEGV YVVGVQEFSP AEKAGLKIGD LIVEFGGKRV
KTLEELNQVK SQYNDGDSVP VEIIRDGKKV NLNLTLVAN