Gene CPF_2598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2598 
SymbolhsdS 
ID4201421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2865161 
End bp2866414 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content26% 
IMG OID638083465 
Producttype I restriction-modification enzyme, S subunit 
Protein accessionYP_696988 
Protein GI110800744 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0133376 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAAAG AAGTAAGAGA AGGATATAAG ATGACTGAAT TAGGTGAGAT ACCCAATGAA 
TGGGAGGTTT GTCGCATAGA TGACTTATGC AAAGTGAATT CTAAATCATT AAATTCTAAG
ACAGAACCAA ATCTAGTTGT TAATTATATA GATATAGAAA GTGTATCAAC TGGTAAAATA
AATAATATTA AACAAATGAT ATTTAGTCAA GCACCTAGTA GAGCGAGAAG AGTTGTGAAG
AAAAATGATG TTATAATGTC AACAGTAAGA CCATATTTAA AAGCTTTTGT AAAGGTTAAG
AGTAGTTTGA ATAATTTAGT GTGTTCAACT GGATTTGCTG TGTTAGAAGT GAATGAAGGC
GTTAATTCGG AATTTGTATA TCAATCAATA TTAAGTAATT ATTTCATAGA ACAAATAAAA
AATAAAATGG TAGGTTCAAA TTATCCAGCG GTAAATTCAG ATGATGTTAA AGAAAGTAAG
TTGATATTAC CAAGTATACA AGAGCAAGAA AAAATTGCTG AAATACTTTC AACTGTAGAT
GAACAAATAG AAAATACAGA AAAATTAATA CAAAAAAATC AAGAACTTAA AAAAGGATTA
ATGCAACAAT TATTGACAAA AGGAATAGGA CATACAGAAT TTAAGAAAAC AGAATTAGGT
TATATACCAA AGGAATGGAA AATCATGAAG TTAGGAGAAG TATGTGATTT TAAACAAGGA
TTCCAAATTC CAAGAAGTGA GCAAATTAAT GAAGAAAAAG ATGGATATAT AAGGTATCTT
TATATAACTG ATTTCTTTTC AAATAATAAC AAGTTATTTA TAAAAGGTTC TGATAAATAT
TATTATATAA AATCAGATGA TATTACAATA GCTAACACAG GAAATACTTG TGGTAAAGCA
TTTAAAGGAG CTGAAGGGAT TTTAAGTAAT AATATGTTTA AGATATTTAA TAATAAAGAA
GTTTTGTTGA ATGATTTTCT ATGGCAATAT TTAAATAGTA ATTACTATTG GAAAGAGCTT
AATAAATATT TTAATACTGC TGGTCAACCA CATGTAGGGC ATAAAAATAT GGCAAATTTA
ATGATTGCTA TTCCTGAATC GTTAAATGAA CAAAGTGAAA TAGCTTTAAT ATTATCATCA
ATAGATAAAA GAATAGAAAA ATATGAAAAC AAAAAAGAAA AATTAAAAGA ATTAAAAAAA
GGATTAATGC AGCAATTATT AACAGGATAT ATAAGGCTTA TTTGGAATGA TTAA
 
Protein sequence
MKKEVREGYK MTELGEIPNE WEVCRIDDLC KVNSKSLNSK TEPNLVVNYI DIESVSTGKI 
NNIKQMIFSQ APSRARRVVK KNDVIMSTVR PYLKAFVKVK SSLNNLVCST GFAVLEVNEG
VNSEFVYQSI LSNYFIEQIK NKMVGSNYPA VNSDDVKESK LILPSIQEQE KIAEILSTVD
EQIENTEKLI QKNQELKKGL MQQLLTKGIG HTEFKKTELG YIPKEWKIMK LGEVCDFKQG
FQIPRSEQIN EEKDGYIRYL YITDFFSNNN KLFIKGSDKY YYIKSDDITI ANTGNTCGKA
FKGAEGILSN NMFKIFNNKE VLLNDFLWQY LNSNYYWKEL NKYFNTAGQP HVGHKNMANL
MIAIPESLNE QSEIALILSS IDKRIEKYEN KKEKLKELKK GLMQQLLTGY IRLIWND