Gene CPR_2475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2475 
Symbol 
ID4204478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2695434 
End bp2697824 
Gene Length2391 bp 
Protein Length796 aa 
Translation table11 
GC content28% 
IMG OID642567025 
Productstage II sprulation protein E, putative 
Protein accessionYP_699729 
Protein GI110802462 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID[TIGR02865] stage II sporulation protein E 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.461942 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATATG GAGTTAAATT TGATAATTAC AAAAGAATAA AGGGTTCAGA AAATGTGAAA 
AATAATATGA AAATAGAAGC TTCAAGTATT GGTCTTATTA TGGCTGTATT TGTAGGTTTT
TTAATAAGCA GAGTGTATTT AGATATGACT TTTGGAGTAG TTCAAGTATT AGCCCCATTT
GGATTAGCAT ATTTAATTGC AATAACGAAT TATGAAAGAA AATATATATT ATCATCAAGC
TTGGGAGTTA TATTAGGGTA TATTACCTTG TTTAATAAGG TTAGCAACTT TTCGGCTTAC
ATAATAATAT CAACTATAGT TACCATAATT TCTATGAGTA AACTAAATAA AAAGGCTAGA
AATATAGCAA GTTTTATAAT GGTATTTATA GGATTATTTA TTTACGGAGT TTTAGCAGGA
AGTTCGGATA TAATTTTACA CTTAATAGGA GCTATTTCAG TAACAGCATT AGTTTTTCCT
ATATATTATA TTGTGAGCTA TACTTTAAAG TGCATAGAGG AAATAAATAC TCAACATTTT
TTCTTGATAG ATGAAATTGT AAGTATAGAA TTATTTATAT GTTTATTAAT AGTTGGAATT
GGAACCATAT CAATAAATGA TATTAGTTTT AGAAATATAA TAGCTATACT TTTTATTATA
GCTTTAGCAT TTATTTCAGA TACTAATATG GGGGCTGGAG CAGGTATAAC TATGGGTATA
ATATTAGGAT TTGCTACAGG AAATCTTATG GAGAGCATAG CCATTTATGG AGCTTGTGGT
TTAGTTGCAG GCATATTTAG AGAGTCAGGA AAGCTTTTTA CAGCCTTATC ATTTAACATA
ATTTTTATAA TAGTAACTTT ATATTCTGGA GTTTTTAATA ATATTTCATT TGTAGAAACT
TTAGTTGGAA CAGTAATTTT TCTTTTAATT CCTAAAAAAA TATACAATAA AATTTCTTTA
GAGATAAACA AGGATAAAAA GGTTGGTCAT TTTAGTGAAG TTAGATTTTC TGAGATTAAA
GATGAGTTAA CTGAAAGATT AAAAGATTTT ACTGAAGTTT TATCTGTAAT GGGTAAATCA
TTAAATAATC TAGTTAGTAA TGATAAATTA GCTATAAAAA ATAAGGGGAA TGCATTGGTT
GAAAATTTAT CAGATAGAAC TTGTAGTGAT TGTGATATGA GATATATGTG TTGGAAGAGA
GAACTTCATC AAACTTATAA TGCTTTTTCA GATTTAATAA GAAATTATGA AAATAATTTG
GGTGATTTTC CTCATGAACT TGAGAAAAAG TGCATAAAGA AATATGCTCT AGTTAAAAAT
TTAGAGGACA TAATGAATAT CTATATGGTA AATGAAACTT TAAAGAGTAG ATTAGGAGAG
GGAAGAAAAA TCTTATCTAA TCATATAAAT AATATGTCAG TTACAATTAG TGAGATAGTT
GACGAGTTTG GAAATGAACT ACATCTATGT ACTGATGTGG AGAAAAGTAT AAAAAAATCT
CTTTTAAAGT ATGGCGTTAA CTTTGGAAGC TTAATATGTT ATAACGATAA AAATGGAAGA
ATTAAAATTA AGATGCAAAT GGAAAATTGT ATGGGATCTC AAACATGCAT AAAAACAGTT
CTTCCTATAA TAAGTGAGAC CATAGGGAAA AATATGAGTA TTGGAAGTGA AGGGTGTAAT
ATAAATAGCA AAAATAATAT GTGTGAGATC GTTGTTGAAG AAGCTCCTAA ATATCATATT
AATTCCCATG TAGCAGTTGC TACTAAAGAG GGAGAAAAAT TCACTGGAGA TTCATATTCA
TATGGTAGAA CAAAGGATGG TAATTATATA ACTGTAATAT CAGATGGTAT GGGATCAGGC
CCTGAAGCAG GACTTGAAAG TAAAGTCTCA GTAGAAATAA TAGAAAAATT TATGGATGTT
GGTTTTGATG AAAAAATAGC TATTGATGCA GTTAATGCAA TTATGAGTAT AAAGTTTAGT
GAAGATGAAA AGTTTTCTAC ATTAGATATG AGTAAGATAG ACTTATATAC TGGAAATGCT
AAGTTTATGA AAGTTGGAGC TATAGAAAGT TTTATAAAGA GAGGAAATAA GGTGGAGGTT
ATAAATTCAA ATACACTTCC TTTTGGTGTC TTAGAAGAAC CAGATGTAGA CACTGTTGAA
AAGCAAGTAA GTAATGGGGA TGTAATAGTT AGTATAAGTG ATGGTATTTT AGATGTTAAA
AATGATGGAA GTTTTGATAC TACATGGTTA ATTGAATTTT TAAAGAATAC TAAGTATAGA
CAGCCTAAGG ATTTATCAAT AGCTATTTTA GAAAAAGCAA AGGAATTAAG TGGAGGAAAG
GCTAAGGATG ATATGACAGT GGTTGTATCT AAGGTTTTTG CAATAAATTA A
 
Protein sequence
MQYGVKFDNY KRIKGSENVK NNMKIEASSI GLIMAVFVGF LISRVYLDMT FGVVQVLAPF 
GLAYLIAITN YERKYILSSS LGVILGYITL FNKVSNFSAY IIISTIVTII SMSKLNKKAR
NIASFIMVFI GLFIYGVLAG SSDIILHLIG AISVTALVFP IYYIVSYTLK CIEEINTQHF
FLIDEIVSIE LFICLLIVGI GTISINDISF RNIIAILFII ALAFISDTNM GAGAGITMGI
ILGFATGNLM ESIAIYGACG LVAGIFRESG KLFTALSFNI IFIIVTLYSG VFNNISFVET
LVGTVIFLLI PKKIYNKISL EINKDKKVGH FSEVRFSEIK DELTERLKDF TEVLSVMGKS
LNNLVSNDKL AIKNKGNALV ENLSDRTCSD CDMRYMCWKR ELHQTYNAFS DLIRNYENNL
GDFPHELEKK CIKKYALVKN LEDIMNIYMV NETLKSRLGE GRKILSNHIN NMSVTISEIV
DEFGNELHLC TDVEKSIKKS LLKYGVNFGS LICYNDKNGR IKIKMQMENC MGSQTCIKTV
LPIISETIGK NMSIGSEGCN INSKNNMCEI VVEEAPKYHI NSHVAVATKE GEKFTGDSYS
YGRTKDGNYI TVISDGMGSG PEAGLESKVS VEIIEKFMDV GFDEKIAIDA VNAIMSIKFS
EDEKFSTLDM SKIDLYTGNA KFMKVGAIES FIKRGNKVEV INSNTLPFGV LEEPDVDTVE
KQVSNGDVIV SISDGILDVK NDGSFDTTWL IEFLKNTKYR QPKDLSIAIL EKAKELSGGK
AKDDMTVVVS KVFAIN