Gene CPF_2789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2789 
SymbolspoIIE 
ID4203810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp3053682 
End bp3056072 
Gene Length2391 bp 
Protein Length796 aa 
Translation table11 
GC content28% 
IMG OID638083657 
Productstage II sporulation protein E 
Protein accessionYP_697161 
Protein GI110800430 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID[TIGR02865] stage II sporulation protein E 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.113996 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATATG GAGTTAAAAT TGATAATTAC AAAAGAGCAA AGGATTCAGA AAATGTAAAA 
AATAATATGA AAATAGAAGC TTCTAGCATT GGGCTTATTA TGGCTATGGT TGCAGGTCTT
TTAATAAGTA GAGTGTATTT AGATATGACT TTTGGAGTAG TTCAAGTATT AGCTCCATTT
GGATTAGCAT ATTTAATTGC AATAACAAAT TATGAAAGAA AATATATATT ATCATCAAGT
TTAGGAGTTA TATTAGGGTA TATTACCTTG TTTAATAAGG TTAGCAACTT TTCGGCTTAC
ATAATAATAT CAACTATAGT TACCATAATT TCTATGAGTA AACTAAATAA AAAGGCTAGA
AATATAGCAA GTTTTATTAT TGTATTTTCA GGATTGTTTG TTTATGGAGT TTTAGTAGGA
AGTTCAGATA TAATTTTACA CTTAATAGGT GCGATTTCAG TGACAGCCTT AGTTTTTCCT
ATATATTATG TTGTGAGCTA TACATTAAAG TGCATAGAAG AAATAAATAC TCAACATTTT
TTCTCAATAG ATGAAATTGT AAGTATAGAA CTATTTATAT GTTTATTAAT AGTTGGAATT
GGAACTCTAT CACTAAATGA TATTAGTTTT AGAAATATAG CAGCTATACT TTTTATTATA
GTTTTAGCCT TTATTTCAGA TACAAATATG GGTGCTGGAG CAGGTATAAC TATGGGCATA
ATATTAGGTT TTGCCACAGG GAATCTTATG GAGAGCATAG CTATTTATGG AGCTTGTGGC
TTAGTTGCAG GGATATTTAG AGAGTCAGGA AAGCTTTTTA CAGCCTTATC ATTTAACATA
ATTTTTATAA TAGTAACTTT ATATTCTGGA GTTTTTAATA ATATTTCATT TATAGAAGCC
TTAGTTGGAA CAGGAATATT TCTTTTAATT CCTAAAAAAA TATACAATAA AATTTCTTTA
GAGATAAACA AGGATAAAAA GGTTGGTCAT TTTAGTGAAG TTAGATTTGC TGAGATTAAG
GATGAGTTAA CTGAAAGATT AAAAGATTTT ACTGAAGTTT TATCTATAAT GGGTAAGTCA
CTAAATAATC TAGTTGGTAA TGATAAATTA GCTATAAAAA ATAAGGGTAA TGCATTGGTT
GAAAATTTAT CAGATAGAAC TTGTAGTGAC TGTGATATGA GATATATGTG TTGGAAGAGA
GAGCTTCATC AAACTTATAA TGCTTTTTCA GATTTAATAA GAAATTATGA AAATAATTCA
GGTGCTTTTC CTCATGAGCT TGAGAAAAAG TGCATAAAGA AATATGCTCT AGTTAAAAAT
TTAGAGGACA TAATGAATAT CTATATGGTA AATGAAACCT TAAAGAGTAG ATTAGGAGAG
GGAAGAAAAA TCTTATCTAA TCATATAAAC AATATGTCAG TTACAATTAG TGAAATAGTT
GACGAGTTTG GAAATGAACT GCATCTATGT ACTGATGTAG AGAAAAGCAT AAAAAAATCT
CTTTTAAAGT ATGGCATTAA TTTTGGAAGC TTAATATGTT ATAACGATAA AAATGGAAGA
ATTAAAATTA AGATGCAAAT GGAAAATTGT ATGGGATCTC AAACATGTAT AAAAACAGTT
CTTCCTATAA TAAGCGAGAC CATAGGAAAA AATATGAGTA TTGGAAGTGA AGGGTGTAAT
ATAAATAGCA AAAATAATAT GTGTGAGATT GTTATTGAAG AAGCGCCTAA ATATCATATT
AATTCCCATG TAGCAGTTGC TACTAAAGAG GGAGAAAAAT TCACTGGAGA TTCATATTCA
TATGGTAGAA CAAAGGATGG TAATTATATA ACTGTAATAT CAGATGGCAT GGGATCAGGA
CCTGAAGCAG GACTTGAAAG TAAAGTTTCA GTAGAAATAA TAGAAAAATT TATGGAAGTT
GGTTTTGATG AAAAAATAGC TATTGATGCA GTTAATGCAA TTATGAGTAT AAAGTTTAGT
GAAGATGAAA AGTTTTCTAC ATTAGATATG AATAAGATAG ACTTATATAC TGGAAATGCT
AAGTTTATGA AAGTTGGAGC TATAGAAAGC TTTATAAAGA GAGGAAATAA AGTAGAAGTT
ATAAATTCAA ATACACTTCC CTTTGGTGTC TTAGAAGAAC CAGATGTAGA TACTGTTGAA
AAGCAAGTAA GTAATGGGGA TGTAATAGTT AGTATAAGTG ATGGTATTTT AGATGTTAAA
AATGATGGAA GTTTTGATAC TACATGGTTA ATTGAATTTT TAAAGAATAC TAAGTATAGA
CAACCTAAGG ATTTATCAAT AGCTATTTTA GAAAAAGCAA AGGAATTAAG TGGAGGAAAG
GCTAAGGATG ATATGACAGT AGTTGTATCT AAGGTTTTTG CAATAAATTA A
 
Protein sequence
MQYGVKIDNY KRAKDSENVK NNMKIEASSI GLIMAMVAGL LISRVYLDMT FGVVQVLAPF 
GLAYLIAITN YERKYILSSS LGVILGYITL FNKVSNFSAY IIISTIVTII SMSKLNKKAR
NIASFIIVFS GLFVYGVLVG SSDIILHLIG AISVTALVFP IYYVVSYTLK CIEEINTQHF
FSIDEIVSIE LFICLLIVGI GTLSLNDISF RNIAAILFII VLAFISDTNM GAGAGITMGI
ILGFATGNLM ESIAIYGACG LVAGIFRESG KLFTALSFNI IFIIVTLYSG VFNNISFIEA
LVGTGIFLLI PKKIYNKISL EINKDKKVGH FSEVRFAEIK DELTERLKDF TEVLSIMGKS
LNNLVGNDKL AIKNKGNALV ENLSDRTCSD CDMRYMCWKR ELHQTYNAFS DLIRNYENNS
GAFPHELEKK CIKKYALVKN LEDIMNIYMV NETLKSRLGE GRKILSNHIN NMSVTISEIV
DEFGNELHLC TDVEKSIKKS LLKYGINFGS LICYNDKNGR IKIKMQMENC MGSQTCIKTV
LPIISETIGK NMSIGSEGCN INSKNNMCEI VIEEAPKYHI NSHVAVATKE GEKFTGDSYS
YGRTKDGNYI TVISDGMGSG PEAGLESKVS VEIIEKFMEV GFDEKIAIDA VNAIMSIKFS
EDEKFSTLDM NKIDLYTGNA KFMKVGAIES FIKRGNKVEV INSNTLPFGV LEEPDVDTVE
KQVSNGDVIV SISDGILDVK NDGSFDTTWL IEFLKNTKYR QPKDLSIAIL EKAKELSGGK
AKDDMTVVVS KVFAIN