Gene CPR_1975 details

Gene Information

Locus tag: CPR_1975
Symbol: rpoD
ID: 4204171
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 2181235
End bp: 2182335
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 32%
IMG OID: 642566525
Product: RNA polymerase sigma factor RpoD
Protein accession: YP_699284
Protein GI: 110803528
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
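
The coordinate and length fields above are linked by simple arithmetic, which the short Python sketch below checks. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the stop codon counts toward the gene length but not the protein length. The function names are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2181235, 2182335)
print(length)                  # 1101
print(protein_length(length))  # 366
```

Both results agree with the Gene Length and Protein Length fields listed for CPR_1975.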


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 6.4083e-06
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCTA AAACAACAAA AGCAAAAAAA GGTGAAGGTA AAGAAAAAGT TGACAAAATG 
GTCTTAGTAA AAAGACTTAT AGATAAAGGT AAAAAAAGCG GTTCATTGAC TTACAAGGAG
ATAATGGATG AGCTTGATGA AATAGAATTA AATCCAGAAC AAATAGAGAA AATCTATGAG
GTTCTAGAAT CAATGGGTAT AGAGGTCATA AGTGAAATCG AGCAAGAAGA GGAAGAGGAG
GAAGAATTAG ATCTTTCTGT TCCAGAAGGT ATTGCTATTG ATGACCCTGT AAGAATGTAC
TTAAAAGAAA TAGGTAAAGT TCCACTATTA TCATCAGAGG ATGAAATAGA GCTTGCTAAA
AAAATAGAAG AAGGAAGCAA CTATGCTAAG AAAAAATTAG CAGAGGCTAA CTTAAGACTT
GTTGTAAGTA TAGCTAAAAG ATATGTTGGT AGAGGAATGT TATTCCTAGA TCTTATACAA
GAAGGTAATT TAGGTCTTAT AAAGGCTGTT GAAAAATTTG ATTACAGAAA AGGGTATAAG
TTCTCAACAT ATGCTACATG GTGGATAAGA CAGGCAATAA CTAGAGCTAT TGCTGACCAA
GCAAGAACTA TAAGAATACC AGTTCATATG GTAGAAACTA TAAATAAGCT TATAAGAATA
CAAAGACAAT TAGTTCAAGA GTTAGGAAGA GATCCATTAC CAGAGGAATT ATCAAAACAA
ATGGATATGC CAGTAGATAA GGTAAGAGAA ATCTTAAAAA TAGCTCAAGA ACCAGTTTCA
TTAGAAACTC CAATTGGTGA AGAGGAAGAT TCACATTTAG GTGACTTTAT ACCAGATGAT
GATGCTCCAG CACCAGCAGA GGCAGCAGCA TTTACAATGT TAAAAGAACA ATTAATAAAT
GTTTTAGATA CTTTAACTCC TAGAGAGGAA AAAGTATTAA GATTAAGATT TGGATTAGAT
GATGGAAGAG CTAGAACTCT TGAAGAAGTT GGTAAAGAAT TCAACGTAAC TAGAGAGAGA
ATTAGACAGA TTGAAGCAAA AGCTTTAAGA AAATTAAGAC ATCCAAGTAG AAGTAAAAAG
TTAAAAGATT ATTTAGATTA G
 
Protein sequence
MKAKTTKAKK GEGKEKVDKM VLVKRLIDKG KKSGSLTYKE IMDELDEIEL NPEQIEKIYE 
VLESMGIEVI SEIEQEEEEE EELDLSVPEG IAIDDPVRMY LKEIGKVPLL SSEDEIELAK
KIEEGSNYAK KKLAEANLRL VVSIAKRYVG RGMLFLDLIQ EGNLGLIKAV EKFDYRKGYK
FSTYATWWIR QAITRAIADQ ARTIRIPVHM VETINKLIRI QRQLVQELGR DPLPEELSKQ
MDMPVDKVRE ILKIAQEPVS LETPIGEEED SHLGDFIPDD DAPAPAEAAA FTMLKEQLIN
VLDTLTPREE KVLRLRFGLD DGRARTLEEV GKEFNVTRER IRQIEAKALR KLRHPSRSKK
LKDYLD
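
The protein sequence can be reproduced from the gene sequence by conceptual translation. The Python sketch below is a minimal illustration rather than the database's own method: it strips the spacing used in the listing, translates codon by codon until the first stop codon, and computes a GC fraction. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in which start codons are permitted), so the standard assignments are used here. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and only the first and last rows of the gene sequence are used as sample input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the blanks and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = clean_sequence(
    "ATGAAAGCTA AAACAACAAA AGCAAAAAAA GGTGAAGGTA AAGAAAAAGT TGACAAAATG"
)
last_row = clean_sequence("TTAAAAGATT ATTTAGATTA G")

print(translate(first_row))  # MKAKTTKAKKGEGKEKVDKM
print(translate(last_row))   # LKDYLD
```

The two outputs match the first 20 and last 6 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives the value to compare with the GC content field.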