Gene CPF_2264 details

Gene Information

Locus tag: CPF_2264
Symbol: rpoD
ID: 4202727
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 2511938
End bp: 2513038
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 32%
IMG OID: 638083129
Product: RNA polymerase sigma factor RpoD
Protein accession: YP_696687
Protein GI: 110798581
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
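
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; both are assumptions about this record's conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2511938
end_bp = 2513038

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the coding sequence includes one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed gene length (1101 bp) and protein length (366 aa).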


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0028881
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCTA AAACAACAAA AGCAAAAAAA GGTGAAGGTA AAGAAAAAGT TGACAAAATG 
GTCTTAGTAA AAAGACTTAT AGATAAAGGT AAAAAAAGCG GTTCATTGAC TTACAAGGAG
ATAATGGATG AGCTTGATGA AATAGAATTA AACCCAGAAC AAATAGAAAA AATCTATGAG
GTTCTAGAAT CAATGGGTAT AGAGGTCATA AGTGAAATAG AGCAAGAAGA GGAAGAGGAG
GAAGAATTAG ATCTTTCTGT TCCAGAAGGT ATTGCTATTG ATGACCCTGT AAGAATGTAC
TTAAAAGAAA TAGGTAAAGT GCAACTATTA TCATCAGAGG ATGAAATAGA GCTTGCTAAA
AAAATAGAAG AAGGAAGCAA CTATGCTAAG AAAAAATTAG CAGAGGCTAA CTTAAGACTT
GTTGTAAGTA TAGCTAAAAG ATATGTTGGT AGAGGAATGT TATTCCTAGA TCTTATACAA
GAAGGTAACT TAGGTCTTAT AAAGGCTGTT GAAAAATTTG ATTACAGAAA AGGGTATAAG
TTCTCAACAT ATGCTACATG GTGGATAAGA CAGGCAATAA CTAGAGCTAT TGCTGACCAA
GCAAGAACTA TAAGAATACC AGTTCATATG GTAGAAACTA TAAATAAACT TATAAGAATA
CAAAGACAAT TAGTTCAAGA GTTAGGAAGA GATCCATTAC CAGAGGAATT ATCAAAACAA
ATGGATATGC CAGTAGATAA GGTAAGAGAA ATCTTAAAAA TAGCTCAAGA ACCAGTTTCA
TTAGAAACTC CAATTGGTGA AGAGGAAGAT TCACATTTAG GTGACTTTAT ACCAGATGAT
GATGCTCCAG CACCAGCAGA GGCAGCAGCC TTTACAATGT TAAAAGAACA ATTAATAAAT
GTTTTAGATA CTTTAACTCC TAGAGAGGAA AAAGTATTAA GATTAAGATT TGGATTAGAT
GATGGAAGAG CTAGAACTCT TGAAGAAGTT GGTAAAGAAT TCAACGTAAC TAGAGAGAGA
ATTAGACAGA TTGAAGCAAA AGCTTTAAGA AAATTAAGAC ATCCAAGTAG AAGTAAAAAG
TTAAAAGATT ATTTAGATTA G
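
The gene sequence is laid out in space-separated blocks of ten bases, so it needs to be joined before any computation. A minimal sketch of that cleanup, followed by a GC-fraction calculation, is shown here; the helper name `gc_fraction` is hypothetical, and only the first row of the sequence is used, so the result will not match the whole-gene GC content reported in this record.

```python
def gc_fraction(blocked_seq: str) -> float:
    """Return the fraction of G and C bases in a whitespace-blocked sequence."""
    # Remove the spaces and line breaks used for display.
    seq = "".join(blocked_seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)

# First row of the gene sequence, copied as displayed.
first_row = "ATGAAAGCTA AAACAACAAA AGCAAAAAAA GGTGAAGGTA AAGAAAAAGT TGACAAAATG"

print(f"{gc_fraction(first_row):.1%}")
```

Passing the full sequence, with every row concatenated, to the same function would give the figure to compare against the reported GC content.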
 
Protein sequence
MKAKTTKAKK GEGKEKVDKM VLVKRLIDKG KKSGSLTYKE IMDELDEIEL NPEQIEKIYE 
VLESMGIEVI SEIEQEEEEE EELDLSVPEG IAIDDPVRMY LKEIGKVQLL SSEDEIELAK
KIEEGSNYAK KKLAEANLRL VVSIAKRYVG RGMLFLDLIQ EGNLGLIKAV EKFDYRKGYK
FSTYATWWIR QAITRAIADQ ARTIRIPVHM VETINKLIRI QRQLVQELGR DPLPEELSKQ
MDMPVDKVRE ILKIAQEPVS LETPIGEEED SHLGDFIPDD DAPAPAEAAA FTMLKEQLIN
VLDTLTPREE KVLRLRFGLD DGRARTLEEV GKEFNVTRER IRQIEAKALR KLRHPSRSKK
LKDYLD
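
The relationship between the two sequences can be illustrated by translating the gene sequence codon by codon. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in which start codons are permitted), so the sketch below builds the standard codon table. The function name `translate` and the choice to stop at the first stop codon are illustrative assumptions, and only the first row of the gene sequence is translated.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(blocked_seq: str) -> str:
    """Translate a whitespace-blocked DNA sequence, stopping at the first stop codon."""
    seq = "".join(blocked_seq.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First row of the gene sequence, copied as displayed.
first_row = "ATGAAAGCTA AAACAACAAA AGCAAAAAAA GGTGAAGGTA AAGAAAAAGT TGACAAAATG"

print(translate(first_row))
```

The 60 bases of the first row correspond to the first 20 residues of the protein sequence, MKAKTTKAKK GEGKEKVDKM.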