Gene Ccel_0541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0541 
Symbol 
ID7309413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp627725 
End bp628804 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content40% 
IMG OID643607475 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_002504903 
Protein GI220927994 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000181279 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAACATA GTCAAGAAAG TAGAAAAGCA TTACTGAAAG AATTGATAGA CAGAGGCAAA 
CAAAAAGGAA TGCTAACCTA CAAAGAGATT ATGGATGCCT TTGAGGAAAT CGAGCTTGAG
CCGGAGCACA TTGAAAAAAT ATATGAAACA CTTGAAACCA TGGGCGTTGA TGTTATCGGA
GACATAGACT CCGAAATGGA AGAGATTCAG CTTACGGACG AGGAACTTGA TATTAGTGTT
CCAGAAGGTG TAAGTATAGA TGATCCAGTA CGTATGTACC TTAAAGAAAT CGGTAAGGTC
CCCCTTTTAA CAGCAGATGA GGAAATTGAC CTTGCCCATA GAATGGAAAA CGGTGATATA
GAAGCAAAAA GGAGATTAGC CGAGGCGAAC TTAAGGCTGG TTGTTAGTAT AGCAAAAAGG
TATGTTGGTA GGGGAATGCA GTTCCTTGAC TTGATTCAGG AAGGGAATCT CGGATTAATT
AAAGCAGTTG AAAAATTTGA TTATAGAAGA GGTTTCAAAT TCAGTACCTA CGCAACTTGG
TGGATAAGAC AAGCTATCAC AAGGGCAATT GCCGATCAGG CCAGAACTAT ACGTATCCCT
GTTCATATGG TGGAAACTAT CAATAAATTA ATCAGGGTAT CAAGGCAATT GTTGCAGGAA
TTAGGAAGAG AACCACAACC TGATGAAATT GCAAAAGAGA TTGGTATGTC TGTAGATAAA
GTTCGTGAAA TAATGAAGAT TTCACAAGAG CCCGTATCAC TTGAAACTCC TATTGGAGAA
GAGGAAGACA GCCATCTAGG TGACTTTATT CCGGACGATG ATGCTCCCGC TCCGGCTGAA
GCGGCTGCAT TTACGTTACT TAAAGAACAG CTTATTGATG TACTTGATAC TTTAACTCCA
CGTGAAGAGA AGGTTTTGAG GCTAAGATTC GGTTTAGATG ACGGAAGAGC CAGAACTCTT
GAGGAAGTGG GCAAGGAGTT TAACGTAACC AGAGAACGAA TACGCCAAAT AGAAGCAAAG
GCGTTAAGAA AACTGAGGCA TCCAAGTAGA AGTAAAAAGT TAAAGGATTA TTTAGACTGA
 
Protein sequence
MKHSQESRKA LLKELIDRGK QKGMLTYKEI MDAFEEIELE PEHIEKIYET LETMGVDVIG 
DIDSEMEEIQ LTDEELDISV PEGVSIDDPV RMYLKEIGKV PLLTADEEID LAHRMENGDI
EAKRRLAEAN LRLVVSIAKR YVGRGMQFLD LIQEGNLGLI KAVEKFDYRR GFKFSTYATW
WIRQAITRAI ADQARTIRIP VHMVETINKL IRVSRQLLQE LGREPQPDEI AKEIGMSVDK
VREIMKISQE PVSLETPIGE EEDSHLGDFI PDDDAPAPAE AAAFTLLKEQ LIDVLDTLTP
REEKVLRLRF GLDDGRARTL EEVGKEFNVT RERIRQIEAK ALRKLRHPSR SKKLKDYLD