Gene Cphy_3521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3521 
Symbol 
ID5743633 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp4349614 
End bp4350732 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content38% 
IMG OID641294632 
ProductRpoD family RNA polymerase sigma factor 
Protein accessionYP_001560609 
Protein GI160881641 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0582772 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGAAA CCATGGCAAA ATTTCACGAG AAGCTGAAAG AATTGTTGGA GTATGCAAAA 
AAGAAAAAGA ATGTACTAGA ATACGAAGAG ATTAATGACT TCTTTTCTGA TATGGAGATT
GATGCGGACC GTATTGAGAA GATCTATGAG TATTTAGAGG CACATAATAT TGATGTTTTG
CGTGCACCAG AATTAGAGGA AGAAGAAGAG ATTGATGAAA GCGATTTAGC TTTGATCGAA
GCAGAAGAAG AGGAAATAAC TAACATCGAT TTAACGGTAC CAGAGGGAAT TAGTACAGAA
GATCCGGTGC GAATGTATCT TAAGGAAATT GGTAAAGTAC CGCTTTTAAG TGCAGATGAA
GAAATTGTTC TTGCTCAGCG TATGGAGCAA GGAGATAAAA ATGCAAAGAA AAGACTTGCA
GAAGCAAATT TAAGGCTGGT AGTTAGCATA GCAAAACGCT ATGTTGGCCG TGGCATGCAG
TTTTTAGATT TAATACAGGA AGGAAATCTT GGATTAATTA AAGCGGTAGA AAAATTCGAT
TATCGAAAAG GATATAAATT TAGTACGTAT GCAACTTGGT GGATTCGTCA GGCGATAACA
AGAGCCATTG CAGATCAGGC GAGAACAATC CGTATTCCAG TTCATATGGT AGAGACCATA
AATAAATTAA TCCGTGTGCA ACGCCAGCTA TTACAAGAGC TTGGTAGAGA ACCATTTCCA
GAAGAAGTTG CTAAGGAAAT GAATATTCCA GTAGAACGTG TACGTGAAAT TCAGAAAATT
TCTCAAGAAC CAGTTTCTCT TGAAACTCCA ATCGGTGAGG AAGAAGACAG TCACTTGGGT
GATTTTATTC AGGATGATAA CGTACCAGTT CCAGCAGAAG CAGCAGCTTT TACTTTATTA
AAAGAACAGT TAATGGAGGT ACTTGGCACA TTAACTGATC GTGAGCAAAA AGTATTGCGA
TTACGTTTTG GGTTAGATGA TGGTCGTGCC AGAACGCTGG AAGAAGTAGG AAAAGAGTTT
AATGTAACGA GAGAACGAAT CCGTCAAATC GAGGCAAAAG CACTTCGCAA ATTGAGACAC
CCAAGTCGTT CTCGCAAATT AAAGGATTAT TTAGAGTAA
 
Protein sequence
MDETMAKFHE KLKELLEYAK KKKNVLEYEE INDFFSDMEI DADRIEKIYE YLEAHNIDVL 
RAPELEEEEE IDESDLALIE AEEEEITNID LTVPEGISTE DPVRMYLKEI GKVPLLSADE
EIVLAQRMEQ GDKNAKKRLA EANLRLVVSI AKRYVGRGMQ FLDLIQEGNL GLIKAVEKFD
YRKGYKFSTY ATWWIRQAIT RAIADQARTI RIPVHMVETI NKLIRVQRQL LQELGREPFP
EEVAKEMNIP VERVREIQKI SQEPVSLETP IGEEEDSHLG DFIQDDNVPV PAEAAAFTLL
KEQLMEVLGT LTDREQKVLR LRFGLDDGRA RTLEEVGKEF NVTRERIRQI EAKALRKLRH
PSRSRKLKDY LE