Gene Cphy_0234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_0234 
Symbol 
ID5745102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp289502 
End bp290617 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content38% 
IMG OID641291324 
ProductRpoD family RNA polymerase sigma factor 
Protein accessionYP_001557360 
Protein GI160878392 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000417606 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGAAC AAGTAAATAC CTTTGAAGCA CGCCTGAAAG AATTAATTGC ATTTGCGAAT 
GATAATAAAG GTGTCATCGA AGTTGATAAA GTGAATGATT TCTTTAAAGA ATTAAATCTG
AATGTACGTC AGATTGATAA AATATATGAG TACCTTGAGG CAAACAATAT TGTTGTGCTT
AATCCGACGG ATGAGGACGA GCCTAACGAG GATGCCCTAC TCGAATTAGA AGATGATTCT
GATATGATAG GTGATACAGA AGATCTATCT GCTATGACGT CAACCATTTC TGACGACCCA
GTAAAACAAT ATCTTAAAGA AATCGGTAGC TACCCTCTTC TCTCTGTAGC AGAAGAAATT
GAGCTTGCTA AAAAAATTGA AGCTGGAGAT AATATGGCAA AGCAGATCCT TGCCGAATCA
AACCTTCGAT TAGTAGTCAG CATCGCAAAA CGATATGTAG GAAGAGGACT TTCTTTCCTT
GATTTAATTC AAGAAGGAAA TTTAGGACTT ATCAAAGCAG TTGACAAATT CGATTATAAC
AAAGGTTATA AATTTAGTAC CTACGCAACT TGGTGGATTC GTCAAGCAAT CACAAGATCC
ATTGCTGACC AGTCTCGTAC CATACGTATA CCGGTACATA TGTCAGAAGT TATCAATAAG
ACATATCGAG TATCAAGAAA TCTTCTCCAA GAATTAGGAC GTGAGCCTAG CGAACAGGAA
CTTGCAGATG CAATGAATCT CCCTATTGAA AAGGTACGTG AAATTCTTAA GGTATCTGCA
GACCCAATCT CCCTCGATAC ACCAATCGGT GAAGAGGACG ATAGCCATCT TGGTGATTTC
ATCAAAGATG ATACAATTAT GGGACCAGAA GATGCTGCAT CCTATGCCGT TTTACAAGAC
CAGATATCAA AACTACTAGA TACATTAACC GAGCGTGAAC AACGAGTTTT AATACTACGT
TTTGGTTTAC AAGATGGAAG AAGTCGTACT TTAGAAGAAG TTGGTAAAGA ATTTAACGTT
ACCAGAGAAC GTATCCGTCA GATTGAAGCA AAAGCACTTC GTAAATTAAG ACATCCAAGT
CGCGCACGGA TGTTAAAGGG TTATGAACTA AACTAA
 
Protein sequence
MEEQVNTFEA RLKELIAFAN DNKGVIEVDK VNDFFKELNL NVRQIDKIYE YLEANNIVVL 
NPTDEDEPNE DALLELEDDS DMIGDTEDLS AMTSTISDDP VKQYLKEIGS YPLLSVAEEI
ELAKKIEAGD NMAKQILAES NLRLVVSIAK RYVGRGLSFL DLIQEGNLGL IKAVDKFDYN
KGYKFSTYAT WWIRQAITRS IADQSRTIRI PVHMSEVINK TYRVSRNLLQ ELGREPSEQE
LADAMNLPIE KVREILKVSA DPISLDTPIG EEDDSHLGDF IKDDTIMGPE DAASYAVLQD
QISKLLDTLT EREQRVLILR FGLQDGRSRT LEEVGKEFNV TRERIRQIEA KALRKLRHPS
RARMLKGYEL N