Gene Rcas_0418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0418 
Symbol 
ID5537880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp530441 
End bp531658 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content59% 
IMG OID640892580 
ProductRpoD family RNA polymerase sigma factor 
Protein accessionYP_001430567 
Protein GI156740438 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.36258 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCGACT TTTGTGCTAT CTTACCATAT GCACATTATA ATAGGCCGTT ATGCACAGAA 
GGCTGGTGGA ATGTGAAAGA ACCCCTCGAC TCCTTCCTGG CCACGGCCCA CGAGTCGCAG
ACCTCTCCGC GCAATCACGT TCGTGTCCAT CGTCCTGTTG CCGAACCGGC GTCCGAGACT
GTCGAACGAA TACTTGAGCG TACTGTCGGC GATCACGATC TCTTCGATCA CCACGCCGAC
CCGTGTCATG CCCTCCACGA GCAGGACGAC GACTCGCTGG ATCACGATCT CGACGCCGAT
GTTGACGGCA TAGGAGTCGA TGATCCGGTC CGGGTCTACC TCCGTGAGAT CGGACGGGTC
AACCTGTTGA CTGCACAGGA AGAGATCATG CTGGCGCAAC AGGTCGAACG CGGCGAGCAG
GCGAACGAAC GGTTGCAGAA TGGCGATTAC ACTCCGGTTG AACGCCTCCA ACTTCACCGC
TGGGTTCAGG AAGGTCAGGC GGCGCGCGAG CGCCTCATCC AGGCGAACCT GCGCCTGGTC
GTATCCATTG CCAAAAAGTA CCTTGGGCGC GGTATGTCCC TGCTCGACCT GATCCAGGAG
GGCAACATCG GTTTAATGCG CGCCACCGAA AAGTTCGATT ATCGCAAAGG GTACAAGTTT
TCGACGTATG CCACGTGGTG GATCCGCCAG GCGATCACGC GCGTGATCGC CGATCAGAGT
CGCACGATCC GCCTGCCGGT GCACGTTGGC GAAACGATCA ATCGGGTGAT GCGCACCAGC
AACCGCATCC AGCAGACGAC CGGGCGCGAC CCGACGCCGG ACGAAATCGC GCTTGAACTC
GGCATTCCGG TTGAGAAGGT GCGGCGGGTG CTGGAAGCCG CGCGCCAGAC GATCTCGCTC
GAAACTCCGA TTGGCCCAGA AGGGGATTCG GTGCTGGCGG ATTTCATCGA GGATGGCAAG
GGCGCGACGC CGATGGAAAG CGCATCGAGC CATATTCTGC GCGAACAGAT CGACAGTGCG
CTCGAGAAGT TGCCCGAACG CGAACGCCGC ATCATTCAGT TGCGCTATGG GTTGTACGAT
GGGCACTACC GCACTCTGGA AGAGGTCGGG CGCGAGTTTG GCATCACTCG CGAGCGCATT
CGTCAGATCG AGGCGCGTGT GCTGCGCAAG TTGCGCCATC CGCACTATGG GCGCGGTTTG
CGTGGTTATC TCGAATAA
 
Protein sequence
MCDFCAILPY AHYNRPLCTE GWWNVKEPLD SFLATAHESQ TSPRNHVRVH RPVAEPASET 
VERILERTVG DHDLFDHHAD PCHALHEQDD DSLDHDLDAD VDGIGVDDPV RVYLREIGRV
NLLTAQEEIM LAQQVERGEQ ANERLQNGDY TPVERLQLHR WVQEGQAARE RLIQANLRLV
VSIAKKYLGR GMSLLDLIQE GNIGLMRATE KFDYRKGYKF STYATWWIRQ AITRVIADQS
RTIRLPVHVG ETINRVMRTS NRIQQTTGRD PTPDEIALEL GIPVEKVRRV LEAARQTISL
ETPIGPEGDS VLADFIEDGK GATPMESASS HILREQIDSA LEKLPERERR IIQLRYGLYD
GHYRTLEEVG REFGITRERI RQIEARVLRK LRHPHYGRGL RGYLE