Gene Rcas_4348 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4348 
Symbol 
ID5541861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5599697 
End bp5600896 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content58% 
IMG OID640896454 
ProductRpoD family RNA polymerase sigma factor 
Protein accessionYP_001434390 
Protein GI156744261 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.866985 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGAAG GCAGATCCAT GACCACCGAG ATCGTGGAGC AGACACGGCA GGCGTGGGCG 
CAAACGCTCG AATACCTCCT CGACATTGGG CGCACACGCG GGTTCCTCAC CTATAACGAA
ATTCTTGAAG CGCTTCCCCA ACCCGAATAT CACGTTGCCG ATGTTGATCA ATTGTACGCT
TCATTGCAGG CGGAAGGCAT TCGGGTGGTC GAGACCCCGC TCGATATGAG CGACCACGGC
GCGGTCGGCG ATGATGAATT GCTGGCGGAA ATGCCGGACC TGACCGATGT GGCGCTCGAT
GATCCGGTTC GGATGTATTT ACAGGAGATT GGTCAGGTGC CATTGTTGTC GGCAGAGCAG
GAAGTGATGC TGGCAAAGGC GATGGAGGCC GGGCATCGCG CCCGGCGCGC GCTTGAGTGC
GAGGAGTATA GCTCCTGGCA GGAACGGATG ATGTACGAAC AGCAGGTTGC ACAGGGGAAT
GAAGCGCGTC AGCACCTGAT CCAGGCCAAT CTGCGTCTGG TCGTCTCGAT CGCCAAAAAA
TATACGTCGT ATGGTCTGAC GATGATGGAC CTGGTGCAGG AAGGCAATAT CGGACTTATG
CGTGCAGTCG AAAAGTTCGA CTATACCAAG GGGCACAAGT TCTCTACCTA CGCCACCTGG
TGGATTCGCC AGGCGATCAC GCGCGCCATC GCCGATCAAA GCCGCACCAT CCGCCTGCCG
GTGCATATGG GTGAGGCGAT TAGTCAGGTC AAGCGCGCCT CGCATAAACT TCAGCAGATG
ATGCAGCGCG AACCGACACC GGAAGAGATC GCCGATGCGA TGGGCATCAG TTCGACGAAG
GTGCGTCGCA CGCTCGAAGC CTCGATGCAC CCGCTATCGC TGGAAATGCC GGTTGGGCAG
GAAGGCGAGG GCCGTATGGG CGACTTTATC GAAGATGATC GTATCTCGAC CCCGGCTGAA
GCTGCAGCTG CATCGATGCT GCGCGAGCAA CTCGAAGAGG TGCTGCAAAA ACTCCCTGAA
CGGGAACGGA AGATTATTCA GTTGCGCTAT GGGCTGAAGG ATGGTCGTTA CCGCACACTG
GAAGAAGTCG GTATGGAATT TGGCATCACC CGCGAGCGCA TCCGGCAGAT CGAAGCCGTG
GCGCTGCGGA AATTGCGCCA TCCCCACCTT GGCAAGAAGT TGCGCGGTTA CCTCGATTGA
 
Protein sequence
MQEGRSMTTE IVEQTRQAWA QTLEYLLDIG RTRGFLTYNE ILEALPQPEY HVADVDQLYA 
SLQAEGIRVV ETPLDMSDHG AVGDDELLAE MPDLTDVALD DPVRMYLQEI GQVPLLSAEQ
EVMLAKAMEA GHRARRALEC EEYSSWQERM MYEQQVAQGN EARQHLIQAN LRLVVSIAKK
YTSYGLTMMD LVQEGNIGLM RAVEKFDYTK GHKFSTYATW WIRQAITRAI ADQSRTIRLP
VHMGEAISQV KRASHKLQQM MQREPTPEEI ADAMGISSTK VRRTLEASMH PLSLEMPVGQ
EGEGRMGDFI EDDRISTPAE AAAASMLREQ LEEVLQKLPE RERKIIQLRY GLKDGRYRTL
EEVGMEFGIT RERIRQIEAV ALRKLRHPHL GKKLRGYLD