Gene Rcas_2972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2972 
Symbol 
ID5540464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3856647 
End bp3857693 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content64% 
IMG OID640895092 
ProductRpoD family RNA polymerase sigma factor 
Protein accessionYP_001433049 
Protein GI156742920 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000649715 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000217703 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCCGAA TGATGCAGCA GCATCCGTGG GATGAACTCG ATGAGCGCCA GGAGATCGTC 
AATGAGGACG ACACGGCGCC TGTGACAGAT ATTGAGGAGA TGACAGCCGA GACACTGGAG
GAAACGCTGG AGCCGGATTC GACGCTCGAT TCGATCCAGC ACTACTTGCA GGAGATTGGC
CGCGTGCCGC TGTTGACAGC TGCGGAGGAG ATCGAACTCG CCGAGCGCAT GGAGCGCGGC
GCTGCCGCTG AACGTCGCCT GGCATCGGGG GAAGATCTCA GCCCGCAGTT GCGTCAGGCG
TTGCTCGCCG ATGTGGCCGC TGCTCAGGAG GCGCGCCGCC ATCTGATCCA GGCGAACCTG
CGCCTGGTGG TGAGCATTGC CAAGAAGTAT GTCGGGCGCG GACTCTCGTT GCTCGACCTC
ATCCAGGAAG GGAATATCGG ACTGATGCGC GCCGTCGAGA AGTTCGACTA CCACAAGGGG
AATCGTTTCT CGACGTATGC GACCTGGTGG ATCCGTCAGG CGGTGACCCG CGCAATTGCC
GAGCAGGGTC GCACTATTCG CCTGCCGGTG CATATGAGCG AATCGGTCGG GCAGGTTAAG
CGCACGGCGG ATCGCCTGGC GCAGGCGCTC GAGCGGCAGC CCACTCCTGA GGAGATCGCC
ACTGCACTTG GGCAGCCGAC CGAGCGGATT GAGCGCGTGC TCGAAGCGTC GCGCCGTCCG
GTGTCGCTCG AGACGCCGGT TGGCGAGGAC GGCGAGCATA CCCTAGGCGA TTTCTTGCAG
GACAGTGAAT TGCCCACACC GGTCGAAGCG GCGTCGCAGC AACTACTCCG GCGTGATCTG
GCGGCTGCGC TTGATCGCCT GAATGAGCGC GAACGCCGGA TCATTGATCT TCGCTATGGG
CTGGTGGACG GGCAGCGCCG CACACTCGAG GAGGTTGGGC GGGTGCTCGG AATGACCCGC
GAACGCGCGC GGCAGATCGA GGCGGAAGCG CTGCGGCGCC TGCGCGCGCC CGACGTTGGG
TTGCACCTGC GCGATTACCT TGAGTAG
 
Protein sequence
MSRMMQQHPW DELDERQEIV NEDDTAPVTD IEEMTAETLE ETLEPDSTLD SIQHYLQEIG 
RVPLLTAAEE IELAERMERG AAAERRLASG EDLSPQLRQA LLADVAAAQE ARRHLIQANL
RLVVSIAKKY VGRGLSLLDL IQEGNIGLMR AVEKFDYHKG NRFSTYATWW IRQAVTRAIA
EQGRTIRLPV HMSESVGQVK RTADRLAQAL ERQPTPEEIA TALGQPTERI ERVLEASRRP
VSLETPVGED GEHTLGDFLQ DSELPTPVEA ASQQLLRRDL AAALDRLNER ERRIIDLRYG
LVDGQRRTLE EVGRVLGMTR ERARQIEAEA LRRLRAPDVG LHLRDYLE