Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2972 |
Symbol | |
ID | 5540464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 3856647 |
End bp | 3857693 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640895092 |
Product | RpoD family RNA polymerase sigma factor |
Protein accession | YP_001433049 |
Protein GI | 156742920 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000649715 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0000217703 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCCGAA TGATGCAGCA GCATCCGTGG GATGAACTCG ATGAGCGCCA GGAGATCGTC AATGAGGACG ACACGGCGCC TGTGACAGAT ATTGAGGAGA TGACAGCCGA GACACTGGAG GAAACGCTGG AGCCGGATTC GACGCTCGAT TCGATCCAGC ACTACTTGCA GGAGATTGGC CGCGTGCCGC TGTTGACAGC TGCGGAGGAG ATCGAACTCG CCGAGCGCAT GGAGCGCGGC GCTGCCGCTG AACGTCGCCT GGCATCGGGG GAAGATCTCA GCCCGCAGTT GCGTCAGGCG TTGCTCGCCG ATGTGGCCGC TGCTCAGGAG GCGCGCCGCC ATCTGATCCA GGCGAACCTG CGCCTGGTGG TGAGCATTGC CAAGAAGTAT GTCGGGCGCG GACTCTCGTT GCTCGACCTC ATCCAGGAAG GGAATATCGG ACTGATGCGC GCCGTCGAGA AGTTCGACTA CCACAAGGGG AATCGTTTCT CGACGTATGC GACCTGGTGG ATCCGTCAGG CGGTGACCCG CGCAATTGCC GAGCAGGGTC GCACTATTCG CCTGCCGGTG CATATGAGCG AATCGGTCGG GCAGGTTAAG CGCACGGCGG ATCGCCTGGC GCAGGCGCTC GAGCGGCAGC CCACTCCTGA GGAGATCGCC ACTGCACTTG GGCAGCCGAC CGAGCGGATT GAGCGCGTGC TCGAAGCGTC GCGCCGTCCG GTGTCGCTCG AGACGCCGGT TGGCGAGGAC GGCGAGCATA CCCTAGGCGA TTTCTTGCAG GACAGTGAAT TGCCCACACC GGTCGAAGCG GCGTCGCAGC AACTACTCCG GCGTGATCTG GCGGCTGCGC TTGATCGCCT GAATGAGCGC GAACGCCGGA TCATTGATCT TCGCTATGGG CTGGTGGACG GGCAGCGCCG CACACTCGAG GAGGTTGGGC GGGTGCTCGG AATGACCCGC GAACGCGCGC GGCAGATCGA GGCGGAAGCG CTGCGGCGCC TGCGCGCGCC CGACGTTGGG TTGCACCTGC GCGATTACCT TGAGTAG
|
Protein sequence | MSRMMQQHPW DELDERQEIV NEDDTAPVTD IEEMTAETLE ETLEPDSTLD SIQHYLQEIG RVPLLTAAEE IELAERMERG AAAERRLASG EDLSPQLRQA LLADVAAAQE ARRHLIQANL RLVVSIAKKY VGRGLSLLDL IQEGNIGLMR AVEKFDYHKG NRFSTYATWW IRQAVTRAIA EQGRTIRLPV HMSESVGQVK RTADRLAQAL ERQPTPEEIA TALGQPTERI ERVLEASRRP VSLETPVGED GEHTLGDFLQ DSELPTPVEA ASQQLLRRDL AAALDRLNER ERRIIDLRYG LVDGQRRTLE EVGRVLGMTR ERARQIEAEA LRRLRAPDVG LHLRDYLE
|
| |