Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3114 |
Symbol | |
ID | 5210083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 3912204 |
End bp | 3913253 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640596706 |
Product | RpoD family RNA polymerase sigma factor |
Protein accession | YP_001277427 |
Protein GI | 148657222 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000106711 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00595378 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCCGAA CAATGCATGA GCAGCATCCG TGGGATGAAC TTGACGAGCG CCAGGAGATC GCCGGTGAGG GCGAGATGGC GCCGATTGCA GATATTGAGG AACTTGCAGC CGAGAATCTG GAGGAGACGC TGGAGTCGGA TTCGACCCTG GATTCGATCC AGCACTACCT GCAGGAGATC GGGCGGGTTC CGTTGTTGAC AGCTGCCGAG GAGGTCGAAC TTGCCGAGCG CATGGAGCGC GGCGCTGCCG CCGCGCGGCG GCTGGCGTCG GCAGAGGATC TCAGCCCGCA GCTGCGTCAG GCGCTGCTCG ATGATGTCGC TGCCGGGCAG GAAGCGCGCC GACATTTGAT CCAGGCGAAC CTGCGTCTGG TGGTGAGCAT TGCCAAAAAG TATGTCGGGC GCGGTCTCTC GCTGCTCGAC CTGATCCAGG AAGGCAACAT CGGTCTGATG CGCGCCGTTG AGAAGTTCGA CCACCGCAAA GGCAACCGCT TCTCAACGTA TGCGACCTGG TGGATCCGTC AGGCGGTGAC ACGCGCTATC GCCGAGCAGG GGCGCACCAT TCGCCTGCCG GTGCATATGA GCGAGTCGGT CGGTCAGGTG AAGCGCGCCG CCGACCGCCT GGCGCAGGTG CTGGAGCGGC AGCCGACGCC GGAAGAGATT GCGACGGCGC TCGGTCAGCC GACTGAGCGT ATCGAGCGGG TGCTCGAGGC GTCGCGTCGT CCGGTGTCGC TGGAAATGCC GGTCGGCGAG GATGGGGAGC ATACGCTCGG TGATTTCTTG CAGGATAGCG ATCTGCCGAC GCCGGTCGAA GCAGCGTCGC ACCAGTTGCT CCGGCGTGAT CTTGCCGCAG CGCTCGACCG CCTGAATGAG CGCGAGCGAC GGATCATTGA TCTGCGCTAC GGTCTGGTGG ACGGGCAGCG CCGCACGCTC GAAGAGGTCG GGCGGGTGCT GGGGATGACC CGCGAGCGCG CGCGCCAGAT CGAGGCGGAG GCGTTGCGTC GCCTGCGCGC TCCTGACGTC GGTCTCCATC TGCGCGATTA CCTGGAGTAG
|
Protein sequence | MSRTMHEQHP WDELDERQEI AGEGEMAPIA DIEELAAENL EETLESDSTL DSIQHYLQEI GRVPLLTAAE EVELAERMER GAAAARRLAS AEDLSPQLRQ ALLDDVAAGQ EARRHLIQAN LRLVVSIAKK YVGRGLSLLD LIQEGNIGLM RAVEKFDHRK GNRFSTYATW WIRQAVTRAI AEQGRTIRLP VHMSESVGQV KRAADRLAQV LERQPTPEEI ATALGQPTER IERVLEASRR PVSLEMPVGE DGEHTLGDFL QDSDLPTPVE AASHQLLRRD LAAALDRLNE RERRIIDLRY GLVDGQRRTL EEVGRVLGMT RERARQIEAE ALRRLRAPDV GLHLRDYLE
|
| |