Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0418 |
Symbol | |
ID | 5537880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 530441 |
End bp | 531658 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640892580 |
Product | RpoD family RNA polymerase sigma factor |
Protein accession | YP_001430567 |
Protein GI | 156740438 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.36258 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGCGACT TTTGTGCTAT CTTACCATAT GCACATTATA ATAGGCCGTT ATGCACAGAA GGCTGGTGGA ATGTGAAAGA ACCCCTCGAC TCCTTCCTGG CCACGGCCCA CGAGTCGCAG ACCTCTCCGC GCAATCACGT TCGTGTCCAT CGTCCTGTTG CCGAACCGGC GTCCGAGACT GTCGAACGAA TACTTGAGCG TACTGTCGGC GATCACGATC TCTTCGATCA CCACGCCGAC CCGTGTCATG CCCTCCACGA GCAGGACGAC GACTCGCTGG ATCACGATCT CGACGCCGAT GTTGACGGCA TAGGAGTCGA TGATCCGGTC CGGGTCTACC TCCGTGAGAT CGGACGGGTC AACCTGTTGA CTGCACAGGA AGAGATCATG CTGGCGCAAC AGGTCGAACG CGGCGAGCAG GCGAACGAAC GGTTGCAGAA TGGCGATTAC ACTCCGGTTG AACGCCTCCA ACTTCACCGC TGGGTTCAGG AAGGTCAGGC GGCGCGCGAG CGCCTCATCC AGGCGAACCT GCGCCTGGTC GTATCCATTG CCAAAAAGTA CCTTGGGCGC GGTATGTCCC TGCTCGACCT GATCCAGGAG GGCAACATCG GTTTAATGCG CGCCACCGAA AAGTTCGATT ATCGCAAAGG GTACAAGTTT TCGACGTATG CCACGTGGTG GATCCGCCAG GCGATCACGC GCGTGATCGC CGATCAGAGT CGCACGATCC GCCTGCCGGT GCACGTTGGC GAAACGATCA ATCGGGTGAT GCGCACCAGC AACCGCATCC AGCAGACGAC CGGGCGCGAC CCGACGCCGG ACGAAATCGC GCTTGAACTC GGCATTCCGG TTGAGAAGGT GCGGCGGGTG CTGGAAGCCG CGCGCCAGAC GATCTCGCTC GAAACTCCGA TTGGCCCAGA AGGGGATTCG GTGCTGGCGG ATTTCATCGA GGATGGCAAG GGCGCGACGC CGATGGAAAG CGCATCGAGC CATATTCTGC GCGAACAGAT CGACAGTGCG CTCGAGAAGT TGCCCGAACG CGAACGCCGC ATCATTCAGT TGCGCTATGG GTTGTACGAT GGGCACTACC GCACTCTGGA AGAGGTCGGG CGCGAGTTTG GCATCACTCG CGAGCGCATT CGTCAGATCG AGGCGCGTGT GCTGCGCAAG TTGCGCCATC CGCACTATGG GCGCGGTTTG CGTGGTTATC TCGAATAA
|
Protein sequence | MCDFCAILPY AHYNRPLCTE GWWNVKEPLD SFLATAHESQ TSPRNHVRVH RPVAEPASET VERILERTVG DHDLFDHHAD PCHALHEQDD DSLDHDLDAD VDGIGVDDPV RVYLREIGRV NLLTAQEEIM LAQQVERGEQ ANERLQNGDY TPVERLQLHR WVQEGQAARE RLIQANLRLV VSIAKKYLGR GMSLLDLIQE GNIGLMRATE KFDYRKGYKF STYATWWIRQ AITRVIADQS RTIRLPVHVG ETINRVMRTS NRIQQTTGRD PTPDEIALEL GIPVEKVRRV LEAARQTISL ETPIGPEGDS VLADFIEDGK GATPMESASS HILREQIDSA LEKLPERERR IIQLRYGLYD GHYRTLEEVG REFGITRERI RQIEARVLRK LRHPHYGRGL RGYLE
|
| |