Gene RoseRS_0287 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_0287 
Symbol 
ID5207222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp366008 
End bp367633 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content58% 
IMG OID640593913 
ProductRpoD family RNA polymerase sigma factor 
Protein accessionYP_001274669 
Protein GI148654464 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.332062 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00728099 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAAAACG CAAACGAGTC ACTTCTGGAC GATGATCTGA TGCTGGACGA TCTGGATGAT 
GAGCTGGCTG GCGCTTCACG GGAACCCGAA GAAGAGCCGC AAACCATCAC CAGCCTGAGC
GATCTGCTGG CAATAGGCAA GCAACGCGGG TTCGTCACCG AGGGCGAGAT TGCGCAACTC
CTGGCAAATA GCGACGCCGA TAGCGACCGT CTGGCGGAGA TTCAGCAGGC GCTCCAGGCA
GCCGGCATTG CAACCCGCGA CGAAATGATC ACCGGCGGTG CGGAGATCGA TATCCCCTTC
GAGGAAGAGG GTGATTTCGA TGATCTGAAC GTTGAAGGCA TCAGTGTTAA TGATACGGTG
CGGATGTACC TGCGCGAGAT CGGGCGGGTG CCGCTGCTCA ATGCGCGCCA GGAGATCCTG
CTTGCACAGA AGATCGAGAT CGGCGAATAC CTCGAGTCGT ACCGGACGCA ACTGGCTGCT
GACTGGCGCC CGGAGCAGGT CGACGTGATC GGCGCGCAAA TGTATCAGCG GTTGCAAAAA
ACATGGCCCG TTGTTGCGGA TGCCGTTCGG CTGCTCTACG AGATTGTCGA GCAACCGTTG
CCCGATCCGT TGACTCCGGG ATGCCTGCGC GAAGTTTCCG CCATCCAGGA GCGCATGACC
TTCGAGCAGC GCCAGTCGTT CGAAAAACGG CGCAATGAAT TGATCAAACA GCACCGCATG
ACGGCGGAAC AGTTTGATCA GACGTTTGCG CAGGCGACGA TCATCTTCTC GCTCTTACCC
CATGTGGTGC AGCAACGGTT GATCGCCTTC ACCGACTGGC CCACGGTCGA GGACGTGCAG
CAGGCGTGCG CTGCGGCGCG CGATCAGCTC TGGCGCGACT GGAATCTCGC CATCGATCAG
GGGAAAACCG CACGCGAGGA TCTGACTCAG GCAAACCTGC GCCTGGTCGT ATCAGTGGCA
AAGAAGTACA TTGGTCGCGG GTTGCAACTC CTCGACCTGA TTCAGGAGGG GAACGTCGGG
CTGATCCGCG CGGTTGAAAA ATTCGACTAT CGTAAAGGGT TCAAGTTCTC GACGTATGCA
ACCTGGTGGA TCCGTCAGGC GATCACCCGC GCGATTGCCG ATCAGGCGCG CACCATCCGC
ATCCCGGTGC ATATGGTCGA GACAATCAAC CGCCTGATGC GCGAAAGTCG CCGGATGTTG
CAGGAGTTGG GGCGCGAACC GACCGATGAG GAATTGTCAC GCGCGCTGGG CATTCCGGTC
GATAAGGTGC GTTCGATCCG CAAAACATCG CTCGAACCCG TCTCGCTTGA AACACCGGTC
GGACAGGAAG AAGACAGTCA GCTCGGCGAT TTTATCGAAG ACTCGAAAGT GCTCGCTCCT
TCCGATGCCG CCAGTCATCA GATGCTACGC GAGCAGGTCG AGCAGGTGCT GAATCAACTC
ACCGAGCGTG AACGACGGGT GCTCCAGTTG CGTTTCGGTC TCGAAGATGG TCACAGTCGC
ACGCTTGAAG AAGTTGGCAA GGAGTTCGGC GTTACACGCG AGCGCATTCG CCAGATCGAG
GTAAAGGCGT TGCGGAAACT GCGTCATCCG CGGCTTGGCA AGAAACTGCG CGATTATCTT
GAGTAG
 
Protein sequence
MENANESLLD DDLMLDDLDD ELAGASREPE EEPQTITSLS DLLAIGKQRG FVTEGEIAQL 
LANSDADSDR LAEIQQALQA AGIATRDEMI TGGAEIDIPF EEEGDFDDLN VEGISVNDTV
RMYLREIGRV PLLNARQEIL LAQKIEIGEY LESYRTQLAA DWRPEQVDVI GAQMYQRLQK
TWPVVADAVR LLYEIVEQPL PDPLTPGCLR EVSAIQERMT FEQRQSFEKR RNELIKQHRM
TAEQFDQTFA QATIIFSLLP HVVQQRLIAF TDWPTVEDVQ QACAAARDQL WRDWNLAIDQ
GKTAREDLTQ ANLRLVVSVA KKYIGRGLQL LDLIQEGNVG LIRAVEKFDY RKGFKFSTYA
TWWIRQAITR AIADQARTIR IPVHMVETIN RLMRESRRML QELGREPTDE ELSRALGIPV
DKVRSIRKTS LEPVSLETPV GQEEDSQLGD FIEDSKVLAP SDAASHQMLR EQVEQVLNQL
TERERRVLQL RFGLEDGHSR TLEEVGKEFG VTRERIRQIE VKALRKLRHP RLGKKLRDYL
E