Gene Rcas_0257 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0257 
Symbol 
ID5537719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp315712 
End bp317337 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content58% 
IMG OID640892421 
ProductRpoD family RNA polymerase sigma factor 
Protein accessionYP_001430408 
Protein GI156740279 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.201917 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACAAG CGAACGAGCC ACTTCTGGAA GATGATCTGC TGCTCGACGA TCTGGACGAT 
GACATGACCG GCGCTTCCCG TGAACCCGAA GAGGAGCCGC AGACGATCAC AAGTCTAAGT
GATCTGCTGG CAATCGGCAA ACAGCGCGGT TTTGTCACCG AGGGAGAAAT TGCGCAACTG
CTTGCAGGCA GCGACGCCGA TAGCGACCGT CTGGCGGAAA TTCAACAGGC GCTTCAGTCG
GCAGGCATTG CCACGCGCGA TGAGATGATT GCCGGCGGCG CCGAGATCGA TGTGCACTTC
GAGGAGGAAG GCGATCTTGA CGACTTGAGC GTCGAGGGCA TCAGCATCAA CGATACGGTG
CGCATGTATC TGCGCGAGAT TGGACGGGTG CCGCTCCTGA CCGCGCGCCA GGAAACGCTG
CTGGCGCAGA AGATCGAGAT CGGCGAATAT CTGGAGTCGT ACCGCACCCA ACTGGCGGCG
GACTGGCGCC CTGAGCAGAT CGACTCTGTT GGCGCACAAA TGTATCAGCG CCTGCAAAAA
ACCTGGCCCG TGGTTGAGGA TGCCATTCGA TTGCTCTACG CGATTGTCGA GCAACCGTTG
CCCGATCCGC TGACCCCCGG TTGCCTGCGC GAGGTCTCCG CCATTCAGGA ACGTATGACG
TTTGAGCAGC GGCAGATGTT CGACAAGCGC CGCAATGAGT TGATCAAACA GCACCGGATG
ACGGCAGAAC AGTTTGATCA GACGTTCGCT CAGGCGACGA TTATCTTTTC GCTTCTGCCT
GCCGCCGTTC AGCAACGGCT GGCGCTATCC GCCAACTGGC CCTCCGTCGA GGAAGTGCAG
CACATGTGCG CCGATGTGCG CGATCAACTC TGGCGCGATT GGAATCTTGC GATTGAGCAG
GGCAAAACAG CGCGCGAGGA TTTGACCCAG GCAAATCTCC GGTTGGTCGT GTCGGTGGCG
AAGAAGTATA TCGGGCGTGG GTTACAACTC CTCGACCTGA TTCAGGAGGG GAATGTCGGG
CTGATCCGCG CGGTCGAAAA GTTCGACTAT CGCAAAGGGT TCAAGTTCTC GACCTACGCA
ACCTGGTGGA TTCGCCAGGC GATCACGCGC GCCATTGCCG ATCAGGCGCG CACTATTCGG
ATTCCGGTGC ATATGGTCGA GACCATCAAT CGCCTGATGC GTGAAAGCCG CCGCATGCTC
CAGGAGTTGG GGCGCGAACC AACCGACGAG GAGTTGTCGC GCGCATTGGG CATCCCGGTG
GATAAAGTGC GTTCGATCCG TAAAACGTCG CTCGAACCGG TCTCGCTCGA GACGCCGGTC
GGGCAGGAGG AAGACAGTCA ACTCGGCGAT TTTATCGAAG ACTCAAAAGT GCTTGCGCCT
TCCGACGCCG CCAGCCATCA GATGCTGCGT GAGCAGGTCG AACAGGTGCT GAATCAGCTC
ACAGAACGCG AACGTCGTGT CCTCCAATTG CGGTTTGGTC TTGAAGATGG TCACAGCCGC
ACCCTCGAGG AGGTTGGCAA GGAGTTCGGC GTGACTCGTG AGCGTATCCG CCAGATTGAG
GTGAAAGCGC TGCGAAAACT GCGTCATCCT CGCCTCGGCA AGAAACTTCG CGATTATCTT
GAATAA
 
Protein sequence
MEQANEPLLE DDLLLDDLDD DMTGASREPE EEPQTITSLS DLLAIGKQRG FVTEGEIAQL 
LAGSDADSDR LAEIQQALQS AGIATRDEMI AGGAEIDVHF EEEGDLDDLS VEGISINDTV
RMYLREIGRV PLLTARQETL LAQKIEIGEY LESYRTQLAA DWRPEQIDSV GAQMYQRLQK
TWPVVEDAIR LLYAIVEQPL PDPLTPGCLR EVSAIQERMT FEQRQMFDKR RNELIKQHRM
TAEQFDQTFA QATIIFSLLP AAVQQRLALS ANWPSVEEVQ HMCADVRDQL WRDWNLAIEQ
GKTAREDLTQ ANLRLVVSVA KKYIGRGLQL LDLIQEGNVG LIRAVEKFDY RKGFKFSTYA
TWWIRQAITR AIADQARTIR IPVHMVETIN RLMRESRRML QELGREPTDE ELSRALGIPV
DKVRSIRKTS LEPVSLETPV GQEEDSQLGD FIEDSKVLAP SDAASHQMLR EQVEQVLNQL
TERERRVLQL RFGLEDGHSR TLEEVGKEFG VTRERIRQIE VKALRKLRHP RLGKKLRDYL
E