Gene Dret_0858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0858 
Symbol 
ID8418677 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1015966 
End bp1017018 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content54% 
IMG OID645037427 
Productputative RNA polymerase, sigma 70 family subunit 
Protein accessionYP_003197727 
Protein GI258404985 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02392] alternative sigma factor RpoH
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value3.47952e-11 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.310936 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACATA CTGAAAATAA AGACATCAAT TCTATGGAAC ACGAGCACGA TCACGATATC 
GAACTCGATG AGCCCGAAGC CGGTGAGACC CTTGATGATT CCACCGCTGA AGAGGCGTCG
GCCGAATCCC TTCCGGCACC TGCCGCGGTG ACAGAACACC TGCCGTCGCT CATCCAGCCG
TCGAGCAACG ATGTCGCCAT TCAGGATCCG CTCAAACTTT ACTTGCGCGA GGTCAATCGC
TTTCCGTTGC TGGAGCCGGA CGAGGAAGTG GAATTGGCCC GGCGGGTCCG TGATGAAAAC
GATCAGTCTG CAGCCTTCCG CCTGATCAGT TCCCACCTGC GTTTGGTGGT CAAGATCGCC
ATGGAGTTCC AACGGCGGTG GATGAAAAAT GTCCTTGATC TGGTCCAGGA GGGCAATGTG
GGGCTGATGA AGGCGGTCCA AAAATTCGAC CCCGAGCGGG GTATTAAATT TTCCTACTAC
GCCTCGTTCT GGATCCGGGC CTATATCCTG AAGTTCATTA TGGACAACTG GCGTATGGTC
AAATTGGGAA CCACCCAGGC CCAGCGCAAA CTGTTCTACA ATTTGAGCCG AGAAAAACAG
CGGTTGCAGG CCCAGGGTTT CGACCCCGAC GCCAGTACTC TCTCAGAGAA TCTGGATGTC
AGTGAAGAAA GTGTCGTGGA AATGACCCAG CGCCTCGGCG GACATGATCT TTCCCTGGAC
GCCCCCCTGG GCGAAGACTC CTCGAGTTCG CGCATGGATT TCCTTCCAGC CTTGGGGGCA
GGCATCGAGG AGTCGCTGGC CCAGCAGGAA ATGGGCTCTG CCCTGCGTCA GCATCTGCAG
ACGATTCTGC CGAAACTCAA CGACAAGGAA AAGGAAATCC TGGAACACCG ATTGCTGACT
GACAGCCCGG TTACCCTGCG GGAAATCGGG GAAAAATACG GAATCACCCG CGAACGCGTC
AGACAAATCG AGTCCAGACT CCTGCAAAAA CTCAAAACCC ATCTCTCCTC AGAAATCCAA
GACTTTTCCG AAGACTGGAT CGAGCATGAA TAA
 
Protein sequence
MKHTENKDIN SMEHEHDHDI ELDEPEAGET LDDSTAEEAS AESLPAPAAV TEHLPSLIQP 
SSNDVAIQDP LKLYLREVNR FPLLEPDEEV ELARRVRDEN DQSAAFRLIS SHLRLVVKIA
MEFQRRWMKN VLDLVQEGNV GLMKAVQKFD PERGIKFSYY ASFWIRAYIL KFIMDNWRMV
KLGTTQAQRK LFYNLSREKQ RLQAQGFDPD ASTLSENLDV SEESVVEMTQ RLGGHDLSLD
APLGEDSSSS RMDFLPALGA GIEESLAQQE MGSALRQHLQ TILPKLNDKE KEILEHRLLT
DSPVTLREIG EKYGITRERV RQIESRLLQK LKTHLSSEIQ DFSEDWIEHE