Gene Dret_1894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1894 
Symbol 
ID8419737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2174960 
End bp2176723 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content57% 
IMG OID645038480 
ProductRNA polymerase, sigma 70 subunit, RpoD subfamily 
Protein accessionYP_003198756 
Protein GI258406014 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0780433 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0147198 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAACA TCAAGGATGT CCAGCAGATT AAAACGCTCA TCGCCGAGGG CAAAAAGAAG 
GGGTTTTTGA CCTTTGATGA GCTAAACAAA GCCCTGCCCA ACGAGGTGAA CCAACCGGAG
CAGATCGAAG AGATTATCAC CATCTTCGAC CAGTTGGACA TCGCCATCGT GGACGCCCAG
ACAGGCAAAA ATCTGGGCAT GGTGGCCGGT GACTCCGGGG GAGATTCCGA GGAGGAAGAA
GAACTCGAGA TCACGGAAAC CGAAGAGGTT GAAACCACAC GCAGTTCCGA TCCCGTGCGC
ATGTATCTGC GCGAGATGGG GATTGTGCCC CTCTTGGACC GCGAAGGCGA GGTCGAGATC
GCCAAGAAGA TCGAGGACGG AGAATTGGAG GTTCTCTACG CCTTGATCGA AGTCCCTGTG
GCCGTGGAAG AACTGGTCGG GGTCGGCGAC GACCTCCAGA AAGGGACCAT CAAACTCAAG
GATGTGGTCA AGACCATCGA GGAAGACGAC CCGTCCGAAG ATGAGATGAA CCAGCGCGAG
CGGGTCATTT ACCTGCTTGA AGAGCTGCGC AAGCTCTATA AGAAACGCGT CAATATCTAT
TACAAGCTTG ATGAGTGCGC GACGCTGGAC AAGCGGGTGC GCGGCGTCCA GAACAAGATC
ATCGACTACA AACACGAAGT CGTGCAGATC CTGCGGGATA TCAAGCTGGA AAAAACGCTC
ATCGACCGGG TTATCGAGAC CCTGCACGAT TACGTGCGCC AGATGCACAA CTGCCGGCGC
GACCTTTCCG CCTATGTCCT GTCTGTGGGC AAGACCCGGG GCGAGATCGA AGAGATCTTC
GCCGAACTCG ACCGGCGCGA GCTCAATCCG GTTATCGCTG CGGACAATCT GGGCATGACC
GTGGAAGAGC TCTTTTCTTT CAAGGAGATG CTGGCCGGGA AGATGGAGAT TTTGCAGCGC
CTGGAAGAGA AGTGCTGCCA TTCCGTGACC GACCTTGAAG AGATCCTGTG GCGCATCAAC
CGCGGCAATT ACGACGCCCT GCGGGCCAAG CAGGAGCTTA TCCGCTCCAA CCTCCGTCTG
GTGGTTTCCA TCGCCAAGAA ATACACCAAC CGCGGTCTGC AATTCCTGGA TCTCATCCAG
GAAGGCAATA TCGGCCTGAT GAAAGCCGTG GACAAATTCG AGTACCAGCG CGGCTACAAG
TTTTCGACCT ACGCCACGTG GTGGATCCGG CAGGCCATCA CCCGGGCCAT TGCCGACCAG
GCGCGGACCA TCCGTATTCC GGTGCACATG ATCGAGACTA TCAACAAGCT CATCCGGACC
TCGCGCTATC TGGTCCAGGA ACTGGGGCGT GATCCTCAGC CCGAGGAAAT CGCCGAGCGC
ATGGACTATC CGGTGGAGAA GGTCAAGAAG GTCCTCAAGA TCGCCAAGGA GCCCATCTCT
CTCGAAACGC CCATCGGCGA CGAGGAGGAT TCCAGTCTGG GCGATTTTAT CGAGGACAAG
AAAGCCGTGG CCCCGGCGGA AGAGGCGGTC AATTCCAAGC TGGCCGAGCA GATCTCCACA
GTGCTTTCCG AACTCACGCC GCGCGAGGAA CAGGTCCTGC GCAAACGGTT CGGCATCGGC
GAGAAATCCG ATCACACGCT GGAAGAAGTG GGCAAGCTGT TCAACGTCAC TCGGGAACGT
ATCCGGCAGA TCGAGGCCAA GGCGCTGCGG AAACTGCGCC ACCCCGTGCG CAGCCAGAGC
CTGCGTTCCT ATTACGAATC CTAG
 
Protein sequence
MSNIKDVQQI KTLIAEGKKK GFLTFDELNK ALPNEVNQPE QIEEIITIFD QLDIAIVDAQ 
TGKNLGMVAG DSGGDSEEEE ELEITETEEV ETTRSSDPVR MYLREMGIVP LLDREGEVEI
AKKIEDGELE VLYALIEVPV AVEELVGVGD DLQKGTIKLK DVVKTIEEDD PSEDEMNQRE
RVIYLLEELR KLYKKRVNIY YKLDECATLD KRVRGVQNKI IDYKHEVVQI LRDIKLEKTL
IDRVIETLHD YVRQMHNCRR DLSAYVLSVG KTRGEIEEIF AELDRRELNP VIAADNLGMT
VEELFSFKEM LAGKMEILQR LEEKCCHSVT DLEEILWRIN RGNYDALRAK QELIRSNLRL
VVSIAKKYTN RGLQFLDLIQ EGNIGLMKAV DKFEYQRGYK FSTYATWWIR QAITRAIADQ
ARTIRIPVHM IETINKLIRT SRYLVQELGR DPQPEEIAER MDYPVEKVKK VLKIAKEPIS
LETPIGDEED SSLGDFIEDK KAVAPAEEAV NSKLAEQIST VLSELTPREE QVLRKRFGIG
EKSDHTLEEV GKLFNVTRER IRQIEAKALR KLRHPVRSQS LRSYYES