Gene Dret_1335 details


Gene Information

Locus tag: Dret_1335
Symbol:
ID: 8419164
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1560413
End bp: 1561402
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 58%
IMG OID: 645037911
Product: aldo/keto reductase
Protein accession: YP_003198201
Protein GI: 258405459
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
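
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1560413
end_bp = 1561402

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 990
print(protein_length)  # 329
```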


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.546872
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATACG TCACGCTCCC GAATTCAGAT CTCACTGTGA GCCGGGTCGC TCTGGGCACC 
TGGGCCATCG GAGGATGGAT GTGGGGCGGA ACCGATGTCG AACAATCAAT CCGGACCATT
GAAGCCGCCC TGGACCAGGG CATCACCATG ATCGACACCG CCCCGGTCTA CGGTTTCGGA
CGCTCCGAAG AAATTGTCGG CGAGGCCCTG CGCCGTTCCG GACGGCGCCA GGAGGTCGCT
CTGGCCACCA AAGTCGCCCT GGAATGGACC GAGGATGGCG CGGTGCGCCG CAATTCGACA
CCCCAGCGCA TCCGCCAGGA AGTCGAGGAT TCCCTGAAGC GGCTGCAAAC CGACGTCATT
GATCTCTACC AGATCCATTG GCCGGACAAG CTGGTTCCCT TCGAGGAAAC CGCTGCAGTT
ATGCAACAGC TCAAAGACGA AGGCAAAATC CGCGCCGTCG GAGTCAGCAA TTATAGTCCT
GAACAGATGG ACGATTTCCG GCAAAAAGCG GAATTGACCA ATTGCCAGCC GCCCTACAAT
CTCTTTGAAC GCAAGATCGA GGACGATGTC CTGCCGTATT GTCAAGAAAA TACAATCGCT
CTGGTCACCT ATGGCGCCCT CTGCCGGGGC CTTTTGAGCG GGCGCATGAG CGCGGACCGA
ACTTTCACCG GCGACGACCT GCGTAAAGTG GACCCCAAAT TCCAATCCCC CCGTTTCAGC
CAATATCTGC AAGCTGTTAC GGCCCTGGAC CAATGGGCCA AGGAAAAATA CGACAAACGG
ATCCTGCACC TGGCTGTGCG CTGGATCCTG GATCAGGGTG TCCATGTCGC CTTGTGGGGG
GGCAGACGGC CGGAACAGAT GGAACCGCTC CCGGAAATCT TCGGTTGGAC ACTCAAAGAG
CACGACAAAG AAGATATCGA GACCATTCTC CGCACTCACA TCACTGATCC GGTGGGTCCG
GAATTCATGG CCCCGCCGCA CCGGCAATAG
 
Protein sequence
MEYVTLPNSD LTVSRVALGT WAIGGWMWGG TDVEQSIRTI EAALDQGITM IDTAPVYGFG 
RSEEIVGEAL RRSGRRQEVA LATKVALEWT EDGAVRRNST PQRIRQEVED SLKRLQTDVI
DLYQIHWPDK LVPFEETAAV MQQLKDEGKI RAVGVSNYSP EQMDDFRQKA ELTNCQPPYN
LFERKIEDDV LPYCQENTIA LVTYGALCRG LLSGRMSADR TFTGDDLRKV DPKFQSPRFS
QYLQAVTALD QWAKEKYDKR ILHLAVRWIL DQGVHVALWG GRRPEQMEPL PEIFGWTLKE
HDKEDIETIL RTHITDPVGP EFMAPPHRQ
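
The gene and protein sequences above are related through translation table 11, and the GC content field is derived from the gene sequence. The sketch below shows one way to reproduce both from the space-separated blocks as printed here. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is built from the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code. Only the first 30 bases of the gene are used as input; pasting the full sequence in works the same way.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses
# the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
first_blocks = "ATGGAATACG TCACGCTCCC GAATTCAGAT"
print(translate(first_blocks))  # MEYVTLPNSD
```

The printed peptide matches the first block of the protein sequence in this record. Calling `gc_content` on the full gene sequence gives a fraction that can be compared with the GC content field.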