Gene Dret_1446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1446 
Symbol 
ID8419275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1676959 
End bp1678110 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content56% 
IMG OID645038021 
Productaldo/keto reductase 
Protein accessionYP_003198311 
Protein GI258405569 
COG category[R] General function prediction only 
COG ID[COG1453] Predicted oxidoreductases of the aldo/keto reductase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00469922 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.148441 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATACA GAGTCATGGG ACGCACCGGC GAACGCGTAG CCGCTTTAGG TCTCGGATGT 
ATGCGTTTTC CGGTCGTTGA TGGTGACGAC GGCTGCATTG ACGAGCCGCG GGCGACCGTG
TTGTTGCGCG AAGCCATTGA CGCTGGAGTG AATTATCTGG ACACGGCATA CCCCTACCAC
AAGGGGGCCA GCGAACCTTT TGTCGGTCGC GCTCTTCAGG GAGGCTACCG GGACAAGGTC
CATCTAGCGA CGAAATTGCC CTCCTGGGCC ATTGAAAGCG CCGAGGATTT TGACCGCTAC
CTGGATGAAC AATTACAACG TCTCCAGACC GGGCACATCG ACTTTTATCT TTTGCACGCC
TTGAAAGGGG AGTGGTGGCG GAAACTGCGT GATCTGGGCG TTCTGTCTTT TCTTGACCGG
GCCGTTGCCG ATGGCCGGAT TAAGTACGTC GGGTTTTCCT TTCATGATGA GTGGGCGCAG
TTTAAGGAGA TAGTCGACGC CTACGAATGG GATTTTTGTC AGATCCAGTA TAACTACATG
GACGAAGATA TTCAGGCCGG CAGTAAGGGT CTTTATTACG CCGCTAACAA GGGACTGGGC
GTTGTGGTCA TGGAGCCGTT GCGCGGCGGG AGTCTGGCCT CGACTGTACC AGAGCCGGTC
CAATCTATTT GGGATGAGGC CGAGCCGAAA CGGACACCGG CGGAATGGGC TTTGCGCTGG
GTCTGGGACC ATCCTGAAGT TTCGGTGGTC TTAAGCGGTA TGAACAGCCG GGCGCAGCTC
CACGAGAATT GCCGGGTCGC CGACGAAGCT ACGCCCGGCA GCTTGTCGAC CGACGATTAT
GAGCGCATCG GCCGTGTTCG ACAGATCTAC AGGGAACGCA TCCAGATCCC GTGCACGAGC
TGCGGTTATT GTCTGCCCTG TCCGAGCGGG GTGAATATTC CGCGGATCTT TTCGATCATG
AACGACAGGT TCATCTACGA CGCCACCCAT TGGTCGCAGG TCATGTATAA TGTGGCGACG
AACAGCGATG AAAACGCGGC CAATTGCGTT CAATGTGGGG CCTGTGAAGA GGTGTGCCCA
CAGCAGATAC CGATTATGGC CAAATTGCAG GAGTGTCACG AAACATTGGC ACAGGCGGAG
GAATCGGACT GA
 
Protein sequence
MQYRVMGRTG ERVAALGLGC MRFPVVDGDD GCIDEPRATV LLREAIDAGV NYLDTAYPYH 
KGASEPFVGR ALQGGYRDKV HLATKLPSWA IESAEDFDRY LDEQLQRLQT GHIDFYLLHA
LKGEWWRKLR DLGVLSFLDR AVADGRIKYV GFSFHDEWAQ FKEIVDAYEW DFCQIQYNYM
DEDIQAGSKG LYYAANKGLG VVVMEPLRGG SLASTVPEPV QSIWDEAEPK RTPAEWALRW
VWDHPEVSVV LSGMNSRAQL HENCRVADEA TPGSLSTDDY ERIGRVRQIY RERIQIPCTS
CGYCLPCPSG VNIPRIFSIM NDRFIYDATH WSQVMYNVAT NSDENAANCV QCGACEEVCP
QQIPIMAKLQ ECHETLAQAE ESD