Gene Dret_0483 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0483 
Symbol 
ID8418289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp588318 
End bp589343 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content63% 
IMG OID645037045 
Productdihydrouridine synthase DuS 
Protein accessionYP_003197358 
Protein GI258404616 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.322917 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGCCA CACCGCTTGC TCCTTTGGAT CAGCCACTCG CTATAGGGGA CAAGTCCCTT 
ACAAACCGAT TCCTCCTCTC GCCTATGGTC GGGGTGACCC ATGTGGCCCT GCGCCGGCTG
CTCCACGACT TTGGGGGATT CGGACTGCAT TGGACGGAAA TGTGTTCCGC TTCAGCTGTG
CTCCAGGAGG ATCCCCGCAT CTCCCCGGTC TTCCGTTTCC ACCGCGATGA ACTTGCTACC
CTGGTCTGCC AGATCATGGG CAGCGAGCCC GAGACCATGG CCGCTGCAGC ACGCCGGATA
CAAGACGAGG GCTTTTTCGG TGTCGACATC AATATGGGCT GTTGCGTGGC CGCAGTCCGT
CAACAAGGGG CTGGAGCGGC CCTGCTGCGC GATATTTCCC GGGCGGCGGC GATAGTCGAT
GCCGTGCGTC AGGCCGTGGA CATCCCCCTC TTTGTCAAGC TCCGTAGCGG CTGGACTGAC
CAGGGGCCGG TCGTCGTCCA CGCGGCCCGA GCGTGTGCCA AGGCCGGAGC CGACGCCTTG
ATCGTGCACC CCCGCCTGGC CCCGGACCGC CGCACCCGGC CGCCCCAATG GAGAGACATC
CGGGCTGTCT GTGAGGCGGT GGATCTCCCT GTTTTCGGCA ACGGCAACGT TTTCACTGCT
GACGACGCAA CAGCCATGCT CCGCCAAACT GGCTGCCAGG GCATCGCCCT GGGCCGCATG
GCCGCGGCTC GTCCCTGGAT CGCCGCCGAG TGGCTCGGCC ATTTTCATCC TGCCCCTGAA
ACCTATCCGA AAGTGGCCCA ACGCATGGTC GAGCTCCTGT GGACGTCATT TCCCGAGGGG
CAAGCCCTGC GATTGTACCG CAAATTCATG AATTATTTCG CGGCGAACTT TGCTTTCGGG
CACCGGCTGC GCAGCGACTT GACCCGCTCC GCAACCCCGG AGGATCTTTA CAAGGAGATC
GCACACCACC TGACGCCGCT GCCGCAGCTC ACACTGCGCC CCAACAGCCT GCTGTTTGCC
GCGTGA
 
Protein sequence
MSATPLAPLD QPLAIGDKSL TNRFLLSPMV GVTHVALRRL LHDFGGFGLH WTEMCSASAV 
LQEDPRISPV FRFHRDELAT LVCQIMGSEP ETMAAAARRI QDEGFFGVDI NMGCCVAAVR
QQGAGAALLR DISRAAAIVD AVRQAVDIPL FVKLRSGWTD QGPVVVHAAR ACAKAGADAL
IVHPRLAPDR RTRPPQWRDI RAVCEAVDLP VFGNGNVFTA DDATAMLRQT GCQGIALGRM
AAARPWIAAE WLGHFHPAPE TYPKVAQRMV ELLWTSFPEG QALRLYRKFM NYFAANFAFG
HRLRSDLTRS ATPEDLYKEI AHHLTPLPQL TLRPNSLLFA A