Gene Dret_1079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1079 
Symbol 
ID8418904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1270692 
End bp1271678 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content63% 
IMG OID645037651 
Productdihydrouridine synthase DuS 
Protein accessionYP_003197945 
Protein GI258405203 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.224362 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.224142 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAACG TTTCTGCGAC GGGGCCCTCC CCGCTCCCCT TCGGGCCCCA GGCTCCGTGG 
CTGGCGCCGT TGGCCGGATT CACCGACCTG CCGTTTCGGC TCCTGTGCCG TGAAAACGGT
GCTCGTGTGG CCCACACAGA AATGATCAGC GTCAAAGGGC TGATATACAA CAGTCAGGGC
ACCTGGGACC TCCTGGCTAC CGCCCCCGCC GACACGCCCC TGGTGGTCCA ATTGTTCGGT
GCAGATCCCG ACTGTTTTGC GCAGCCGGTG CGCTGGCTCA CTGAACGGGG ATTTCACTGG
ATCGACCTCA ATGCGGGATG TCCGGTGCGC AAGGTGATCA AGACCGGGGC CGGCGCCGCG
CTTATGGAAG ACCCGCAGCG CCTTGTACGC ATCATGCAGA CCATTGGCAG GGCCGCCCCG
GTGCAGGCCG GGGTCAAACT CCGCCTCCCA GCGGACGGTA GCACAGACGG GTTACTCCGG
TTACGCGACA CTCTCGCACG TGCCGGGGTC AGTTGGATCA CCTTGCACCC GCGCACGGCC
AGACAGGGAT ATGGAGGATT GGCGCAGTGG AGTGCCTTGT CGCGAATGGC CGAGTCCAGC
CCGGTACCCA TCGTGGCCAG CGGCGATCTG TGGAACGCGC AAGCGGCCCG GCGATGCTTT
GAGCAAACCG GTGTGGATGG GATCATGTTC GCCCGTGGCG CCCTGCACAA TCCCCGGATC
TTCAAAGCTG ACCTCGCGAC TGGAGCCGAG GAAGATCCCT GTACCGACAC GGCTTCAATA
GCCGCTCTGG TTCGACGACA CGGCCAATTG TGCCGCCGCT ATGATCCCAG CCGGAGCATG
CTGCTGAAGA TGCGCTCATT TATCCCGCGC TACGTCAAGG GATTTCCCGG CGCCAAACAG
GCCCGGAAGG GAATTATCGC CTGCCAGGAT TGGGAGGCCT TTGAACAATA TGCCGATCAA
CTTGAAGAGG CCTTGGCCGG ACAATGA
 
Protein sequence
MTNVSATGPS PLPFGPQAPW LAPLAGFTDL PFRLLCRENG ARVAHTEMIS VKGLIYNSQG 
TWDLLATAPA DTPLVVQLFG ADPDCFAQPV RWLTERGFHW IDLNAGCPVR KVIKTGAGAA
LMEDPQRLVR IMQTIGRAAP VQAGVKLRLP ADGSTDGLLR LRDTLARAGV SWITLHPRTA
RQGYGGLAQW SALSRMAESS PVPIVASGDL WNAQAARRCF EQTGVDGIMF ARGALHNPRI
FKADLATGAE EDPCTDTASI AALVRRHGQL CRRYDPSRSM LLKMRSFIPR YVKGFPGAKQ
ARKGIIACQD WEAFEQYADQ LEEALAGQ