Gene Dret_0149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0149 
Symbol 
ID8417953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp193688 
End bp194764 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content60% 
IMG OID645036714 
ProductRadical SAM domain protein 
Protein accessionYP_003197029 
Protein GI258404287 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00717785 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.324103 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATA TACTCGACGC GCTACGTGAG AGAATTGAAC ATGGCGGGCG GTTGTCTCGC 
AGGGAGGCCG TGGAACTGGT CCAAGAAGCG ACGGTCCACG ACCTGGGCGA ACTCGGACTT
CTGGCCCGCG AACGACGCCA CGGCCGTCAG GCCTATTATG TTTACAACCA GCATCTGAAT
TATACGAATA TTTGTGAAAA TCGGTGCCGT TTTTGCGCCT ACAGCAAGCG GCGCGGTGAA
AAGGGCGGTT TCACCTATTC TGCTGCCCAG GCCCGTGCCC GGCTTGAAGA ACGCCAGGAC
GCCCCGATCC GGGAGGTGCA CATCGTCGGC GGGCTCAACC CGGCCCTGCC CTACGAGTAT
TATCTGGAGC TGATCCGGAC CGTGAAGCAG GCCCGGCCCG GTGCTCGGGT CAAAGCATTT
ACCGCGGTGG AAATCGCTTT TTTGGCCGAT ACGTACGGCA AATCGCAGAC CACCGTCCTG
GAAGAATTGA TGGCCGCCGG TCTGGACGCC CTGCCCGGGG GTGGGGCTGA GGTTTTTGAT
CCGCAACTGC GGCAGAAATT GTGCCCGGAA AAGGTCTCTG GACAACGGTG GCTGGATATC
CACCGCATCG CCCACGGGCT CGGTTTGCCC ACGAACGCGA CCATGCTCTT TGGGCATATT
GAGGGCTGGG AGGAGCGTTT GGACCATCTC GAGGCGCTGC GCGAGCTCCA GGATGAAACT
GGGGGCTTTC TGTGTTTCAT CCCGCTTCCC TATCAGCCAA AGAATAACCG CCTCGGGGGC
GTGGGGCCGG ATGGACAGGA CTATTTGCGC ATGATCGCCC TGTCGCGCCT CTTTTTGGAC
AATGTGTCGC ATCTCAAGGC GTATTGGGTC ATGGCCGGCA TCAAACCAGC CCAATTGGCT
TTGTGGGCGG GGGCGGACGA TTTTGACGGC ACCCTGGTCG AAGAACGCAT CGGTCACGCC
GCTGGAGCAG AAGCCCCGGC CGGCATGACG GTGCCGCAAC TCGAGCAGGC CATTGCCGCC
GCCGGGTTCT CGGCAGTGGA GCGGGATACC TTTTTTCAGC CAGTGGCGAC GGCGTAA
 
Protein sequence
MSDILDALRE RIEHGGRLSR REAVELVQEA TVHDLGELGL LARERRHGRQ AYYVYNQHLN 
YTNICENRCR FCAYSKRRGE KGGFTYSAAQ ARARLEERQD APIREVHIVG GLNPALPYEY
YLELIRTVKQ ARPGARVKAF TAVEIAFLAD TYGKSQTTVL EELMAAGLDA LPGGGAEVFD
PQLRQKLCPE KVSGQRWLDI HRIAHGLGLP TNATMLFGHI EGWEERLDHL EALRELQDET
GGFLCFIPLP YQPKNNRLGG VGPDGQDYLR MIALSRLFLD NVSHLKAYWV MAGIKPAQLA
LWAGADDFDG TLVEERIGHA AGAEAPAGMT VPQLEQAIAA AGFSAVERDT FFQPVATA