Gene Dret_0728 details


Gene Information

Locus tag: Dret_0728
Symbol:
ID: 8418541
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 859871
End bp: 860989
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 62%
IMG OID: 645037292
Product: thiazole biosynthesis protein ThiH
Protein accession: YP_003197598
Protein GI: 258404856
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00541403
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATTTG CCGATATCCT GGAACAATGG CCCGCGGATC GAGTAGAGGC GTTTTGCCGT 
ACCCGAAGTT CTGCAGATGT CCGGCGAGCC CTGCAGACAA TTCGCCTCAG AGCTGAGGAG
TATTTGACCC TGCTGTCGCC CGCGGCCCAG GACCATCTGG AGGCCATGGC CCGCCGAGCC
CAGGCCGAAA CCCGACGGAA CTTCGGCCGT GCCATTGTCC TGTTCACCCC ATTGTATCTC
TCGAATTATT GCCAGAACCA ATGTGTTTAC TGCGGCTTCA ACGCCGCTCA ACCCATTGCC
CGGCGCAAGC TCGAGGACCG GGAAGTGGAA GCCGAAGCCG AAGCCATCGC CGCCACTGGT
CTGGGCCATT TGCTTCTTCT CACCGGTGAG GCCCCGCAAC TCGCCGGAGT CGAGTATCTG
GAGCGGTGCC TGCGTCAGTT GACCCGGTGG TTTTCTTCGG TTGCCCTTGA AGTTTTTCCC
ATGGGGCGCA CGGAATATGC CCGTTTGGTC CGGGCGGGGG CGGATGGATT GACCTTGTAT
CAGGAAACCT ATGACCGGGA GCTGTATGCA GCGCTCCATC CGGCCGGACC GAAACGGGAT
TTTGATTTCC GCCTCGGGGC CCCGGAGCGG GCGTGTCAGG CCGGTATGCG ATCCGTGAGC
CTTGGAGCGC TTTTGGGATT GGGGGCTTGG CGGCACGACT CCTTTGCGAC CGGTTTGCAC
GCCGCTTTTT TGCAACACCG GTATCCCGGG GTGGAATTGG CCGTTTCCCT GCCCCGGATG
CGGCCGCATT GCGGCGGGTA TGAGCCGGCC CATCCGGTTT CGGATCGGGA ACTGGTCCAG
ATCATGCTGG CCCACAGGCT TTTTCTGCCC TATGCGGGCC TTACCCTTTC CACCCGGGAG
AGCGCTGCCC TGCGGGACAA TGTCCTGGAA CTGGGGGTGA CGAAATTGTC GGCAGGATCA
GTGACTGCGG TCGGCGGGCA CACGGACGGC CCTGAGACCG AGGGACAGTT CGACATCGCC
GACACTCGCG ATGTGGCAAC CCTGTCCAAT GCTTTGCGTG CTCGGCATTT CCAGCCGGTG
TTCAAGGATT GGGAGCCGCT TCTGGAGACC GGGACGTGA
 
Protein sequence
MEFADILEQW PADRVEAFCR TRSSADVRRA LQTIRLRAEE YLTLLSPAAQ DHLEAMARRA 
QAETRRNFGR AIVLFTPLYL SNYCQNQCVY CGFNAAQPIA RRKLEDREVE AEAEAIAATG
LGHLLLLTGE APQLAGVEYL ERCLRQLTRW FSSVALEVFP MGRTEYARLV RAGADGLTLY
QETYDRELYA ALHPAGPKRD FDFRLGAPER ACQAGMRSVS LGALLGLGAW RHDSFATGLH
AAFLQHRYPG VELAVSLPRM RPHCGGYEPA HPVSDRELVQ IMLAHRLFLP YAGLTLSTRE
SAALRDNVLE LGVTKLSAGS VTAVGGHTDG PETEGQFDIA DTRDVATLSN ALRARHFQPV
FKDWEPLLET GT
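
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how such a listing could be processed, the code below strips the spacing, computes the GC fraction, and translates the DNA into protein. It builds the codon table in the standard NCBI ordering, on the understanding that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. Alternative start codons are not handled here, which is harmless for this gene because it begins with ATG. All function names are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
dna = clean_sequence("ATGGAATTTG CCGATATCCT GGAACAATGG")
print(translate(dna))  # MEFADILEQW
print(gc_content(dna))
```

The translated fragment matches the first block of the protein sequence. Running the same functions on the full gene sequence would allow the GC content and Protein Length fields to be checked against the record.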