Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_0728 |
Symbol | |
ID | 8418541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 859871 |
End bp | 860989 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 645037292 |
Product | thiazole biosynthesis protein ThiH |
Protein accession | YP_003197598 |
Protein GI | 258404856 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00541403 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATTTG CCGATATCCT GGAACAATGG CCCGCGGATC GAGTAGAGGC GTTTTGCCGT ACCCGAAGTT CTGCAGATGT CCGGCGAGCC CTGCAGACAA TTCGCCTCAG AGCTGAGGAG TATTTGACCC TGCTGTCGCC CGCGGCCCAG GACCATCTGG AGGCCATGGC CCGCCGAGCC CAGGCCGAAA CCCGACGGAA CTTCGGCCGT GCCATTGTCC TGTTCACCCC ATTGTATCTC TCGAATTATT GCCAGAACCA ATGTGTTTAC TGCGGCTTCA ACGCCGCTCA ACCCATTGCC CGGCGCAAGC TCGAGGACCG GGAAGTGGAA GCCGAAGCCG AAGCCATCGC CGCCACTGGT CTGGGCCATT TGCTTCTTCT CACCGGTGAG GCCCCGCAAC TCGCCGGAGT CGAGTATCTG GAGCGGTGCC TGCGTCAGTT GACCCGGTGG TTTTCTTCGG TTGCCCTTGA AGTTTTTCCC ATGGGGCGCA CGGAATATGC CCGTTTGGTC CGGGCGGGGG CGGATGGATT GACCTTGTAT CAGGAAACCT ATGACCGGGA GCTGTATGCA GCGCTCCATC CGGCCGGACC GAAACGGGAT TTTGATTTCC GCCTCGGGGC CCCGGAGCGG GCGTGTCAGG CCGGTATGCG ATCCGTGAGC CTTGGAGCGC TTTTGGGATT GGGGGCTTGG CGGCACGACT CCTTTGCGAC CGGTTTGCAC GCCGCTTTTT TGCAACACCG GTATCCCGGG GTGGAATTGG CCGTTTCCCT GCCCCGGATG CGGCCGCATT GCGGCGGGTA TGAGCCGGCC CATCCGGTTT CGGATCGGGA ACTGGTCCAG ATCATGCTGG CCCACAGGCT TTTTCTGCCC TATGCGGGCC TTACCCTTTC CACCCGGGAG AGCGCTGCCC TGCGGGACAA TGTCCTGGAA CTGGGGGTGA CGAAATTGTC GGCAGGATCA GTGACTGCGG TCGGCGGGCA CACGGACGGC CCTGAGACCG AGGGACAGTT CGACATCGCC GACACTCGCG ATGTGGCAAC CCTGTCCAAT GCTTTGCGTG CTCGGCATTT CCAGCCGGTG TTCAAGGATT GGGAGCCGCT TCTGGAGACC GGGACGTGA
|
Protein sequence | MEFADILEQW PADRVEAFCR TRSSADVRRA LQTIRLRAEE YLTLLSPAAQ DHLEAMARRA QAETRRNFGR AIVLFTPLYL SNYCQNQCVY CGFNAAQPIA RRKLEDREVE AEAEAIAATG LGHLLLLTGE APQLAGVEYL ERCLRQLTRW FSSVALEVFP MGRTEYARLV RAGADGLTLY QETYDRELYA ALHPAGPKRD FDFRLGAPER ACQAGMRSVS LGALLGLGAW RHDSFATGLH AAFLQHRYPG VELAVSLPRM RPHCGGYEPA HPVSDRELVQ IMLAHRLFLP YAGLTLSTRE SAALRDNVLE LGVTKLSAGS VTAVGGHTDG PETEGQFDIA DTRDVATLSN ALRARHFQPV FKDWEPLLET GT
|
| |