Gene Information |
Locus tag | Dret_0149 |
Symbol | |
ID | 8417953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 193688 |
End bp | 194764 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645036714 |
Product | Radical SAM domain protein |
Protein accession | YP_003197029 |
Protein GI | 258404287 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
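The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information record.
start_bp = 193688
end_bp = 194764
gene_length_bp = 1077
protein_length_aa = 358


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)               # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under these assumptions the fields are mutually consistent: 194764 - 193688 + 1 = 1077 bp, and 1077 / 3 - 1 = 358 aa.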
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00717785 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.324103 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGATA TACTCGACGC GCTACGTGAG AGAATTGAAC ATGGCGGGCG GTTGTCTCGC AGGGAGGCCG TGGAACTGGT CCAAGAAGCG ACGGTCCACG ACCTGGGCGA ACTCGGACTT CTGGCCCGCG AACGACGCCA CGGCCGTCAG GCCTATTATG TTTACAACCA GCATCTGAAT TATACGAATA TTTGTGAAAA TCGGTGCCGT TTTTGCGCCT ACAGCAAGCG GCGCGGTGAA AAGGGCGGTT TCACCTATTC TGCTGCCCAG GCCCGTGCCC GGCTTGAAGA ACGCCAGGAC GCCCCGATCC GGGAGGTGCA CATCGTCGGC GGGCTCAACC CGGCCCTGCC CTACGAGTAT TATCTGGAGC TGATCCGGAC CGTGAAGCAG GCCCGGCCCG GTGCTCGGGT CAAAGCATTT ACCGCGGTGG AAATCGCTTT TTTGGCCGAT ACGTACGGCA AATCGCAGAC CACCGTCCTG GAAGAATTGA TGGCCGCCGG TCTGGACGCC CTGCCCGGGG GTGGGGCTGA GGTTTTTGAT CCGCAACTGC GGCAGAAATT GTGCCCGGAA AAGGTCTCTG GACAACGGTG GCTGGATATC CACCGCATCG CCCACGGGCT CGGTTTGCCC ACGAACGCGA CCATGCTCTT TGGGCATATT GAGGGCTGGG AGGAGCGTTT GGACCATCTC GAGGCGCTGC GCGAGCTCCA GGATGAAACT GGGGGCTTTC TGTGTTTCAT CCCGCTTCCC TATCAGCCAA AGAATAACCG CCTCGGGGGC GTGGGGCCGG ATGGACAGGA CTATTTGCGC ATGATCGCCC TGTCGCGCCT CTTTTTGGAC AATGTGTCGC ATCTCAAGGC GTATTGGGTC ATGGCCGGCA TCAAACCAGC CCAATTGGCT TTGTGGGCGG GGGCGGACGA TTTTGACGGC ACCCTGGTCG AAGAACGCAT CGGTCACGCC GCTGGAGCAG AAGCCCCGGC CGGCATGACG GTGCCGCAAC TCGAGCAGGC CATTGCCGCC GCCGGGTTCT CGGCAGTGGA GCGGGATACC TTTTTTCAGC CAGTGGCGAC GGCGTAA
|
Protein sequence | MSDILDALRE RIEHGGRLSR REAVELVQEA TVHDLGELGL LARERRHGRQ AYYVYNQHLN YTNICENRCR FCAYSKRRGE KGGFTYSAAQ ARARLEERQD APIREVHIVG GLNPALPYEY YLELIRTVKQ ARPGARVKAF TAVEIAFLAD TYGKSQTTVL EELMAAGLDA LPGGGAEVFD PQLRQKLCPE KVSGQRWLDI HRIAHGLGLP TNATMLFGHI EGWEERLDHL EALRELQDET GGFLCFIPLP YQPKNNRLGG VGPDGQDYLR MIALSRLFLD NVSHLKAYWV MAGIKPAQLA LWAGADDFDG TLVEERIGHA AGAEAPAGMT VPQLEQAIAA AGFSAVERDT FFQPVATA
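The relationship between the gene sequence, the GC content field, and the protein sequence can be illustrated with a minimal sketch. It assumes the sequence is supplied as text in 10-character blocks as shown above, and it uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base in frame 0, stopping at the first stop codon.

    Alternative start codons (allowed by table 11) are not special-cased;
    this gene starts with ATG, so that does not matter here.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
prefix = "ATGAGTGATA TACTCGACGC GCTACGTGAG"
print(translate(prefix))  # MSDILDALRE
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in the record, and `gc_content` on the full sequence can be compared against the reported GC content.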