Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_1272 |
Symbol | |
ID | 5694107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 1520688 |
End bp | 1521755 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641263866 |
Product | radical SAM domain-containing protein |
Protein accession | YP_001529155 |
Protein GI | 158521285 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCGGC TGAAACCCGC AATCGAAAAA GCCATGGAAG GAGCCCGGCT GGACATCGAA GAGGCCCTGG CCCTTTATAC GAAAGCCGAC CTGCTTGTCC TGGGCCAGCT GGCGGTTCGG CAGCGGTGGC GGCTGCGGCC CGAGCCGGTG GCCACCTTTG TGGTGGACAG AAACATCAAC TACACCAATA TCTGCGTGTC CGGCTGCCTG TTCTGCGCCT TTCACGCGGC TCCGGGCAGC GCGGCCGGGT ATGTGCTGTC CGAGGCGGAA CTGAAAAAAA AGCTGGATGA GACCGTTATA CTGGGCGGCA CCCAGGTGCT GCTTCAGGGC GGCATGAACC CGGACCTGGA CCTGGCCTAT TACAAAGATC TGCTGAAGTT CATCAAGTCC GTATCCGGCC TCCATATTCA CGGGTTTTCC CCGCCGGAGA TCGCCTTTTT CTCAAAGCAG TGGGGGCTCT CCGTGGAGGC GGTGCTGGGG GAGCTTGTAG CCGCCGGCCT GGATTCGATT CCCGGCGGCG GGGCCGAAAT TCTGGCCGAT GAGATTCGCA ACAAAATATC GCCGGGCAAG TGCGATACTG CCACCTGGCT GGGTGTGATG GAGACGGCCC ACGCTATTGG CCTTCGGTCC TCGGCTACCA TGATGTTCGG CCATGTTGAA ACACCGCGCC ACCTGATTGA CCACCTGCTG GCGATCCGGG ATCTTCAGGA CCGCACCAGT GGTTTTACCG CCTTTATTCC CTGGAGCTTT CAGCCGCAAA ACACCCGGCT GAAAGGAACC CCGGCGTCGG CCGTGGATTA CCTGAAGGTG GTGGCCCTCT CCCGGCTGGT GCTGGACAAT GTGCCCAACA TTCAGGCGTC CTGGGTGACC CAGGGAGACA AGATCTCCCA GGTGGCCCTG GGGTTCGGCG CCAACGACAT GGGCAGCACT ATGATCGAAG AGAACGTGGT GGCTGCCGCC GGAGTCCGTT TCCGCCTGCC CAGGGAGGCC ATCATCCGCC TCATTGAAGG GGCCGGGCTT CGAGCCGCCC AGCGGGACTT CTTTTATCGG ATTCTGCAGT ACCACTGA
|
Protein sequence | MDRLKPAIEK AMEGARLDIE EALALYTKAD LLVLGQLAVR QRWRLRPEPV ATFVVDRNIN YTNICVSGCL FCAFHAAPGS AAGYVLSEAE LKKKLDETVI LGGTQVLLQG GMNPDLDLAY YKDLLKFIKS VSGLHIHGFS PPEIAFFSKQ WGLSVEAVLG ELVAAGLDSI PGGGAEILAD EIRNKISPGK CDTATWLGVM ETAHAIGLRS SATMMFGHVE TPRHLIDHLL AIRDLQDRTS GFTAFIPWSF QPQNTRLKGT PASAVDYLKV VALSRLVLDN VPNIQASWVT QGDKISQVAL GFGANDMGST MIEENVVAAA GVRFRLPREA IIRLIEGAGL RAAQRDFFYR ILQYH
|
| |