Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dfer_2462 |
Symbol | |
ID | 8226034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | + |
Start bp | 3031780 |
End bp | 3032898 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644930294 |
Product | thiazole biosynthesis protein ThiH |
Protein accession | YP_003086845 |
Protein GI | 255036224 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTTA AGGAAGAATT CGCGCGGTAT TCGTGGGACA ACGTGAAGCA GGATATTTAT TCCAAAACGG CGGCCGACGT GGAAAGAGCA CTCATAGCCC CGAAAAGGAC TTTGGAAGAT TTCAAAGCAT TGATTTCGCC CGCCGCGGCC GCTTACCTGG AACCCATGGC CAGGCTGAGC CGGCAGCTGA CGCGCAAGCG CTTCGGCAAC ACCATTCAAA TGTACGTGCC GCTGTATTTG TCCAACGAGT GTACCAATAT ATGTACCTAC TGCGGGTTCA GCCTGGATAA TAAAGTGCGG CGTCGCACGC TGACAGAGCG TGAAATACTG CAAGAAGTGG AGGTGATTAA AGGCATGGGC TACGAACACG TGCTGCTGGT GACCGGCGAG GCGAACCAGA CGGTGCACGT GGACTATTTC AAAAAGGTGT TGGCCCTGAT CCGTCCGCAT TTCGCGCAGG TATCCATGGA AGTACAGCCG CTCGACCGCG ATGAATACGA GGAACTGATC CCGCTCGGCC TGCATTCGGT GCTGGTGTAC CAGGAAACTT ATCACCAGGA AGATTACCGC AAACACCACC CGAAAGGCAA AAAAGCGAAT TTCAACTACC GCCTCGAAAC GCCCGACCGT CTCGGGCAGG CAGGGATTCA CAAAATGGGC CTCGGCGTGC TGATCGGCCT GGAAGACTGG CGTACCGACA GTTTTTTCAC TGCCTTGCAT TTGCATTATC TTGAAAAAAC CTACTGGCAA ACGCGTTACA GCCTATCCTT TCCGCGCCTG CGCCCATTCT CCGGCGGCCT CGAACCGAAA GTGGAAATGA GCGACCGCGA ACTGGTGCAG CTGATTTGCG CCTACCGCAT ATTTGACGAG GAAGTAGAAC TATCCCTCTC CACACGCGAA TCGGACCGAT TCAGGAACCA TTGCATTCAA TTGGGTGTAA CATCCATCAG CGCCGGCTCC AAAACCAACC CCGGAGGCTA CGCCGTAGAA CCGGAATCAC TCGAACAATT CGAAATTTCC GACGAACGAA GCCCCGCGGA AATAGCACAA ATGATCCGCC AAGCCGGCTA CGAGCCAGTT TGGAAGGATT GGGACGCTGG GCTAATTTTG AATGAGTGA
|
Protein sequence | MSFKEEFARY SWDNVKQDIY SKTAADVERA LIAPKRTLED FKALISPAAA AYLEPMARLS RQLTRKRFGN TIQMYVPLYL SNECTNICTY CGFSLDNKVR RRTLTEREIL QEVEVIKGMG YEHVLLVTGE ANQTVHVDYF KKVLALIRPH FAQVSMEVQP LDRDEYEELI PLGLHSVLVY QETYHQEDYR KHHPKGKKAN FNYRLETPDR LGQAGIHKMG LGVLIGLEDW RTDSFFTALH LHYLEKTYWQ TRYSLSFPRL RPFSGGLEPK VEMSDRELVQ LICAYRIFDE EVELSLSTRE SDRFRNHCIQ LGVTSISAGS KTNPGGYAVE PESLEQFEIS DERSPAEIAQ MIRQAGYEPV WKDWDAGLIL NE
|
| |