Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CHU_0245 |
Symbol | thiF |
ID | 4184996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | + |
Start bp | 295292 |
End bp | 296329 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 638070255 |
Product | thiamine biosynthesis protein (HesA/MoeB/ThiF family protein) |
Protein accession | YP_676877 |
Protein GI | 110636670 |
COG category | [H] Coenzyme transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 [COG0607] Rhodanese-related sulfurtransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.881416 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGAT ATTCCCGTCA AACCATCCTT CCCGAAGTTG GCATCGAAGG CCAGCAAAAA CTAACCAACG CATCTGTGCT TGTTGTAGGT GCTGGTGGAT TGGGTTGCCC GGTGTTGCTC TACCTGGCTG CAGCTGGTGT GGGACGGTTG GGAATTATTG ATGCGGACAA AGTGGATATC ACAAATCTGC AACGTCAGGT GTTGTATGTA ACGGAAGATG AAGGAAAGTC AAAAGCGGAA ACAGCGGCGA AACGTTTGAG TGCATTGAAT CCCGAGATTA ATATTGATGT GTATCCGGTT TGGCTTTCCA AAGAAAACGC GCTTGAAATT TTTTCTTCCT ATGATATAAT AGTTGATGGC TCGGATAATT TTGCGACACG CTATCTGGTG AGCGATGCCT GTGTGATTTT AAATAAGCCG CTTGTATTTG GTTCGATCTT TAAATTCGAA GGACAGGTGA GTGTATTTAA TTATAAAGGT GGGCCTACTT ACAGATGCCT GTTTCCGGAA CCGCCTGCTG CAGGCGAAGT GCCCAACTGC TCTGAGATTG GCGTGATCGG TGTGCTGCCG GGAATCATTG GTACGTTGCA GGCCAATGAA GTGATCAAGA TTATTCTGAA GAAAGGTGAT GTGATGAGCG GTGTGTTATA CATGTATGAT GCGTTGAGCA ACATGGTTCA GCAATTAAAA GTATTCAGAG ATCCTGTGGC AAGTGTTGTT ACTGAATTAG GCACGTACGA AGAAGTATGT GAGACATCAC CGGATATTGA TAAAAGGACC TTTGATGTGT GGAAAGAAAA AAATGTTGTT TACCAGCTGA TCGATGTGCG CGAACCGCAT GAATTTGAAA ATAAAAATAT CGGCGGAGAA TTAATCCCGA TGAATACCGT TAAAGACAAT CTGAATCGCA TACGTGAAGA CATTCCTGTA ATCGTGCACT GCCAGATGGG TGGCCGCAGC AGAAAGATTG TCGACTTTTT GTATGAGAAA GGATTTAAGA ACGTGTATAA TCTGAAGGGT GGATTGAGAG AGTTTTAA
|
Protein sequence | MSRYSRQTIL PEVGIEGQQK LTNASVLVVG AGGLGCPVLL YLAAAGVGRL GIIDADKVDI TNLQRQVLYV TEDEGKSKAE TAAKRLSALN PEINIDVYPV WLSKENALEI FSSYDIIVDG SDNFATRYLV SDACVILNKP LVFGSIFKFE GQVSVFNYKG GPTYRCLFPE PPAAGEVPNC SEIGVIGVLP GIIGTLQANE VIKIILKKGD VMSGVLYMYD ALSNMVQQLK VFRDPVASVV TELGTYEEVC ETSPDIDKRT FDVWKEKNVV YQLIDVREPH EFENKNIGGE LIPMNTVKDN LNRIREDIPV IVHCQMGGRS RKIVDFLYEK GFKNVYNLKG GLREF
|
| |