Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A4568 |
Symbol | thiF |
ID | 6871855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 4409541 |
End bp | 4410299 |
Gene Length | 759 bp |
Protein Length | 252 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642787476 |
Product | thiazole biosynthesis adenylyltransferase ThiF |
Protein accession | YP_002218078 |
Protein GI | 198245874 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | [TIGR02356] thiazole biosynthesis adenylyltransferase ThiF, E. coli subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.00103081 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATGACC GCGATTTTAT GCGCTACAGC CGGCAAATCC TGCTCGGCGA TATCGCCATT GAAGGCCAGC AAAAGCTGCT CGCGAGTCAT GTACTGATCG TAGGTTTAGG CGGGTTAGGT TCACCCGCCG CGCTGTATCT GGCGGGAGCA GGCATTGGCA AACTGACGCT GGCAGACGAT GACGACATTC ATCTGAGCAA TTTGCAGCGC CAGATCCTGT TTACCACCGA TGATATCGCG CGTTCGAAAT CCCAGGTTGC CCGGCAGCGC CTGACGCAGC TCAACCCGGA TATCGAACTG GTCTCGCTCC AGCAGCGACT AAAAGGCGAT GCGCTCCGGC ATGCGGTCTC GCGAGCCGAC GTGGTGCTCG ACTGTACCGA TAACATGTCC ACGCGCCAGG AAATCAACGC CGCCTGCGTC GCGCTCAATA CCCCACTCAT CACCGCCAGC GCCGTCGGCT TTGGCGGCCA GTTGATGGTG CTCACGCCAC CGTGGGAACA AGGCTGTTAC CGCTGCCTGT GGCCAGATGA TGTGGAGCCG GAACGCAACT GCCGCACCGC TGGGGTGCTC GGTCCGGTGG TGGGCGTGAT GGGTACCTTG CAGGCGCTGG AGGCGATTAA ATTACTCAGC GGTATTGAAA CACCGAACGG GCAGCTACGT CTGTTTGACG GCAAAACCAG CCAGTGGCGC AGCCTCGCGC TGCGTCGCGC CAGCGGCTGT CCGGTATGCG GAGGGCAGCA TGCAAATTCA ATTCAATGA
|
Protein sequence | MNDRDFMRYS RQILLGDIAI EGQQKLLASH VLIVGLGGLG SPAALYLAGA GIGKLTLADD DDIHLSNLQR QILFTTDDIA RSKSQVARQR LTQLNPDIEL VSLQQRLKGD ALRHAVSRAD VVLDCTDNMS TRQEINAACV ALNTPLITAS AVGFGGQLMV LTPPWEQGCY RCLWPDDVEP ERNCRTAGVL GPVVGVMGTL QALEAIKLLS GIETPNGQLR LFDGKTSQWR SLALRRASGC PVCGGQHANS IQ
|
| |