Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4494 |
Symbol | thiF |
ID | 6485828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 4369242 |
End bp | 4370000 |
Gene Length | 759 bp |
Protein Length | 252 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642739724 |
Product | thiazole biosynthesis adenylyltransferase ThiF |
Protein accession | YP_002043410 |
Protein GI | 194446256 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | [TIGR02356] thiazole biosynthesis adenylyltransferase ThiF, E. coli subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.360756 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.00000302852 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATGACC GTGACTTTAT GCGCTACAGC CGGCAAATCC TGCTCGGCGA TATCGCGATT GAAGGCCAGC AAAAGTTGCT CAATAGCCAT GTATTGATTG TCGGCTTAGG CGGATTGGGT TCGCCTGCCG CGCTGTATCT GGCGGGAGCA GGCATCGGCA CACTGACGCT GGTAGACGAT GACGATATTC ATCTGAGCAA TTTACAGCGC CAGATTCTAT TTACCACCGA TGATATCGCG CGTTCAAAAT CTCAGGTTGC CCAGCAGCGC CTGACACAGC TCAACCCGGA TATCGAACTG GTATCACTCC AGCAGCGACT AAAAGGCGAG GCACTGCGGC ATGCGGTAGC ACACGCCGAC GTAGTGCTCG ACTGTACCGA TAACATGGCG ACGCGCCAGG AGATTAACAC CGCCTGCGTG GAGCTCAACA CTCCGTTAAT TTCCGCCAGC GCCGTCGGCT TTGGCGGCCA GCTAATGGTC CTCACGCCAC CGTGGGAACA AGGCTGTTAC CGCTGCCTGT GGCCGGACGA TGTCGAGCCT GAACGCAACT GCCGTACCGC CGGTATCGTC GGACCGGTTG TTGGCGTCAT GGGCGCCTTG CAGGCACTGG AGGCAATCAA ATTACTCAGC GGTATTGAGA CGCCCAGCGG CGAGCTACGC CTGTTTGACG GTAAAACCAG CCAGTGGCGC AGCCTGGCGC TGCGTCGTGC CAGCGGCTGT CCGGTATGCG GAGGGCGACA TGCAAATTCA ATTCAATGA
|
Protein sequence | MNDRDFMRYS RQILLGDIAI EGQQKLLNSH VLIVGLGGLG SPAALYLAGA GIGTLTLVDD DDIHLSNLQR QILFTTDDIA RSKSQVAQQR LTQLNPDIEL VSLQQRLKGE ALRHAVAHAD VVLDCTDNMA TRQEINTACV ELNTPLISAS AVGFGGQLMV LTPPWEQGCY RCLWPDDVEP ERNCRTAGIV GPVVGVMGAL QALEAIKLLS GIETPSGELR LFDGKTSQWR SLALRRASGC PVCGGRHANS IQ
|
| |