Gene Information |
Locus tag | SeAg_B4405 |
Symbol | |
ID | 6795773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 4297294 |
End bp | 4298052 |
Gene Length | 759 bp |
Protein Length | 252 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642778500 |
Product | adenylyltransferase ThiF |
Protein accession | YP_002149070 |
Protein GI | 197251642 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | [TIGR02356] thiazole biosynthesis adenylyltransferase ThiF, E. coli subfamily |
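The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
gene_len = gene_length_bp(4297294, 4298052)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)
```

Because the gene lies on the minus strand, the start and end values are positions on the replicon, while the gene sequence itself is reported in the 5' to 3' direction of the coding strand.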
| |
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGACC GTGACTTTAT GCGCTACAGC CGCCAGATTC TGCTCGGCGA TATCGCCATT GAAGGCCAGC AACAGCTGCT CAATAGCCAT GTATTGATTG TCGGCTTAGG CGGATTGGGT TCGCCTGCCG CGCTGTATCT GGCGGGAGCA GGCATCGGCA CACTGACGCT GGCGGATGAT GACGATGTTC ACTTGAGCAA TCTCCAGCGG CAAATTCTGT TCACCACCGA CGATATCGCC CACCCGAAGG CGCAGGCGGC AAAGCTACGG CTGGCGCAGC TCAACCCCGG TAGCAAGCTG ATCGTATTGC AACAGCGTCT GACTGGCGAT GTGCTTAAAA ACGCGGTAGC ACGCGTCGAC GTAGTGCTCG ACTGTACCGA CAACATGGCC ACGCGCCAGG AAATTAACGC CGCCTGCGTG GCGCTCAACA CTCCGTTAAT TTCCGCCAGT GCCGTCGGCT TTGGCGGCCA GCTAATGGTC CTCACACCAC CGTGGGAACA AGGCTGTTAC CGCTGCCTGT GGCCGGACGA TGTCGAGCCT GAACGCAACT GCCGTACCGC CGGTATCGTC GGACCGGTTG TTGGCGTGAT GGGCACTTTG CAGGCGCTGG AGGCAATCAA ATTACTCAGC GGGATGGAAA CGCCGAGTGG CGAGCTACGC CTGTTTGACG GTAAAACCAG CCAGTGGCGC AGCCTGGCGC TGCGTCGTGC CAGCGGCTGT CAGGTCTGCG GAGGGCAACA TGCAGATTCA GTTCAATGA
Protein sequence | MNDRDFMRYS RQILLGDIAI EGQQQLLNSH VLIVGLGGLG SPAALYLAGA GIGTLTLADD DDVHLSNLQR QILFTTDDIA HPKAQAAKLR LAQLNPGSKL IVLQQRLTGD VLKNAVARVD VVLDCTDNMA TRQEINAACV ALNTPLISAS AVGFGGQLMV LTPPWEQGCY RCLWPDDVEP ERNCRTAGIV GPVVGVMGTL QALEAIKLLS GMETPSGELR LFDGKTSQWR SLALRRASGC QVCGGQHADS VQ
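The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative only: `translate` is a hypothetical helper, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which codons may act as start codons). Only the first 30 and last 9 bases of the record are used, to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups and the final 9 bases of the gene sequence above.
head = translate("ATGAATGACC GTGACTTTAT GCGCTACAGC")
tail = translate("GTTCAATGA")
print(head, tail)
```

Passing the full gene sequence to the same function would be expected to give the complete protein sequence listed in the record.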