Gene Information |
Locus tag | ECH74115_5460 |
Symbol | thiF |
ID | 6968444 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 5104759 |
End bp | 5105514 |
Gene Length | 756 bp |
Protein Length | 251 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643389107 |
Product | thiazole biosynthesis adenylyltransferase ThiF |
Protein accession | YP_002273508 |
Protein GI | 209398041 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | [TIGR02356] thiazole biosynthesis adenylyltransferase ThiF, E. coli subfamily |
|
|
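The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, that the gene is unspliced, and that the reported gene length includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(5104759, 5105514)
print(length, protein_length(length))  # prints: 756 251
```

Under these assumptions the computed values agree with the Gene Length (756 bp) and Protein Length (251 aa) fields.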
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.218876 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGACC GTGATTTTAT GCGTTATAGC CGCCAAATCC TGCTCGACGA TATCGCTCTT GACGGGCAGC AAAAACTGCT CGACAGCCAG GTGCTGATTA TCGGTCTTGG CGGGCTGGGT ACACCTGCTG CGCTATACCT GGCGGGCGCT GGCGTCGGGA CGCTGGTACT GGCAGATGAC GACGATGTGC ATTTAAGCAA TCTGCAACGA CAAATCCTCT TTACCACTGA AGATATCGAT CGCCCGAAAT CGCAGGTCAG CCATCAGCGA CTGACACAGT TGAATCCCGA TATTCAACTG ACAGCATTAC AACAACGGTT AACGGGTGAG GCGTTAAAAG ATGCGGTTGC ACAAGCCGAT GTGGTGCTCG ACTGTACCGA CAATATGGCG ACTCGCCAGG AGATTAATGC CGCCTGCGTG GCACTCAACA CGCCGCTTAT CACCGCCAGC GCGGTCGGAT TTGGCGGTCA GTTGATGGTA CTGACGCCGC CCTGGGGGCA GGGTTGTTAC CGCTGCCTGT GGCCAGAAAA CCAGGAGCCA GAACGCAACT GCCGCACGGC GGGCGTGGTT GGCCCGGTGG TCGGGGTTAT GGGCACTTTG CAGGCACTGG AAGCCATTAA GTTATTAAGC GGTATAGAGA CGCCTGCGGG AGAACTCCGA CTGTTCGACG GTAAGTCGAG CCAGTGGCGC AGCCTGGCGT TGCGCCGCGC CAGTGGTTGC CCGGTATGCG GAGGAAGCAA TGCAGATCCT GTTTAA
|
Protein sequence | MNDRDFMRYS RQILLDDIAL DGQQKLLDSQ VLIIGLGGLG TPAALYLAGA GVGTLVLADD DDVHLSNLQR QILFTTEDID RPKSQVSHQR LTQLNPDIQL TALQQRLTGE ALKDAVAQAD VVLDCTDNMA TRQEINAACV ALNTPLITAS AVGFGGQLMV LTPPWGQGCY RCLWPENQEP ERNCRTAGVV GPVVGVMGTL QALEAIKLLS GIETPAGELR LFDGKSSQWR SLALRRASGC PVCGGSNADP V
|
| |
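The relationship between the gene sequence, the translation table, and the protein sequence can be sketched in a few lines. This is a minimal illustration, not the database's own pipeline: it assumes the gene sequence is given 5' to 3' on the coding strand (as the leading ATG suggests), strips the spaces that group the bases into blocks of ten, and uses the codon assignments of NCBI translation table 11, which match the standard code for amino acids. The names `CODON_TABLE`, `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Table 11 uses the same amino-acid assignments as the standard code.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean(seq: str) -> str:
    """Remove whitespace used for block formatting and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record above.
prefix = "ATGAATGACC GTGATTTTAT GCGTTATAGC"
print(translate(prefix))  # prints: MNDRDFMRYS
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared against the Protein sequence and GC content fields of the record.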