Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCZK0643 |
Symbol | thiF |
ID | 3024580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_006274 |
Strand | + |
Start bp | 748302 |
End bp | 749321 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637544881 |
Product | thiamine/molybdopterin biosynthesis ThiF/MoeB-like protein |
Protein accession | YP_082248 |
Protein GI | 52144580 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | [TIGR02356] thiazole biosynthesis adenylyltransferase ThiF, E. coli subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAATAATC GATATTCTCG CCAAGAATTA TTTTCTCCGA TTGGGGAAGA AGGCCAGCAA AAGATAAGAG AAAAGCATGT GCTTATTATC GGCGCGGGCG CACTAGGTAG TGCAAATGCA GAAATGTTTG TAAGAGCAGG TGTTGGCACA GTAACAATTG TTGACCGTGA TTATGTCGAT TGGAGTAATT TACAAAGGCA GCAATTGTAT GCAGAGAGTG ATGTGGAAAA TAATCTTCCG AAGGCTGTAG CAGCAAAGAA GCGTCTAGAA GAGATTAATA GTGAAGTAAG AGTAAAAGCG CTCGTTCAAG ATGTAACAGC TGAGGAATTA GAAGAGCTTG TTACAAACGT TAATGTAATG ATTGATGCAA CTGATAATTT CGAAACGCGT TTCATTGTGA ATGATATAGC ACAAAAATAT TCTATTCCAT GGATTTACGG AGCATGTGTA GGGAGTTACG GCCTTTCTTA CACAATCCTT CCTAGTAAAA CGCCATGTTT ATCTTGTTTA TTACAATCGA TTCCGCTTGG CGGAGCGACA TGTGATACAG CGGGGATTAT ATCGCCTGCT GTATCTCTCG TCGTTTCTCA TCAAGTAACG GAAGCTCTTA AACTATTAGT GGAAGATTAC GAATCACTTC GAGATGGACT TGTATCGTTT GATGTATGGA AGAATGAATA TTCATGTATG AATGTGCAAA AGCTGCGTAA GCATAATTGT CCTTCGTGCG GAGAGAATGC ATTATATCCG TATTTAAACA AAGAAAATAC ATCGAAAACA GCAGTTTTAT GCGGGAGAAA TACAGTTCAA ATTAGACCAC CTTATAAAGA GGAAATGGAT TTTGAACGAT ACAAAGAGCT GCTGAATGAT CGTGTGAATG ATTTAAATGT AAATCCATAT TTATTATCAT TTTCTGTGGA AGAAAAGAGA TTAGTTGCTT TTAAAGATGG TCGCGTACTT GTACATGGAA CGAAAGATAT AAGTGAAGCA AAAACAGTTT ATCATCGTTA TTTTGGATAG
|
Protein sequence | MNNRYSRQEL FSPIGEEGQQ KIREKHVLII GAGALGSANA EMFVRAGVGT VTIVDRDYVD WSNLQRQQLY AESDVENNLP KAVAAKKRLE EINSEVRVKA LVQDVTAEEL EELVTNVNVM IDATDNFETR FIVNDIAQKY SIPWIYGACV GSYGLSYTIL PSKTPCLSCL LQSIPLGGAT CDTAGIISPA VSLVVSHQVT EALKLLVEDY ESLRDGLVSF DVWKNEYSCM NVQKLRKHNC PSCGENALYP YLNKENTSKT AVLCGRNTVQ IRPPYKEEMD FERYKELLND RVNDLNVNPY LLSFSVEEKR LVAFKDGRVL VHGTKDISEA KTVYHRYFG
|
| |