Gene BCZK0643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK0643 
SymbolthiF 
ID3024580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp748302 
End bp749321 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content38% 
IMG OID637544881 
Productthiamine/molybdopterin biosynthesis ThiF/MoeB-like protein 
Protein accessionYP_082248 
Protein GI52144580 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID[TIGR02356] thiazole biosynthesis adenylyltransferase ThiF, E. coli subfamily 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAATAATC GATATTCTCG CCAAGAATTA TTTTCTCCGA TTGGGGAAGA AGGCCAGCAA 
AAGATAAGAG AAAAGCATGT GCTTATTATC GGCGCGGGCG CACTAGGTAG TGCAAATGCA
GAAATGTTTG TAAGAGCAGG TGTTGGCACA GTAACAATTG TTGACCGTGA TTATGTCGAT
TGGAGTAATT TACAAAGGCA GCAATTGTAT GCAGAGAGTG ATGTGGAAAA TAATCTTCCG
AAGGCTGTAG CAGCAAAGAA GCGTCTAGAA GAGATTAATA GTGAAGTAAG AGTAAAAGCG
CTCGTTCAAG ATGTAACAGC TGAGGAATTA GAAGAGCTTG TTACAAACGT TAATGTAATG
ATTGATGCAA CTGATAATTT CGAAACGCGT TTCATTGTGA ATGATATAGC ACAAAAATAT
TCTATTCCAT GGATTTACGG AGCATGTGTA GGGAGTTACG GCCTTTCTTA CACAATCCTT
CCTAGTAAAA CGCCATGTTT ATCTTGTTTA TTACAATCGA TTCCGCTTGG CGGAGCGACA
TGTGATACAG CGGGGATTAT ATCGCCTGCT GTATCTCTCG TCGTTTCTCA TCAAGTAACG
GAAGCTCTTA AACTATTAGT GGAAGATTAC GAATCACTTC GAGATGGACT TGTATCGTTT
GATGTATGGA AGAATGAATA TTCATGTATG AATGTGCAAA AGCTGCGTAA GCATAATTGT
CCTTCGTGCG GAGAGAATGC ATTATATCCG TATTTAAACA AAGAAAATAC ATCGAAAACA
GCAGTTTTAT GCGGGAGAAA TACAGTTCAA ATTAGACCAC CTTATAAAGA GGAAATGGAT
TTTGAACGAT ACAAAGAGCT GCTGAATGAT CGTGTGAATG ATTTAAATGT AAATCCATAT
TTATTATCAT TTTCTGTGGA AGAAAAGAGA TTAGTTGCTT TTAAAGATGG TCGCGTACTT
GTACATGGAA CGAAAGATAT AAGTGAAGCA AAAACAGTTT ATCATCGTTA TTTTGGATAG
 
Protein sequence
MNNRYSRQEL FSPIGEEGQQ KIREKHVLII GAGALGSANA EMFVRAGVGT VTIVDRDYVD 
WSNLQRQQLY AESDVENNLP KAVAAKKRLE EINSEVRVKA LVQDVTAEEL EELVTNVNVM
IDATDNFETR FIVNDIAQKY SIPWIYGACV GSYGLSYTIL PSKTPCLSCL LQSIPLGGAT
CDTAGIISPA VSLVVSHQVT EALKLLVEDY ESLRDGLVSF DVWKNEYSCM NVQKLRKHNC
PSCGENALYP YLNKENTSKT AVLCGRNTVQ IRPPYKEEMD FERYKELLND RVNDLNVNPY
LLSFSVEEKR LVAFKDGRVL VHGTKDISEA KTVYHRYFG