Gene BCZK4391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK4391 
SymbolthiI 
ID3026909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp4498974 
End bp4500188 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content38% 
IMG OID637548606 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_085969 
Protein GI52140860 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGACAT ATGAATATAT TTTAGTTCGT TATGGAGAGA TGACGACTAA AGGTAAGAAC 
CGTTCTAAAT TTGTAAGCAC ATTAAAAGAT AACGTAAAGT TCAAACTGAA AAAATTCCCA
AATATTAAAA TCGATGCAAC ACATGATCGT ATGTACATCC AATTAAATGG CGAAGATCAT
GAAGCGGTAT CTGAAAGATT GAAAGATGTG TTTGGTATTC ATAAGTTTAA CTTAGCGATG
AAAGTACCAT CAGAATTAGA AGACATTAAA AAAGGTGCAT TAGCAGCTTT CTTACAAGTA
AAAGGCGATG TGAAAACATT TAAAATTACT GTACACCGTT CTTATAAGCA TTTCCCAATG
AGAACGATGG AATTATTACC TGAGATTGGT GGACATATTC TAGAAAATAC AGAAGATATT
ACTGTAGATG TTCATAATCC AGATGTAAAT GTACGCGTAG AAATTCGTAG CGGTTATAGC
TACATTATGT GTGATGAGCG TATGGGAGCT GGCGGTTTAC CAGTTGGCGT TGGCGGAAAA
GTAATGGTAC TTCTTTCTGG CGGTATTGAT AGCCCAGTAG CAGCGTACTT AACGATGAAA
CGCGGCGTAT CTGTGGAAGC AGTTCACTTC CATAGCCCGC CTTTCACAAG TGAGCGCGCG
AAACAAAAAG TAATCGATTT AGCACAAGAA TTAACGAAAT ACTGTAAACG TGTAACACTT
CACCTTGTTC CATTTACAGA AGTGCAAAAA ACGATTAATA AAGAAATCCC ATCTAGCTAT
TCAATGACAG TTATGCGCCG TATGATGATG CGTATTACAG AGCGTATCGC AGAGGAGCGT
AACGCACTAG CAATCACGAC TGGTGAAAGT CTTGGACAAG TAGCAAGTCA AACGTTAGAT
AGTATGCATA CGATTAACGA AGTAACAAAC TATCCAGTTA TTCGTCCGCT TATTACGATG
GATAAATTAG AGATTATTAA AATCGCTGAA GAAATCGGCA CATATGAAAT TTCAATTCGT
CCGTACGAAG ATTGCTGTAC AGTATTCACA CCAGCAAGCC CAGCGACGAA GCCGAAGCGT
GAAAAAGCGA ATCGTTTTGA AGCGAAATAC GATTTCACAC CATTAATCGA TGAAGCTGTA
GCGAACAAAG AAACAATGGT ATTACAAACG GTAGAAGTAG TAGCGGAAGA AGAAAAATTC
GAAGAACTTT TCTAA
 
Protein sequence
MMTYEYILVR YGEMTTKGKN RSKFVSTLKD NVKFKLKKFP NIKIDATHDR MYIQLNGEDH 
EAVSERLKDV FGIHKFNLAM KVPSELEDIK KGALAAFLQV KGDVKTFKIT VHRSYKHFPM
RTMELLPEIG GHILENTEDI TVDVHNPDVN VRVEIRSGYS YIMCDERMGA GGLPVGVGGK
VMVLLSGGID SPVAAYLTMK RGVSVEAVHF HSPPFTSERA KQKVIDLAQE LTKYCKRVTL
HLVPFTEVQK TINKEIPSSY SMTVMRRMMM RITERIAEER NALAITTGES LGQVASQTLD
SMHTINEVTN YPVIRPLITM DKLEIIKIAE EIGTYEISIR PYEDCCTVFT PASPATKPKR
EKANRFEAKY DFTPLIDEAV ANKETMVLQT VEVVAEEEKF EELF