Gene BCB4264_A4756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCB4264_A4756 
SymbolthiI 
ID7098961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus B4264 
KingdomBacteria 
Replicon accessionNC_011725 
Strand
Start bp4608367 
End bp4609581 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content38% 
IMG OID643472265 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_002369442 
Protein GI218232220 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGAAAT ATGAATATAT TTTAGTGCGT TACGGAGAAA TGACGACAAA AGGTAAGAAC 
CGTTCTAAGT TTGTAAGCAC ATTAAAAGAT AACGTGAAGT TCAAACTGAA AAAATTCCCA
AATATTAAAA TCGATGCAAC ACATGATCGT ATGTACATCC AGTTAAATGG TGAAGATCAT
GAAGCAGTCT CTGAAAGATT GAAGGATGTA TTTGGTATTC ATAAGTTTAA CTTAGCGATG
AAAGTACCAT CAGAATTAGA AGACATTAAA GAAGGTGCAT TAGCAGCTTT CTTACAAGTA
AAAGGTGATG TGAAAACATT TAAAATTACT GTACACCGTT CTTATAAGCA TTTCCCAATG
AGAACGATGG AATTATTACC TGAGATTGGC GGACACATTC TAGAAAATAC AGAAGATATT
ACAGTAGATG TTCATAATCC AGATGTAAAT GTACGCGTAG AGATCCGCAG TGGCTATAGC
TATATCATGT GCGATGAGCG TATGGGAGCT GGCGGTTTAC CAGTTGGCGT TGGCGGAAAA
GTAATGGTAC TTCTTTCTGG TGGTATTGAT AGCCCAGTAG CAGCTTACTT AACGATGAAA
CGCGGCGTAT CTGTGGAAGC AGTTCACTTC CATAGTCCAC CTTTCACAAG TGAGCGTGCA
AAACAAAAAG TAATCGATTT AGCACAAGAG TTAACGAAGT ATTGTAAACG AGTAACGCTT
CACCTTGTTC CGTTTACAGA AGTGCAAAAA ACGATTAATA AAGAAATCCC ATCTAGCTAT
TCAATGACGG TTATGCGCCG TATGATGATG CGTATTACAG AACGTATTGC TGAGGAGCGT
AACGCACTTG CAATTACGAC TGGTGAAAGT CTTGGACAAG TAGCAAGCCA AACGTTAGAT
AGTATGCATA CGATTAACGA AGTAACAAAC TACCCAGTTA TTCGTCCGCT TATTACGATG
GATAAATTAG AGATTATTAA AATCGCTGAA GAAATCGGCA CGTATGATAT TTCAATTCGT
CCATATGAAG ATTGCTGTAC TGTATTCACA CCAGCAAGCC CAGCGACGAA GCCGAAACGT
GAAAAAGCAA ATCGTTTTGA AGCGAAATAC GATTTCACAC CATTAATCGA TGAAGCTGTA
GCGAACAAAG AAACAATGGT ATTACAAACG GTAGAAGTAG TGGCGGAAGA AGAAAAATTT
GAAGAACTTT TCTAA
 
Protein sequence
MLKYEYILVR YGEMTTKGKN RSKFVSTLKD NVKFKLKKFP NIKIDATHDR MYIQLNGEDH 
EAVSERLKDV FGIHKFNLAM KVPSELEDIK EGALAAFLQV KGDVKTFKIT VHRSYKHFPM
RTMELLPEIG GHILENTEDI TVDVHNPDVN VRVEIRSGYS YIMCDERMGA GGLPVGVGGK
VMVLLSGGID SPVAAYLTMK RGVSVEAVHF HSPPFTSERA KQKVIDLAQE LTKYCKRVTL
HLVPFTEVQK TINKEIPSSY SMTVMRRMMM RITERIAEER NALAITTGES LGQVASQTLD
SMHTINEVTN YPVIRPLITM DKLEIIKIAE EIGTYDISIR PYEDCCTVFT PASPATKPKR
EKANRFEAKY DFTPLIDEAV ANKETMVLQT VEVVAEEEKF EELF