Gene BCE_4784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCE_4784 
SymbolthiI 
ID2748263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus ATCC 10987 
KingdomBacteria 
Replicon accessionNC_003909 
Strand
Start bp4427348 
End bp4428562 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content38% 
IMG OID637281583 
Productthiamine biosynthesis protein ThiI 
Protein accessionNP_981077 
Protein GI42783830 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGACAT ATGAATATAT TTTAGTTCGT TATGGAGAGA TGACGACTAA AGGTAAGAAC 
CGTTCTAAAT TTGTAAGCAC ATTAAAAGAT AACGTGAAGT TTAAACTGAA AAAATTCCCA
AATATTAAAA TCGATGCAAC ACATGACCGT ATGTATATCC AATTAAATGG CGAAGATCAT
GAAGCGGTAT CTGAAAGATT AAAAGATGTA TTTGGTATTC ATAAGTTTAA CTTAGCGATG
AAAGTACCAT CAGAATTAGA AGATATTAAA AAAGGTGCAT TAGCAGCTTT CTTACAAGTA
AAAGGCGATG TGAAAACATT TAAAATTACT GTACACCGTT CTTATAAGCA TTTCCCAATG
AGAACAATGG AATTATTACC TGAGATTGGT GGACACATTC TAGAAAATAC AGAAGATATT
ACTGTAGATG TTCATAATCC AGATGTAAAT GTACGTGTAG AAATTCGCAG TGGTTATAGC
TATATTATGT GCGATGAGCG TATGGGAGCT GGCGGTTTAC CAGTTGGCGT TGGCGGAAAA
GTAATGGTAC TTCTTTCTGG TGGTATCGAT AGCCCAGTAG CAGCGTACTT AACGATGAAA
CGCGGCGTAT CTGTGGAAGC AGTTCACTTC CATAGCCCAC CTTTCACAAG TGAGCGCGCG
AAACAAAAAG TAATCGATTT AGCACAAGAG TTAACGAAAT ACTGTAAACG TGTAACGCTT
CACCTTGTTC CATTTACAGA AGTGCAAAAA ACAATTAATA AAGAAATCCC ATCTAGCTAT
TCAATGACAG TTATGCGCCG TATGATGATG CGTATTACAG AGCGTATCGC AGAGGAGCGT
AACGCGCTAG CAATCACGAC TGGTGAAAGT CTTGGACAAG TAGCAAGTCA AACATTAGAT
AGTATGCATA CGATTAACGA AGTAACGAAC TATCCAGTTA TTCGTCCGCT TATTACGATG
GATAAATTAG AGATTATTAA AATCGCTGAA GAAATCGGCA CATATGATAT TTCGATTCGT
CCGTACGAAG ATTGCTGTAC AGTATTCACA CCAGCAAGCC CAGCGACGAA GCCGAAGCGT
GAAAAAGCGA ATCGTTTTGA AGCGAAATAC GATTTCACAA CATTAATCGA TGAAGCTGTA
GCAAACAAAG AAACAGTGGT ATTGCAAACG GTAGAAGTAG TGGCGGAAGA AGAAAAATTC
GAAGAACTTT TCTAA
 
Protein sequence
MMTYEYILVR YGEMTTKGKN RSKFVSTLKD NVKFKLKKFP NIKIDATHDR MYIQLNGEDH 
EAVSERLKDV FGIHKFNLAM KVPSELEDIK KGALAAFLQV KGDVKTFKIT VHRSYKHFPM
RTMELLPEIG GHILENTEDI TVDVHNPDVN VRVEIRSGYS YIMCDERMGA GGLPVGVGGK
VMVLLSGGID SPVAAYLTMK RGVSVEAVHF HSPPFTSERA KQKVIDLAQE LTKYCKRVTL
HLVPFTEVQK TINKEIPSSY SMTVMRRMMM RITERIAEER NALAITTGES LGQVASQTLD
SMHTINEVTN YPVIRPLITM DKLEIIKIAE EIGTYDISIR PYEDCCTVFT PASPATKPKR
EKANRFEAKY DFTTLIDEAV ANKETVVLQT VEVVAEEEKF EELF