Gene BCAH187_A4782 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH187_A4782 
SymbolthiI 
ID7076747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH187 
KingdomBacteria 
Replicon accessionNC_011658 
Strand
Start bp4434003 
End bp4435217 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content38% 
IMG OID643453194 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_002340705 
Protein GI217962135 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGACAT ATGAATATAT TTTAGTTCGT TATGGAGAGA TGACGACTAA AGGTAAGAAC 
CGTTCTAAAT TTGTAAGCAC ATTAAAAGAT AACGTAAAGT TCAAACTGAA AAAATTCCCA
AATATTAAAA TCGATGCAAC ACATGATCGT ATGTATATCC AATTAAATGG CGAAGATCAT
GAAGCGGTAT CTGAAAGATT AAAAGATGTG TTTGGTATTC ATAAGTTTAA CTTAGCGATG
AAAGTACCAT CAGAATTAGA AGACATTAAA AAAGGTGCAT TAGCAGCTTT CTTACAAGTA
AAAGGCGATG TGAAAACATT TAAAATTACT GTACACCGTT CTTATAAGCA TTTCCCAATG
AGAACGATGG AATTATTACC TGAGATTGGT GGACACATTC TAGAAAATAC AGAAGATATT
ACTGTAGATG TTCATAATCC AGATGTAAAT GTACGTGTAG AAATTCGCAG TGGCTATAGC
TACATTATGT GCGATGAGCG TATGGGAGCT GGCGGTTTAC CAGTTGGCGT TGGCGGAAAA
GTAATGGTAC TTCTTTCTGG CGGTATTGAT AGCCCAGTAG CAGCGTACTT AACGATGAAA
CGCGGCGTAT CTGTGGAAGC AGTTCACTTC CATAGCCCAC CTTTCACAAG TGAGCGCGCA
AAACAAAAAG TAATCGATTT AGCACAAGAG TTAACGAAAT ACTGTAAACG TGTAACACTT
CACCTTGTTC CATTTACAGA AGTGCAAAAA ACGATTAATA AAGAAATCCC ATCTAGTTAC
TCAATGACAG TTATGCGCCG TATGATGATG CGTATTACAG AGCGTATCGC AGAGGAGCGT
AACGCTCTAG CAATCACGAC TGGTGAAAGT CTTGGACAAG TAGCAAGTCA AACGTTAGAT
AGTATGCATA CGATTAACGA AGTAACAAAC TATCCAATTA TTCGTCCGCT TATTACGATG
GATAAATTAG AGATTATTAA AATCGCTGAA GAAATCGGCA CATATGATAT TTCAATTCGT
CCGTACGAAG ATTGCTGTAC AGTATTCACA CCAGCAAGCC CAGCGACGAA GCCGAAGCGT
GAAAAAGCGA ATCGTTTTGA AGCGAAATAC GATTTCACAC CATTAATCGA TGAAGCTGTA
GCGAACAAAG AAACTATGGT ATTACAAACG GTAGAAGTAG TGGCGGAAGA AGAAAAATTC
GAAGAACTTT TCTAA
 
Protein sequence
MMTYEYILVR YGEMTTKGKN RSKFVSTLKD NVKFKLKKFP NIKIDATHDR MYIQLNGEDH 
EAVSERLKDV FGIHKFNLAM KVPSELEDIK KGALAAFLQV KGDVKTFKIT VHRSYKHFPM
RTMELLPEIG GHILENTEDI TVDVHNPDVN VRVEIRSGYS YIMCDERMGA GGLPVGVGGK
VMVLLSGGID SPVAAYLTMK RGVSVEAVHF HSPPFTSERA KQKVIDLAQE LTKYCKRVTL
HLVPFTEVQK TINKEIPSSY SMTVMRRMMM RITERIAEER NALAITTGES LGQVASQTLD
SMHTINEVTN YPIIRPLITM DKLEIIKIAE EIGTYDISIR PYEDCCTVFT PASPATKPKR
EKANRFEAKY DFTPLIDEAV ANKETMVLQT VEVVAEEEKF EELF