Gene BCAH820_4765 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_4765 
SymbolthiI 
ID7187238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp4511868 
End bp4513082 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content38% 
IMG OID643558175 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_002453711 
Protein GI218905877 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones273 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGACAT ATGAATATAT TTTAGTTCGT TATGGAGAGA TGACGACTAA AGGTAAGAAC 
CGTTCTAAAT TTGTAAGCAC ATTAAAAGAT AACGTAAAGT TCAAACTGAA AAAATTCCCA
AATATTAAAA TCGATGCAAC ACATGATCGC ATGTACATCC AATTAAATGG CGAAGATCAT
GAAGCGGTAT CTGAAAGATT GAAAGATGTA TTTGGTATTC ATAAGTTTAA CTTAGCGATG
AAAGTACCAT CAGAATTAGA AGACATTAAA AAAGGTGCAT TAGCAGCTTT CTTACAAGTA
AAAGGCGATG TGAAAACATT TAAAATTACT GTACACCGTT CTTATAAGCA TTTCCCAATG
AGAACGATGG AATTATTACC TGAGATTGGT GGACATATTC TAGAAAATAC AGAAGATATT
ACTGTAGATG TTCATAATCC AGATGTAAAT GTACGCGTAG AAATCCGTAG CGGTTATAGC
TACATTATGT GTGATGAGCG TATGGGAGCT GGCGGTTTAC CAGTTGGCGT TGGCGGAAAA
GTAATGGTAC TTCTTTCTGG CGGTATTGAT AGCCCAGTAG CAGCGTACTT AACGATGAAA
CGCGGCGTAT CTGTGGAAGC AGTTCACTTC CATAGCCCGC CTTTCACAAG TGAGCGCGCG
AAACAAAAAG TAATCGATTT AGCACAAGAA TTAACGAAAT ACTGTAAACG TGTAACACTT
CACCTTGTTC CATTTACAGA AGTGCAAAAA ACGATTAATA AAGAAATCCC ATCTAGCTAT
TCAATGACAG TTATGCGCCG TATGATGATG CGTATTACAG AGCGTATCGC AGAGGAGCGT
AACGCACTAG CAATCACGAC TGGTGAAAGT CTTGGACAAG TAGCAAGTCA AACGTTAGAT
AGTATGCATA CGATTAACGA AGTAACAAAC TATCCAGTTA TTCGTCCGCT TATTACGATG
GATAAATTAG AGATTATTAA AATCGCTGAA GAGATCGGCA CATATGATAT TTCAATTCGT
CCGTACGAAG ATTGCTGTAC AGTATTCACA CCAGCAAGTC CAGCGACGAA GCCGAAGCGT
GAAAAAGCGA ATCGTTTTGA AGCGAAATAC GATTTCACAC CATTAATCGA TGAAGCTGTA
GCGAACAAAG AAACAATGGT ATTACAAACG GTAGAAGTAG TAGCGGAAGA AGAAAAATTC
GAAGAACTTT TCTAA
 
Protein sequence
MMTYEYILVR YGEMTTKGKN RSKFVSTLKD NVKFKLKKFP NIKIDATHDR MYIQLNGEDH 
EAVSERLKDV FGIHKFNLAM KVPSELEDIK KGALAAFLQV KGDVKTFKIT VHRSYKHFPM
RTMELLPEIG GHILENTEDI TVDVHNPDVN VRVEIRSGYS YIMCDERMGA GGLPVGVGGK
VMVLLSGGID SPVAAYLTMK RGVSVEAVHF HSPPFTSERA KQKVIDLAQE LTKYCKRVTL
HLVPFTEVQK TINKEIPSSY SMTVMRRMMM RITERIAEER NALAITTGES LGQVASQTLD
SMHTINEVTN YPVIRPLITM DKLEIIKIAE EIGTYDISIR PYEDCCTVFT PASPATKPKR
EKANRFEAKY DFTPLIDEAV ANKETMVLQT VEVVAEEEKF EELF