Gene BCG9842_B0479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B0479 
SymbolthiI 
ID7181419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp4573622 
End bp4574836 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content39% 
IMG OID643552546 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_002448213 
Protein GI218899802 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.000000040209 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATGACAT ATGAATATAT TTTAGTGCGT TACGGAGAAA TGACGACAAA AGGTAAGAAC 
CGTTCTAAAT TTGTAAGCAC ATTAAAAGAT AACGTGAAGT TCAAACTGAA AAAGTTCCCA
AACATTAAAA TTGATGCAAC GCATGACCGT ATGTACATCC AGTTAAACGG TGAAGATCAT
GAGGCAATCT CTGAAAGATT GAAAGACGTA TTTGGTATTC ATAAGTTTAA CTTAGCGATG
AAAGTACCAT CAGAATTAGA AGACATTAAA AAAGGTGCAT TAGCAGCTTT CTTACAAGTA
AAAGATGATG TGAAAACATT TAAAATTACA GTGCATCGTT CTGATAAGCG CTTCCCAATG
AAGACGATGG AGTTACTTCC AGAAATCGGT GGGCACATTT TAGAAAATAC AGAGGATATA
ACAGTAGATG TTCATAATCC AGATGTGAAT GTACGTATAG AAATTCGCAG TGGTTATAGT
TATATTATGT GTGGTGAGCA CATGGGAGCT GGCGGTTTAC CAGTTGGCGT TGGTGGAAAA
GTAATGGTAC TTCTTTCTGG TGGTATTGAT AGCCCGGTAG CAGCGTACTT AACGATGAAA
CGCGGCGTAT CTGTGGAAGC AGTTCACTTC CATAGCCCAC CTTTCACAAG TGAGCGTGCA
AAACAAAAAG TAATCGATTT AGCACAAGGG TTAACGAAAT ACTGTAAACG TGTAACGCTG
CACCTTGTTC CGTTTACAGA AGTGCAAAAA ACGATTAATA AAGAAATCCC ATCTAGCTAT
TCAATGACGG TTATGCGCCG TATGATGATG CGTATTACAG AGCAGATTGC TGAGGAGCGT
AACGCACTTG CAATTACGAC TGGTGAAAGT CTTGGACAAG TAGCAAGCCA AACATTAGAT
AGCATGCATA CGATTAACGA AGTAACAAAC TACCCAGTTA TTCGTCCGCT TATTACGATG
GATAAATTAG AAATTATTAA AATTGCTGAA GAAATCGGCA CGTATGATAT TTCAATTCGT
CCATACGAAG ATTGCTGTAC AGTATTCACA CCAGCTAGCC CGGCGACGAA GCCGAAGCGT
GAAAAAGCAA ATCGATTTGA AGCGAAATAC GATTTCACAC CGTTAATCGA AGAAGCTGTA
GCGAACAAAG AAACAATGGT ATTACAAACG GTTGAAGTAG TGGCGGAAGA AGAAAAATTT
GAAGAACTTT TCTAA
 
Protein sequence
MMTYEYILVR YGEMTTKGKN RSKFVSTLKD NVKFKLKKFP NIKIDATHDR MYIQLNGEDH 
EAISERLKDV FGIHKFNLAM KVPSELEDIK KGALAAFLQV KDDVKTFKIT VHRSDKRFPM
KTMELLPEIG GHILENTEDI TVDVHNPDVN VRIEIRSGYS YIMCGEHMGA GGLPVGVGGK
VMVLLSGGID SPVAAYLTMK RGVSVEAVHF HSPPFTSERA KQKVIDLAQG LTKYCKRVTL
HLVPFTEVQK TINKEIPSSY SMTVMRRMMM RITEQIAEER NALAITTGES LGQVASQTLD
SMHTINEVTN YPVIRPLITM DKLEIIKIAE EIGTYDISIR PYEDCCTVFT PASPATKPKR
EKANRFEAKY DFTPLIEEAV ANKETMVLQT VEVVAEEEKF EELF