Gene BCG9842_B5612 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B5612 
SymbolthiC 
ID7185150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp5091191 
End bp5092951 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content43% 
IMG OID643553115 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_002448756 
Protein GI218900345 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.188984 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000000000085296 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACAAT CTGTTTCAGC TGAGCAAATT GAATTGAAAT CGAGTTTACC AGGGAGTAAG 
AAAGTATATG TGGATGGACC ACGAGAAGGT ATGAAAGTGC CGATGCGTGA GATTGAACAA
AGCGAAACGA ATGGCGTCCC AAATCCACCA ATCCGTGTGT ATGATACGAG CGGTCCTTAT
ACAGATCCTG CGTATAAGGT CGAGCTGGAA AAAGGCATTC CAACGCCGCG CCACTCCTGG
ATTATGGGGC GTGGTGATGT AGACGCATAC GAAGGGCGTG AAGTAAAACC AGAGGATGAC
GGTGTGAAAG TGGCTTCGAA ACATACACCT GTTTTCCCGC AAATGGATCG TAAGCCGCTT
AGAGCAAAGC AAGGTGCAAA TGTTACGCAA ATGCACTATG CGCGTAATGG CATTATTACG
TCTGAGATGG AATACGTTGC GATTCGTGAA GGGGTAGAGC CGGAGTTTGT TCGTAAGGAA
ATCGCAGAAG GCCGCGCTAT TTTACCAGCG AATATTAATC ATCCAGAAGC TGAACCGATG
ATTATCGGGC GTAACTTCCA TGTGAAGGTC AATGCAAATA TCGGAAACTC TGCTGTATCT
TCTTCTATTG CAGAAGAAGT AGAGAAGATG ACGTGGGCTA CTCGCTGGGG TGCAGATACA
ATTATGGATT TATCTACAGG TAAAAATATT CATACGACGC GTGAGTGGAT TATTCGTAAC
GCGCCTGTAC CAGTTGGAAC TGTACCGATC TATCAAGCGC TGGAAAAAGT AAACGGAATT
GCAGAAGATT TAACGTGGGA AGTATATCGT GATACGTTAA TTGAGCAGGC GGAGCAAGGC
GTTGATTACT TTACGATTCA CGCTGGTGTA TTGCTTCGTT ACATTCCAAT CACGGCAAAG
CGTACGACAG GTATCGTTTC ACGCGGTGGT TCGATTATGG CGCAGTGGTG TTTATTCCAT
CATAAAGAAA ACTTCCTATA TACTCATTTT GAAGAGATTT GTGAAATTAT GAAACAGTAC
GATGTTTCGT TCTCTCTTGG AGATGGATTA CGTCCAGGGT CTATTGCAGA TGCAAATGAT
GAAGCACAGT TTTCTGAGCT TGAAACACTT GGTGAATTAA CCAAAATTGC TTGGAAACAT
GATGTGCAAG TTATGATCGA AGGACCTGGA CACGTACCGA TGCACTTAAT TAAAGAAAAT
ATGGAGAAAG AGCTTGATAT TTGTCAGGGC GCGCCGTTTT ACACACTTGG GCCACTAACG
ACAGATATTG CACCAGGTTA TGACCATATT ACATCGGCGA TTGGAGCTGC GATGATCGGT
TGGTTTGGAA CGGCGATGCT TTGTTATGTA ACGCCGAAAG AACATTTAGG TTTACCGAAT
AAAGATGATG TACGCACGGG TGTTATTACG TACAAAATTG CAGCGCATGC GGCTGATCTT
GCGAAAGGAC ATAAAACGGC TCATCAGCGT GATGATGCAC TTTCAAAAGC ACGCTTCGAA
TTCCGTTGGC GCGATCAATT TAATCTATCT TTAGATCCTG AACGTGCGAT GGAATATCAC
GATGAAACGT TGCCTGCAGA AGGGGCAAAG ACAGCTCACT TCTGTTCAAT GTGTGGACCG
AAGTTTTGTA GTATGAGAAT TTCACATGAT ATTCGTGAAT ATGCAAAAGA AAATGATTTA
GAAACGACAG AAGCAATTGA AAAAGGAATG AAAGAGAAAG CAGAAGAATT TAAAGAAGCT
GGTAGTCATT TATATCAATA A
 
Protein sequence
MKQSVSAEQI ELKSSLPGSK KVYVDGPREG MKVPMREIEQ SETNGVPNPP IRVYDTSGPY 
TDPAYKVELE KGIPTPRHSW IMGRGDVDAY EGREVKPEDD GVKVASKHTP VFPQMDRKPL
RAKQGANVTQ MHYARNGIIT SEMEYVAIRE GVEPEFVRKE IAEGRAILPA NINHPEAEPM
IIGRNFHVKV NANIGNSAVS SSIAEEVEKM TWATRWGADT IMDLSTGKNI HTTREWIIRN
APVPVGTVPI YQALEKVNGI AEDLTWEVYR DTLIEQAEQG VDYFTIHAGV LLRYIPITAK
RTTGIVSRGG SIMAQWCLFH HKENFLYTHF EEICEIMKQY DVSFSLGDGL RPGSIADAND
EAQFSELETL GELTKIAWKH DVQVMIEGPG HVPMHLIKEN MEKELDICQG APFYTLGPLT
TDIAPGYDHI TSAIGAAMIG WFGTAMLCYV TPKEHLGLPN KDDVRTGVIT YKIAAHAADL
AKGHKTAHQR DDALSKARFE FRWRDQFNLS LDPERAMEYH DETLPAEGAK TAHFCSMCGP
KFCSMRISHD IREYAKENDL ETTEAIEKGM KEKAEEFKEA GSHLYQ