Gene BCZK4920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK4920 
SymbolthiC 
ID3024618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp5010938 
End bp5012698 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content43% 
IMG OID637549153 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_086490 
Protein GI52140340 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAGT CTGTTTCAGC TGAGCAAATT GAATTGAAAT CGAGTTTACC GGGAAGTAAG 
AAAGTGTATG TGGATGGACC ACGCGAAGGT ATGAAAGTGC CGATGCGTGA GATTGAACAA
AGTGATACAA ATGGCGTTCC AAATCCGCCA ATTCGTGTGT ATGATACAAG CGGTCCTTAC
ACAGATCCTG CGTATAAAGT CGAGTTAGAG AAGGGGATTC CAACGCCGCG CCACTCTTGG
ATTCTAGAGC GCGGAGATGT AGAGGCATAC GAAGGGCGCG AAGTGAAACC AGAGGATGAC
GGTGTGAAGG TGGCTTCGAA ACATACACCT GTTTTCCCGC AAATGGATCG CAAACCGCTT
AGAGCGAAGC AAGGTGCAAA TGTTACGCAA ATGCATTATG CACGTAATGG CATCATTACG
TCTGAGATGG AATATGTTGC GATTCGTGAA GGAGTAGACC CGGAATTTGT TCGTAAGGAA
ATCGCGGAAG GTCGCGCTAT TTTACCAGCG AATATTAACC ATCCTGAAGC AGAACCGATG
ATTATTGGGC GTAATTTCCA TGTGAAGGTT AATGCGAATA TCGGAAACTC TGCTGTATCT
TCTTCTATTG CAGAAGAAGT AGAGAAGATG ACGTGGGCAA CTCGTTGGGG TGCAGATACA
ATTATGGATT TATCTACAGG TAAAAACATT CATACGACGC GCGAGTGGAT TATTCGTAAC
GCACCCGTAC CAGTTGGAAC TGTACCAATC TATCAAGCGC TGGAAAAAGT AAATGGAATT
GCAGAAGATT TAACGTGGGA AGTGTATCGT GATACGTTAA TTGAGCAAGC GGAGCAAGGC
GTAGATTACT TTACGATTCA CGCTGGCGTA TTACTTCGTT ACATTCCAAT TACGGCAAAG
CGTACGACAG GTATCGTTTC ACGCGGTGGT TCGATTATGG CACAGTGGTG TTTATTCCAT
CATAAAGAAA ACTTCCTATA CACTCATTTT GAAGAGATTT GTGAAATTAT GAAGCAGTAC
GATGTTTCGT TCTCTCTTGG AGATGGATTA CGTCCAGGTT CGATTGCAGA TGCAAATGAC
GAAGCACAGT TCTCTGAGCT TGAAACACTT GGTGAATTAA CGAAGATTGC TTGGAAACAC
GATGTGCAAG TTATGATTGA AGGGCCTGGG CATGTACCTA TGCATTTAAT TAAAGAGAAT
ATGGAGAAAG AACTTGATAT TTGTCAGGGC GCGCCGTTCT ATACACTGGG GCCGTTAACG
ACAGATATTG CACCAGGTTA TGACCATATT ACATCTGCGA TTGGAGCTGC GATGATTGGT
TGGTTTGGAA CGGCGATGCT TTGTTATGTA ACGCCGAAAG AACATTTAGG TTTACCAAAT
AAAGATGATG TTCGAGAAGG TGTTATTACG TACAAAATCG CTGCACATGC GGCTGATCTA
GCGAAAGGTC ACAAAACGGC TCATCAGCGT GATGATGCCC TTTCAAAAGC ACGCTTTGAA
TTCCGTTGGC GCGATCAATT TAATTTATCT TTAGATCCTG AACGCGCGAT GGAGTATCAC
GATGAAACAT TGCCAGCAGA AGGAGCGAAA ACGGCTCATT TCTGTTCCAT GTGTGGACCG
AAGTTTTGTA GTATGAGAAT TTCACATGAT ATTCGTGAAT ACGCAAAAGA AAATGATTTA
GAAACGACAG AAGCAATTGA AAAAGGAATG AAAGAGAAAG CGAAAGAATT TAAAGAAACT
GGTAGTCATT TATACCAATA A
 
Protein sequence
MKQSVSAEQI ELKSSLPGSK KVYVDGPREG MKVPMREIEQ SDTNGVPNPP IRVYDTSGPY 
TDPAYKVELE KGIPTPRHSW ILERGDVEAY EGREVKPEDD GVKVASKHTP VFPQMDRKPL
RAKQGANVTQ MHYARNGIIT SEMEYVAIRE GVDPEFVRKE IAEGRAILPA NINHPEAEPM
IIGRNFHVKV NANIGNSAVS SSIAEEVEKM TWATRWGADT IMDLSTGKNI HTTREWIIRN
APVPVGTVPI YQALEKVNGI AEDLTWEVYR DTLIEQAEQG VDYFTIHAGV LLRYIPITAK
RTTGIVSRGG SIMAQWCLFH HKENFLYTHF EEICEIMKQY DVSFSLGDGL RPGSIADAND
EAQFSELETL GELTKIAWKH DVQVMIEGPG HVPMHLIKEN MEKELDICQG APFYTLGPLT
TDIAPGYDHI TSAIGAAMIG WFGTAMLCYV TPKEHLGLPN KDDVREGVIT YKIAAHAADL
AKGHKTAHQR DDALSKARFE FRWRDQFNLS LDPERAMEYH DETLPAEGAK TAHFCSMCGP
KFCSMRISHD IREYAKENDL ETTEAIEKGM KEKAKEFKET GSHLYQ