Gene BCAH820_5316 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_5316 
SymbolthiC 
ID7189706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp5010892 
End bp5012652 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content43% 
IMG OID643558726 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_002454236 
Protein GI218906402 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value9.03557e-31 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACAGT CTGTTTCAGC TGAGCAAATT GAATTGAAAT CGAGTTTACC AGGAAGCAAG 
AAAGTGTATG TGGATGGACC ACGAGAAGGT ATGAAAGTGC CGATGCGTGA GATTGAACAA
AGTGATACAA ATGGCGTTCC AAATCCGCCA ATTCGTGTGT ATGATACAAG CGGTCCTTAC
ACAGATCCTG CGTATAAAGT CGAGTTAGAG AAGGGGATTC CAACGCCGCG CCACTCTTGG
ATTCTAGAGC GCGGAGATGT AGAGGCATAC GAAGGGCGCG AAGTGAAACC AGAGGATGAC
GGTGTGAAGG TGGCTTCGAA ACATACACCT GTTTTCCCGC AAATGGATCG CAAACCGCTT
AGAGCGAAGC AAGGTGCAAA TGTTACGCAA ATGCATTATG CACGTAATGG CATCATTACG
TCTGAGATGG AATATGTTGC GATTCGTGAA GGAGTAGACC CGGAATTTGT TCGTAAGGAA
ATCGCGGAAG GTCGCGCTAT TTTACCAGCG AATATTAACC ATCCTGAAGC AGAACCGATG
ATTATTGGGC GTAATTTCCA TGTGAAGGTT AATGCGAATA TCGGAAACTC TGCTGTATCT
TCTTCTATTG CAGAAGAAGT AGAGAAGATG ACGTGGGCAA CTCGCTGGGG TGCAGATACG
ATTATGGATT TATCTACAGG TAAAAACATT CATACGACGC GCGAGTGGAT TATTCGTAAC
GCACCTGTAC CAGTTGGAAC TGTACCAATC TATCAAGCAC TGGAAAAAGT AAACGGAATT
GCAGAAGATT TAACGTGGGA AGTGTATCGT GATACGTTAA TTGAGCAAGC GGAGCAAGGC
GTAGATTACT TTACGATTCA CGCTGGCGTA TTACTTCGTT ACATTCCAAT TACGGCGAAA
CGTACGACAG GTATCGTTTC ACGCGGTGGT TCAATTATGG CACAGTGGTG TTTATTCCAT
CATAAAGAAA ACTTCCTATA CACTCATTTT GAAGAGATTT GTGAAATTAT GAAGCAATAC
GATGTTTCGT TCTCTCTTGG AGATGGATTA CGTCCAGGTT CGATTGCAGA TGCAAATGAC
GAAGCACAGT TTTCTGAGCT TGAAACACTT GGTGAATTAA CGAAGATTGC TTGGAAACAT
GATGTGCAAG TGATGATTGA AGGGCCTGGG CATGTACCGA TGCATTTAAT TAAAGAGAAT
ATGGAGAAAG AACTTGATAT TTGTCAGGGC GCGCCGTTCT ATACACTTGG GCCGTTAACG
ACAGATATTG CACCAGGTTA TGACCATATT ACATCTGCGA TTGGAGCTGC GATGATTGGT
TGGTTTGGAA CGGCGATGCT TTGTTATGTA ACGCCGAAAG AACATTTAGG TTTACCAAAT
AAAGATGATG TTCGAGAAGG TGTTATTACG TACAAAATCG CTGCACATGC GGCTGATCTA
GCGAAAGGTC ACAAAACGGC TCATCAGCGT GATGATGCCC TTTCAAAAGC CCGCTTTGAA
TTCCGTTGGC GCGATCAATT TAATTTATCT TTAGATCCTG AACGCGCGAT GGAGTATCAC
GATGAAACAT TGCCAGCAGA AGGCGCGAAA ACAGCTCATT TCTGTTCCAT GTGTGGACCG
AAGTTTTGTA GTATGAGAAT TTCACATGAT ATTCGTGAAT ACGCAAAAGA AAATGATTTA
GAAACGACAG AAGCAATTGA AAAAGGAATG AAAGAGAAAG CGAAAGAATT TAAAGAAACT
GGTAGTCATT TATATCAATA A
 
Protein sequence
MKQSVSAEQI ELKSSLPGSK KVYVDGPREG MKVPMREIEQ SDTNGVPNPP IRVYDTSGPY 
TDPAYKVELE KGIPTPRHSW ILERGDVEAY EGREVKPEDD GVKVASKHTP VFPQMDRKPL
RAKQGANVTQ MHYARNGIIT SEMEYVAIRE GVDPEFVRKE IAEGRAILPA NINHPEAEPM
IIGRNFHVKV NANIGNSAVS SSIAEEVEKM TWATRWGADT IMDLSTGKNI HTTREWIIRN
APVPVGTVPI YQALEKVNGI AEDLTWEVYR DTLIEQAEQG VDYFTIHAGV LLRYIPITAK
RTTGIVSRGG SIMAQWCLFH HKENFLYTHF EEICEIMKQY DVSFSLGDGL RPGSIADAND
EAQFSELETL GELTKIAWKH DVQVMIEGPG HVPMHLIKEN MEKELDICQG APFYTLGPLT
TDIAPGYDHI TSAIGAAMIG WFGTAMLCYV TPKEHLGLPN KDDVREGVIT YKIAAHAADL
AKGHKTAHQR DDALSKARFE FRWRDQFNLS LDPERAMEYH DETLPAEGAK TAHFCSMCGP
KFCSMRISHD IREYAKENDL ETTEAIEKGM KEKAKEFKET GSHLYQ