Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCG9842_B5612 |
Symbol | thiC |
ID | 7185150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | - |
Start bp | 5091191 |
End bp | 5092951 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643553115 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_002448756 |
Protein GI | 218900345 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.188984 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0000000000000085296 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACAAT CTGTTTCAGC TGAGCAAATT GAATTGAAAT CGAGTTTACC AGGGAGTAAG AAAGTATATG TGGATGGACC ACGAGAAGGT ATGAAAGTGC CGATGCGTGA GATTGAACAA AGCGAAACGA ATGGCGTCCC AAATCCACCA ATCCGTGTGT ATGATACGAG CGGTCCTTAT ACAGATCCTG CGTATAAGGT CGAGCTGGAA AAAGGCATTC CAACGCCGCG CCACTCCTGG ATTATGGGGC GTGGTGATGT AGACGCATAC GAAGGGCGTG AAGTAAAACC AGAGGATGAC GGTGTGAAAG TGGCTTCGAA ACATACACCT GTTTTCCCGC AAATGGATCG TAAGCCGCTT AGAGCAAAGC AAGGTGCAAA TGTTACGCAA ATGCACTATG CGCGTAATGG CATTATTACG TCTGAGATGG AATACGTTGC GATTCGTGAA GGGGTAGAGC CGGAGTTTGT TCGTAAGGAA ATCGCAGAAG GCCGCGCTAT TTTACCAGCG AATATTAATC ATCCAGAAGC TGAACCGATG ATTATCGGGC GTAACTTCCA TGTGAAGGTC AATGCAAATA TCGGAAACTC TGCTGTATCT TCTTCTATTG CAGAAGAAGT AGAGAAGATG ACGTGGGCTA CTCGCTGGGG TGCAGATACA ATTATGGATT TATCTACAGG TAAAAATATT CATACGACGC GTGAGTGGAT TATTCGTAAC GCGCCTGTAC CAGTTGGAAC TGTACCGATC TATCAAGCGC TGGAAAAAGT AAACGGAATT GCAGAAGATT TAACGTGGGA AGTATATCGT GATACGTTAA TTGAGCAGGC GGAGCAAGGC GTTGATTACT TTACGATTCA CGCTGGTGTA TTGCTTCGTT ACATTCCAAT CACGGCAAAG CGTACGACAG GTATCGTTTC ACGCGGTGGT TCGATTATGG CGCAGTGGTG TTTATTCCAT CATAAAGAAA ACTTCCTATA TACTCATTTT GAAGAGATTT GTGAAATTAT GAAACAGTAC GATGTTTCGT TCTCTCTTGG AGATGGATTA CGTCCAGGGT CTATTGCAGA TGCAAATGAT GAAGCACAGT TTTCTGAGCT TGAAACACTT GGTGAATTAA CCAAAATTGC TTGGAAACAT GATGTGCAAG TTATGATCGA AGGACCTGGA CACGTACCGA TGCACTTAAT TAAAGAAAAT ATGGAGAAAG AGCTTGATAT TTGTCAGGGC GCGCCGTTTT ACACACTTGG GCCACTAACG ACAGATATTG CACCAGGTTA TGACCATATT ACATCGGCGA TTGGAGCTGC GATGATCGGT TGGTTTGGAA CGGCGATGCT TTGTTATGTA ACGCCGAAAG AACATTTAGG TTTACCGAAT AAAGATGATG TACGCACGGG TGTTATTACG TACAAAATTG CAGCGCATGC GGCTGATCTT GCGAAAGGAC ATAAAACGGC TCATCAGCGT GATGATGCAC TTTCAAAAGC ACGCTTCGAA TTCCGTTGGC GCGATCAATT TAATCTATCT TTAGATCCTG AACGTGCGAT GGAATATCAC GATGAAACGT TGCCTGCAGA AGGGGCAAAG ACAGCTCACT TCTGTTCAAT GTGTGGACCG AAGTTTTGTA GTATGAGAAT TTCACATGAT ATTCGTGAAT ATGCAAAAGA AAATGATTTA GAAACGACAG AAGCAATTGA AAAAGGAATG AAAGAGAAAG CAGAAGAATT TAAAGAAGCT GGTAGTCATT TATATCAATA A
|
Protein sequence | MKQSVSAEQI ELKSSLPGSK KVYVDGPREG MKVPMREIEQ SETNGVPNPP IRVYDTSGPY TDPAYKVELE KGIPTPRHSW IMGRGDVDAY EGREVKPEDD GVKVASKHTP VFPQMDRKPL RAKQGANVTQ MHYARNGIIT SEMEYVAIRE GVEPEFVRKE IAEGRAILPA NINHPEAEPM IIGRNFHVKV NANIGNSAVS SSIAEEVEKM TWATRWGADT IMDLSTGKNI HTTREWIIRN APVPVGTVPI YQALEKVNGI AEDLTWEVYR DTLIEQAEQG VDYFTIHAGV LLRYIPITAK RTTGIVSRGG SIMAQWCLFH HKENFLYTHF EEICEIMKQY DVSFSLGDGL RPGSIADAND EAQFSELETL GELTKIAWKH DVQVMIEGPG HVPMHLIKEN MEKELDICQG APFYTLGPLT TDIAPGYDHI TSAIGAAMIG WFGTAMLCYV TPKEHLGLPN KDDVRTGVIT YKIAAHAADL AKGHKTAHQR DDALSKARFE FRWRDQFNLS LDPERAMEYH DETLPAEGAK TAHFCSMCGP KFCSMRISHD IREYAKENDL ETTEAIEKGM KEKAEEFKEA GSHLYQ
|
| |