Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1241 |
Symbol | |
ID | 7976028 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1293983 |
End bp | 1295989 |
Gene Length | 2007 bp |
Protein Length | 668 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644798187 |
Product | transketolase |
Protein accession | YP_002949360 |
Protein GI | 239826736 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0021] Transketolase |
TIGRFAM ID | [TIGR00232] transketolase, bacterial and yeast |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000546415 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCATA CAATCGAAGA ATTAGCGATT ACGACGATTC GAACATTGTC GATCGATGCC ATCGAGAAAG CGAAATCCGG TCATCCGGGA ATGCCGATGG GCGCCGCTCC GATGGCGTAC ACGTTGTGGA CGAAATTTAT GAACCATAAT CCGAGCAACC CGAAATGGTT TAACCGCGAC CGTTTCGTGT TATCGGCAGG GCATGGGTCG ATGCTGTTAT ACAGCTTGCT TCATTTAAGC GGTTACGATG TTTCCATGGA AGATATTAAA CAATTTCGTC AATGGGGCAG CAAAACGCCA GGACATCCAG AATATGGCCA TACACCAGGG GTAGAAGCAA CGACTGGTCC GCTCGGCCAG GGGATTGCAA TGGCCGTTGG TATGGCAATG GCAGAGCGCC ATTTAGCAGC TACATATAAC CGGGAAAATT TTGAAATTAT AAACCACTAT ACGTATGCGA TTTGCGGCGA TGGCGACTTA ATGGAAGGGG TTGCCTCTGA AGCAGCATCA TTAGCTGGAC ATTTAAAACT CGGCCGCCTT ATCGTTCTTT ATGACTCGAA CGATATTTCG CTTGACGGAG AATTAAACCT TTCCTTTTCT GAAAATGTAG AACAGCGCTT TAAAGCGTAC GGCTGGCAAT ATTTGCGCGT AGAAGATGGA AATAATATTG AGGAAATCGC TAAAGCCATT GAAGAAGCGA AAGCGGACAC GACTCGTCCG ACATTGATTG AAGTGAAAAC GACCATTGGC TATGGTTCGC CAAATAAAGC TGGAACTTCT AACGTGCACG GCGCTCCGCT TGGGGCGGAA GAATTGAAAC TTACGAAAGA AGCGTATAAA TGGACATTTG AAGAAGATTT TTACGTTCCG CAAGAAGTAT ACGATCACTT CCGGCAAGTG GTAAAAGAAG CGGGAGAGAA AAAAGAAGCG GAATGGAATG AACTATTTGC CCAGTACGAA AAAGCGTATC CGGATTTGGC GAAACAATTG AAATTAGCGA TGAACGGAGA ACTTCCAGAA GGATGGGAAA AAGCTCTTCC GGTATATGAA GAAGGAAAAA GCCTGGCAAC ACGCGCTTCT TCCGGCGAAG TGCTAAACGC GATTGCTAAA GTGGTTCCGC AATTGATCGG CGGTTCAGCT GACCTTGCAA GTTCTAATAA AACATTAATC AAAGGCGCCG GAAACTTTTT GCCAGAAAGC TATGAAGGGC GAAACATTTG GTTTGGCGTG CGTGAATTTG CGATGGGCGC AGCACTTAAC GGAATGGCAT TACATGGCGG TTTAAAAGTA TTCGGCGGTA CGTTCTTTGT TTTCTCTGAT TATTTGCGTC CTGCCATCCG TCTCGCTGCA TTAATGGGAT TGCCAGTGAC GTACGTTCTT ACCCATGACA GCATCGCTGT TGGCGAAGAC GGGCCGACGC ATGAGCCGAT CGAGCATTTG CCATCGCTGC GCGCGATGCC GAATTTATCG GTCATTCGCC CGGCAGATGC CAACGAAACG GCGGCTGCAT GGCGTTTAGC CGTAGAGTCA ACCGATCAAC CGACCGCATT AGTGTTAACG CGGCAAAATG TCCCAACATT GCCGAATACG GCAGAACGGG CATACGAAGG CGTGAAAAAA GGTGCTTATG TCCTATCTGA GGCGAAAAAC GGCAATCCAG AAGCATTATT GCTTGCGTCT GGATCTGAGG TAAGCCTTGC GGTGAAAGCA CAACAAGCAT TAGCGGAAGA AGGCATTCAC GTTTCTGTCA TCAGCATGCC ATCATGGGAT CGTTTTGAAA AACAACCAGA TGAGTATAAA CAACAAGTGC TTCCGCGTAC AGTGAAAAAA CGTTTAGCGA TTGAAATGGC AGCTTCGCTC GGCTGGGAGC GTTATGTCGG TGATGAAGGC GATATTTTAG CGATCGACCG CTTCGGAGCT TCCGCACCAG GAGAAAAAAT CATGGAAGAG TACGGATTTA CAGTAGAAAA TGTGGTCAAA CGAGTAAAAG CATTGCTTGG CAAATAA
|
Protein sequence | MTHTIEELAI TTIRTLSIDA IEKAKSGHPG MPMGAAPMAY TLWTKFMNHN PSNPKWFNRD RFVLSAGHGS MLLYSLLHLS GYDVSMEDIK QFRQWGSKTP GHPEYGHTPG VEATTGPLGQ GIAMAVGMAM AERHLAATYN RENFEIINHY TYAICGDGDL MEGVASEAAS LAGHLKLGRL IVLYDSNDIS LDGELNLSFS ENVEQRFKAY GWQYLRVEDG NNIEEIAKAI EEAKADTTRP TLIEVKTTIG YGSPNKAGTS NVHGAPLGAE ELKLTKEAYK WTFEEDFYVP QEVYDHFRQV VKEAGEKKEA EWNELFAQYE KAYPDLAKQL KLAMNGELPE GWEKALPVYE EGKSLATRAS SGEVLNAIAK VVPQLIGGSA DLASSNKTLI KGAGNFLPES YEGRNIWFGV REFAMGAALN GMALHGGLKV FGGTFFVFSD YLRPAIRLAA LMGLPVTYVL THDSIAVGED GPTHEPIEHL PSLRAMPNLS VIRPADANET AAAWRLAVES TDQPTALVLT RQNVPTLPNT AERAYEGVKK GAYVLSEAKN GNPEALLLAS GSEVSLAVKA QQALAEEGIH VSVISMPSWD RFEKQPDEYK QQVLPRTVKK RLAIEMAASL GWERYVGDEG DILAIDRFGA SAPGEKIMEE YGFTVENVVK RVKALLGK
|
| |