Gene Information |
Locus tag | GWCH70_0826 |
Symbol | |
ID | 7979316 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 890729 |
End bp | 891844 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644797801 |
Product | Mandelate racemase/muconate lactonizing protein |
Protein accession | YP_002948974 |
Protein GI | 239826350 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR01928] o-succinylbenzoic acid (OSB) synthetase |
|
|
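The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Amino acid count for an unspliced CDS that ends with a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(890729, 891844)
print(length)                     # 1116
print(protein_length_aa(length))  # 371
```

Both results match the Gene Length and Protein Length fields of this record.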
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAATAA AAGTGAAACG AGTCATACTG CGCCACTTAC AAATGGAACT AAAGTCACCG TTTACCACAA GTTTCGGTTC GTTTCAAAAG AAGGAGTTTA TTTTAGTGGA AGCGATGGAT GAAGATGGGC TATCAGGATG GGGGGAATCG GTGGCCTTCC CTTCACCTTG GTATAATGAA GAAACCGTAA AAACGAATTG GCATATGATA GAAGATTTTT TATTGCCTCT TTTATTTCAA GCGCCAATTG CCCACCCAGA AGAGTTGCGG CAACGTTTCT CCGTCATTCG CAAAAATCAA ATGGCAAAAG CGGCGATAGA AGGGGCAATA TGGGATTTAT TTGCGAAAAG ACAGCAACTG CCGTTACATA AAGCACTCGG TGGAAATAAG AATCGTATTG AAGTAGGGGT AAGCATCGGG ATTCAAAAAA GCATCGATGA TTTATTGCGT ATTATCGAAC GGTACGTACA AGAAGGGTAT CGGCGCATCA AAATAAAAAT TAAACCCGGA TGGGACGTTG AGGTTGTTCG GGAAGTTCGT CGTCGTTTTC CAGATGTTCC GCTCATGGTC GATGCAAACT CTGCTTATTC GTTAGAAGAT ATCGACCGAC TAAAGGCGCT AGATGAGTTT CAGCTAATGA TGATTGAACA GCCGCTTGCT CCTGATGATA TCGTAGATCA TGCAACATTG CAGGCACAGT TAAATACGCC AATCTGTTTA GATGAAAGCA TTCATTCCGC TGAAGATGCA AGGAAAGCCA TTCAGCTTGG CAGTTGTCGA ATTATCAATA TAAAAATTGG CCGTGTCGGA GGATTGGCGG AGGCGAAGCG TATTCACGAT ATTTGTAAGG AGAACGATAT TCCTGTCTGG TGCGGCGGCA TGTTAGAAGC TGGTGTCGGA AGAGCGCACA ATATTGCCAT TACGACATTA GATAATTTTA CGTTGCCTGG TGATACAGCC GCTTCTTCCC ATTACTGGAC GAAAGATATT ATCGTTCCTG AAGTCACTGT TCATCATGGA ACGATTACAG TACCGGAAAA GCCGGGCATC GGTTATGATG TCGATCGGCA GCAAGTAGAT ATTTATACAA GTTATGCTAA GATGTACCAC GCTTAA
|
Protein sequence | MGIKVKRVIL RHLQMELKSP FTTSFGSFQK KEFILVEAMD EDGLSGWGES VAFPSPWYNE ETVKTNWHMI EDFLLPLLFQ APIAHPEELR QRFSVIRKNQ MAKAAIEGAI WDLFAKRQQL PLHKALGGNK NRIEVGVSIG IQKSIDDLLR IIERYVQEGY RRIKIKIKPG WDVEVVREVR RRFPDVPLMV DANSAYSLED IDRLKALDEF QLMMIEQPLA PDDIVDHATL QAQLNTPICL DESIHSAEDA RKAIQLGSCR IINIKIGRVG GLAEAKRIHD ICKENDIPVW CGGMLEAGVG RAHNIAITTL DNFTLPGDTA ASSHYWTKDI IVPEVTVHHG TITVPEKPGI GYDVDRQQVD IYTSYAKMYH A
|
| |
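The sequences above are shown in space-separated blocks of ten characters, so they need to be cleaned before any computation. A minimal sketch of that step, together with a GC fraction calculation comparable to the GC content field, is shown below. The helper names are hypothetical, and the database may round or compute its GC value differently.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence in this record.
example = "ATGGGAATAA AAGTGAAACG"
print(clean_sequence(example))  # ATGGGAATAAAAGTGAAACG
print(gc_fraction(example))     # 0.35
```

Passing the full Gene sequence value to `gc_fraction` gives a figure that can be compared against the reported GC content.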