Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_3171 |
Symbol | |
ID | 7977024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 3199699 |
End bp | 3200769 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644799956 |
Product | Mandelate racemase/muconate lactonizing protein |
Protein accession | YP_002951095 |
Protein GI | 239828471 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR01928] o-succinylbenzoic acid (OSB) synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 45 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAA AAGACGCAAC CATCGCCACA CAATCGACTC CATTAATAAA ACCTTTCAAA ACGGCATTGC GGACAGCTAC GCAAATTGAA AGCATCGTCG TAAAAATCAC GCTAGATAAC GGAATTGAAG GGTATGGGGC CGCCGTCCCG ACCGAAGCCA TCACGGGAGA AACGAAACAA GGAATCATAG GTATTTTAGA AAATGTACTC ATCCCTAAAA TCATCGGTAA AGAAATCGAA GAAATAGCAA AAAACAGTAA AGATATTCAA ACTAGTTGCA TCGAAAACAC AAGCGCGAAA GCAGCATTAG AAATGGCCAT GTACGATGCC CTTTGCAAAC TACTAAACAT TCCTCTCTAT CAATTATTCG GAGGAAAGAC GAACCGTCAT GTCAACGATA TGACAATTAG CGTAAATAGT GTAGAAGAAA TGGTCAATGA CGCAAAAAAA GTCACAGAAA AAGGCTTTTC GATTTTAAAA ATTAAAGTAG GAAAAGAAGC CGAAAAGGAT ATCGAACGAA TCGAACGAAT TTATGAAGAA GTGGGGCCAA ATGTTTCACT GCGCATTGAC GCTAATCAAG GATGGACAGC GAAAGAAGCT GTGGAAATCA TTCAAACGTT AGAACGGCTC CAACTTCCGA TCGAATTTAT TGAGCAGCCC GTTCCTAAAT ATGACATAAA AGGTCTTCAG TTTATCCGAG AACGAGTCAA CATACCGATT ATGGCGGACG AAAGTGTATT CTCGGCTCGA GATGCACTAG AACTGATCCG TCATCACGCT GTCGATTTGA TCAATATTAA GTTAATGAAA ACGGGAGGAT TACGGGAAGC CTACAAGATT GCTAGTCTAG CAGAAGCGGC CGGTATCGAA TGCATGATCG GAAGCATGAT GGAACCAACT CTTTCCGTAC TGGCAGCAAC CCATTTAGCA ATTGCCCATC CGAACATTAC AAAAGTCGAT TTAGATGCGC CTCTATGGAT AGATGATGAT AGCAGTCGCT CGTTCTTTCA AGGAAGCGAA ATTAACGTTC CTGATTTACC AGGGATCGGT TATGTTCCTT TATATAACTA A
|
Protein sequence | MKIKDATIAT QSTPLIKPFK TALRTATQIE SIVVKITLDN GIEGYGAAVP TEAITGETKQ GIIGILENVL IPKIIGKEIE EIAKNSKDIQ TSCIENTSAK AALEMAMYDA LCKLLNIPLY QLFGGKTNRH VNDMTISVNS VEEMVNDAKK VTEKGFSILK IKVGKEAEKD IERIERIYEE VGPNVSLRID ANQGWTAKEA VEIIQTLERL QLPIEFIEQP VPKYDIKGLQ FIRERVNIPI MADESVFSAR DALELIRHHA VDLINIKLMK TGGLREAYKI ASLAEAAGIE CMIGSMMEPT LSVLAATHLA IAHPNITKVD LDAPLWIDDD SSRSFFQGSE INVPDLPGIG YVPLYN
|
| |