Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1283 |
Symbol | |
ID | 7976064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1333743 |
End bp | 1335017 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644798227 |
Product | imidazolonepropionase |
Protein accession | YP_002949400 |
Protein GI | 239826776 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000151927 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCCAC TGTTTATCCG CAACGCCAGC CAGCTCGTGA CGCTGGCCGG CAGCTCCACG GCCCCGCTTG TGAAGGAGAA AATGAACGAA CTTCATATCA TTGAAAATGG CAGCGTCTGG GTGGAAGACG GAAAAATTGC CGCTGTTGGA ACGGACGAGG AGCTTTCGCA GCAATTTCAA GAGCGAATCG CGGAAGCGGA GATCGTCGAT GCGACAGGGA AAACGGTGAC ACCGGGGCTT GTCGATCCGC ACACGCATTT CGTATATGCG GGAAGCCGCG AAAGCGAATT CGCGATGCGT CTTAGCGGGG CGACATACAT GGAAATAATG AACGCCGGCG GCGGTATTCA CGCGACGACA AAAGCAACTC GGGAAGCATC GAAAGAAACA TTGTATGAAG AAAGCAAGCG GCGGCTCGAT CAGTTTTTGC TTCACGGCGT CACGACCGTG GAGGCGAAAA GCGGCTATGG CTTGAGTATT GAGCACGAAG TCAAACAGCT GACGGTGGCG AAACAGCTCG ATGAAACCCA TCCCGTCGAT GTCGTGTCCA CGTTTATGGG AGCGCATGCC GTACCCGCCG AGTGGAAAGA CAATCCTGAC GGCTTTGTCC GCGTCATCGT TGAAGAGATG ATTCCGAAAG TAAGCGAGCT CGGGCTTGCC GAATTTAATG ACGTCTTTTG CGAACGCGGC GTGTTCACTC CAGAACAGGC AAGAATCATT TTAGAGGCAG GAAAAGCGTA CGGGCTGATG CCGAAAATTC ATGCCGATGA AATCGAGCCA TACGGCGGCG CGGAGCTGGC CGCGGAAGTC GGGGCGGTTT CCGCCGACCA TCTCCTACGC GCTTCGGACG AAGGCATTCG CCGCATGGCG GAAAAAGGAG TGATTGCGGT GCTGCTGCCG GGCACGGCGT TTTTCCTGAT GACCAAGGCC GCCAATGCCC GCAAGATCAT CGACGCCGGC GCAGCGGTCG CGCTTTCCAC CGACTGCAAT CCCGGCTCCT CGCCAACCGT ATCGCTCCCG CTGATCATGA ACCTCGGCTG CCTGCAGATG GGCATGACCC CTGCCGAAGC GCTGGCGGCC GTCACGATCA ACGCCGCGCA CGCGATCAAC CGCGGCCACG AAATCGGAAG CATTGAAGTC GGGAAAAAAG CCGATTTGGT CCTTTTCGAC GTCCCGAATT ATATGCAGCT CATCTACCAT TACGGCATGA ACCATACCGA TACAGTCGTG AAAAACGGCC GGGTGGTGGT GAAAAGCGGG AGGCTTTGCT ACTAG
|
Protein sequence | MRPLFIRNAS QLVTLAGSST APLVKEKMNE LHIIENGSVW VEDGKIAAVG TDEELSQQFQ ERIAEAEIVD ATGKTVTPGL VDPHTHFVYA GSRESEFAMR LSGATYMEIM NAGGGIHATT KATREASKET LYEESKRRLD QFLLHGVTTV EAKSGYGLSI EHEVKQLTVA KQLDETHPVD VVSTFMGAHA VPAEWKDNPD GFVRVIVEEM IPKVSELGLA EFNDVFCERG VFTPEQARII LEAGKAYGLM PKIHADEIEP YGGAELAAEV GAVSADHLLR ASDEGIRRMA EKGVIAVLLP GTAFFLMTKA ANARKIIDAG AAVALSTDCN PGSSPTVSLP LIMNLGCLQM GMTPAEALAA VTINAAHAIN RGHEIGSIEV GKKADLVLFD VPNYMQLIYH YGMNHTDTVV KNGRVVVKSG RLCY
|
| |