Gene GWCH70_1283 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1283 
Symbol 
ID7976064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1333743 
End bp1335017 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content57% 
IMG OID644798227 
Productimidazolonepropionase 
Protein accessionYP_002949400 
Protein GI239826776 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID[TIGR01224] imidazolonepropionase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000151927 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCCCAC TGTTTATCCG CAACGCCAGC CAGCTCGTGA CGCTGGCCGG CAGCTCCACG 
GCCCCGCTTG TGAAGGAGAA AATGAACGAA CTTCATATCA TTGAAAATGG CAGCGTCTGG
GTGGAAGACG GAAAAATTGC CGCTGTTGGA ACGGACGAGG AGCTTTCGCA GCAATTTCAA
GAGCGAATCG CGGAAGCGGA GATCGTCGAT GCGACAGGGA AAACGGTGAC ACCGGGGCTT
GTCGATCCGC ACACGCATTT CGTATATGCG GGAAGCCGCG AAAGCGAATT CGCGATGCGT
CTTAGCGGGG CGACATACAT GGAAATAATG AACGCCGGCG GCGGTATTCA CGCGACGACA
AAAGCAACTC GGGAAGCATC GAAAGAAACA TTGTATGAAG AAAGCAAGCG GCGGCTCGAT
CAGTTTTTGC TTCACGGCGT CACGACCGTG GAGGCGAAAA GCGGCTATGG CTTGAGTATT
GAGCACGAAG TCAAACAGCT GACGGTGGCG AAACAGCTCG ATGAAACCCA TCCCGTCGAT
GTCGTGTCCA CGTTTATGGG AGCGCATGCC GTACCCGCCG AGTGGAAAGA CAATCCTGAC
GGCTTTGTCC GCGTCATCGT TGAAGAGATG ATTCCGAAAG TAAGCGAGCT CGGGCTTGCC
GAATTTAATG ACGTCTTTTG CGAACGCGGC GTGTTCACTC CAGAACAGGC AAGAATCATT
TTAGAGGCAG GAAAAGCGTA CGGGCTGATG CCGAAAATTC ATGCCGATGA AATCGAGCCA
TACGGCGGCG CGGAGCTGGC CGCGGAAGTC GGGGCGGTTT CCGCCGACCA TCTCCTACGC
GCTTCGGACG AAGGCATTCG CCGCATGGCG GAAAAAGGAG TGATTGCGGT GCTGCTGCCG
GGCACGGCGT TTTTCCTGAT GACCAAGGCC GCCAATGCCC GCAAGATCAT CGACGCCGGC
GCAGCGGTCG CGCTTTCCAC CGACTGCAAT CCCGGCTCCT CGCCAACCGT ATCGCTCCCG
CTGATCATGA ACCTCGGCTG CCTGCAGATG GGCATGACCC CTGCCGAAGC GCTGGCGGCC
GTCACGATCA ACGCCGCGCA CGCGATCAAC CGCGGCCACG AAATCGGAAG CATTGAAGTC
GGGAAAAAAG CCGATTTGGT CCTTTTCGAC GTCCCGAATT ATATGCAGCT CATCTACCAT
TACGGCATGA ACCATACCGA TACAGTCGTG AAAAACGGCC GGGTGGTGGT GAAAAGCGGG
AGGCTTTGCT ACTAG
 
Protein sequence
MRPLFIRNAS QLVTLAGSST APLVKEKMNE LHIIENGSVW VEDGKIAAVG TDEELSQQFQ 
ERIAEAEIVD ATGKTVTPGL VDPHTHFVYA GSRESEFAMR LSGATYMEIM NAGGGIHATT
KATREASKET LYEESKRRLD QFLLHGVTTV EAKSGYGLSI EHEVKQLTVA KQLDETHPVD
VVSTFMGAHA VPAEWKDNPD GFVRVIVEEM IPKVSELGLA EFNDVFCERG VFTPEQARII
LEAGKAYGLM PKIHADEIEP YGGAELAAEV GAVSADHLLR ASDEGIRRMA EKGVIAVLLP
GTAFFLMTKA ANARKIIDAG AAVALSTDCN PGSSPTVSLP LIMNLGCLQM GMTPAEALAA
VTINAAHAIN RGHEIGSIEV GKKADLVLFD VPNYMQLIYH YGMNHTDTVV KNGRVVVKSG
RLCY