Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0824 |
Symbol | |
ID | 7977769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 888744 |
End bp | 889913 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644797799 |
Product | amidohydrolase |
Protein accession | YP_002948972 |
Protein GI | 239826348 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0305704 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGAAA TGATTCATGC GATGAAGGAA GAATTATGGG AGATTTTTGA CCACCTTCAT CGTCATCCGG AAGTTAGTTG GGAAGAGTGG AAAACGACGG AGTTTATTAA GCAGCAACTG ATTCAAGAAG GATATCGTGC CAACACGTTT TCCGATTGTC CGGGAGTGAT TGGGGAAATC GGCAGCGGGC CGTTTACAGT CGGATTGCGC AGCGATATGG ATGCGCTTTG GCAAGAAGTA AATGGAGCAT GGCAAGCAAA CCATGCATGT GGGCATGATG CACATATGAC AATGGTATTA GGGGTTGCTA AACTGTTTAA TCGCATCGGT TATAAGCCGC CTGGAACATT GAGGTTTCTT TTTCAACCAG CAGAAGAAAA AGGAACAGGA GCATTGAAGT TTTTGGAAAA AGGAGTTATC GATGATATTG ACTTTTTATA CGGTGTTCAT TTGCGGCCGA TTCAAGAAGT AAAAAGCGGT TATGCTGTTC CCGCGATTTT GCACGGAGCA GCGCAATGCA TTGATGGAGA AATTAAAGGA GTTGCCGCGC ATGCGGCAAG ACCACACCTT GGAGTCAATG TCATTGAAGT TGGGAGTGCG ATCGTACAAG AGCTGAGCAA AATCCATATT GATCCACAAG TGCCTGCATC GATCAAAATG ACAAGATTTC ATGCAGGAGA AAAAAATGCG AACATCATTC CAGATCATGC GGAATTTTCG CTTGATTTGC GAGCGCAAAC GAACGAGGCG ATGGAACAGT TGATAGAAGG ATTGAATCAC GTGGTGAAAG GGATTGCATC TATTTATGAT GCGGATATTC AGCTTCATTC AGGAGTTCGT ATTGCCGCTG CACGGCCGCA TCCACAAGCA CAGCAACTAA TGGAGCGTGC CATTGTTGCT ACGTTAGGGG AAGAAAAGTG TCTGCCGCCA GTAGTCACTT CAGGTGGGGA AGATTTCCAT TTTTATTCAT TGATGAAACC ACAGTTAAAA ACGACGATGC TAGGGCTTGG TTGTGATTTA AAACCGGGAT TGCACCATCC GCAAATGACG TTCCGCCGCG AAGATTTATT ATCTGGTATT GAAATATTAG CAAGAGTCAT TATGGAAACG TTTGAGCATT TTGCATCACG AGGGGAGACA GAAAGTGCGT ATCTCACTAC AAAAAATTGA
|
Protein sequence | MREMIHAMKE ELWEIFDHLH RHPEVSWEEW KTTEFIKQQL IQEGYRANTF SDCPGVIGEI GSGPFTVGLR SDMDALWQEV NGAWQANHAC GHDAHMTMVL GVAKLFNRIG YKPPGTLRFL FQPAEEKGTG ALKFLEKGVI DDIDFLYGVH LRPIQEVKSG YAVPAILHGA AQCIDGEIKG VAAHAARPHL GVNVIEVGSA IVQELSKIHI DPQVPASIKM TRFHAGEKNA NIIPDHAEFS LDLRAQTNEA MEQLIEGLNH VVKGIASIYD ADIQLHSGVR IAAARPHPQA QQLMERAIVA TLGEEKCLPP VVTSGGEDFH FYSLMKPQLK TTMLGLGCDL KPGLHHPQMT FRREDLLSGI EILARVIMET FEHFASRGET ESAYLTTKN
|
| |