Gene Information |
Locus tag | GWCH70_2440 |
Symbol | |
ID | 7978999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2472874 |
End bp | 2474022 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 644799242 |
Product | coproporphyrinogen III oxidase |
Protein accession | YP_002950402 |
Protein GI | 239827778 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase |
|
|
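The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon, which is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length = gene_length_bp(2472874, 2474022)
print(length, protein_length_aa(length))  # prints "1149 382"
```

Both results match the Gene Length (1149 bp) and Protein Length (382 aa) rows.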
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000264436 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGTAAAAT CAGCTTATTT CCATATTCCG TTTTGTGCGC AAATATGTTA TTACTGCGAT TTTAATAAAG TATTTTTTCA TGGACAGCCA GTCGATGAAT ATTTAGAAGC AATGGAAAAT GAAATGAAAT GGACGGTGGA AGCATTTCCA ACGGATCGGC TTGATACGTT GTTTGTTGGC GGGGGAACCC CGACTGTTTT AGAAATGAAG CAGCTCGACT TTTTTCTGCA AAGCATTTAT AAGCATTTTC GCTTTTCTAT TCATGAAGTA GAGTTTACGT TTGAGGCAAA TCCGAATGAA CTTTCCAAAG AAAAACTACA GCTGTTAAAA GAGGCAGGGG TCAATCGCTT AAGTTTTGGT GTTCAAACGT TTGATGATTC ATTATTAAAA GCAATTGGAC GGACTCATCG TTATGAGGAT GTCATGAAAA CGATTGCCTT AGCAAAAGAA ATCGGCTTTG AAAATATTAG CATTGATTTA ATGTACGGGC TGCCGCAACA GACGTTAGCA CAGTTCCAAG CGGATTTAGA GATCGCCTTT TCTCTTGACA TTCAACATAT TTCTGCTTAT TCTCTCATCA TTGAACCGAA AACCATTTTT TACAATTTGA TGAGAAAAGG AAAACTGCCA TTACCAACTG AGGAAGAAGA AGCGCAAATG TATGAAGAAG CGATGCGGCA AATGGAAATA CACGGGTACC ACCAATATGA AATTAGTAAT TACGCGCGTC CTTTCTTTGA AAGCCGCCAT AACTTAACGT ATTGGAACAA TGAAGAATAT TATGGAATTG GTGCGGGCGC TCACAGCTAC GTAGGCGGCG TTCGGCGTGC GAATATCAAA CCGATCAATA AATATATTGA GACAGTTCAA GAAACAGGTT TCCCGTATTT GGAAGTTCAC CATGTAACGG TATCGGAACA AATGGAAGAA GAAATGTTTT TGGGGCTAAG GAAAACGGAA GGAGTATCTA AGCAGCGCTT TCTTGAAAAG TTTGGGATGA GCGTCCATGA TGTATTTGGC CGAGCGATCG CTGCGGAAAA ACAAAAAGGG CTGCTCGAAG AAACACAAAC ACATATTCGA TTGACTCATC GGGGGAAATT GCTAGGAAAC GAGGTGTTTC AAGCGTTTAT CGCGGAATCT AAACATTGA
|
Protein sequence | MVKSAYFHIP FCAQICYYCD FNKVFFHGQP VDEYLEAMEN EMKWTVEAFP TDRLDTLFVG GGTPTVLEMK QLDFFLQSIY KHFRFSIHEV EFTFEANPNE LSKEKLQLLK EAGVNRLSFG VQTFDDSLLK AIGRTHRYED VMKTIALAKE IGFENISIDL MYGLPQQTLA QFQADLEIAF SLDIQHISAY SLIIEPKTIF YNLMRKGKLP LPTEEEEAQM YEEAMRQMEI HGYHQYEISN YARPFFESRH NLTYWNNEEY YGIGAGAHSY VGGVRRANIK PINKYIETVQ ETGFPYLEVH HVTVSEQMEE EMFLGLRKTE GVSKQRFLEK FGMSVHDVFG RAIAAEKQKG LLEETQTHIR LTHRGKLLGN EVFQAFIAES KH
|
| |
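The gene sequence begins with TTG while the protein begins with M. This is expected under translation table 11, where TTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 nucleotides of the gene sequence. It is a hand-rolled illustration, not the method used to produce this record: the `translate` function and its `first_codon_as_met` parameter are hypothetical, and the set of alternative start codons is stated here as an assumption about table 11.

```python
from itertools import product

# Codon table for translation table 11, laid out in the usual T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiation codons for table 11; each is read as Met at position 1.
START_CODONS = {"ATG", "TTG", "CTG", "GTG", "ATT", "ATC", "ATA"}


def translate(dna: str, first_codon_as_met: bool = True) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing used in the record
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and first_codon_as_met and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("TTGGTAAAAT CAGCTTATTT CCATATTCCG"))  # prints "MVKSAYFHIP"
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to the same function should reproduce the whole protein if the assumptions hold.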