Gene GWCH70_2440 details


Gene Information

Locus tag: GWCH70_2440
Symbol:
ID: 7978999
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 2472874
End bp: 2474022
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 40%
IMG OID: 644799242
Product: coproporphyrinogen III oxidase
Protein accession: YP_002950402
Protein GI: 239827778
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase
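
The coordinate and length fields in this record are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 2472874
end_bp = 2474022

# With inclusive coordinates both end positions count, hence the +1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 nt; the final codon is assumed to be
# the stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1149
print(remainder)       # 0
print(protein_length)  # 382
```

Both results agree with the Gene Length (1149 bp) and Protein Length (382 aa) fields.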


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000264436
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGTAAAAT CAGCTTATTT CCATATTCCG TTTTGTGCGC AAATATGTTA TTACTGCGAT 
TTTAATAAAG TATTTTTTCA TGGACAGCCA GTCGATGAAT ATTTAGAAGC AATGGAAAAT
GAAATGAAAT GGACGGTGGA AGCATTTCCA ACGGATCGGC TTGATACGTT GTTTGTTGGC
GGGGGAACCC CGACTGTTTT AGAAATGAAG CAGCTCGACT TTTTTCTGCA AAGCATTTAT
AAGCATTTTC GCTTTTCTAT TCATGAAGTA GAGTTTACGT TTGAGGCAAA TCCGAATGAA
CTTTCCAAAG AAAAACTACA GCTGTTAAAA GAGGCAGGGG TCAATCGCTT AAGTTTTGGT
GTTCAAACGT TTGATGATTC ATTATTAAAA GCAATTGGAC GGACTCATCG TTATGAGGAT
GTCATGAAAA CGATTGCCTT AGCAAAAGAA ATCGGCTTTG AAAATATTAG CATTGATTTA
ATGTACGGGC TGCCGCAACA GACGTTAGCA CAGTTCCAAG CGGATTTAGA GATCGCCTTT
TCTCTTGACA TTCAACATAT TTCTGCTTAT TCTCTCATCA TTGAACCGAA AACCATTTTT
TACAATTTGA TGAGAAAAGG AAAACTGCCA TTACCAACTG AGGAAGAAGA AGCGCAAATG
TATGAAGAAG CGATGCGGCA AATGGAAATA CACGGGTACC ACCAATATGA AATTAGTAAT
TACGCGCGTC CTTTCTTTGA AAGCCGCCAT AACTTAACGT ATTGGAACAA TGAAGAATAT
TATGGAATTG GTGCGGGCGC TCACAGCTAC GTAGGCGGCG TTCGGCGTGC GAATATCAAA
CCGATCAATA AATATATTGA GACAGTTCAA GAAACAGGTT TCCCGTATTT GGAAGTTCAC
CATGTAACGG TATCGGAACA AATGGAAGAA GAAATGTTTT TGGGGCTAAG GAAAACGGAA
GGAGTATCTA AGCAGCGCTT TCTTGAAAAG TTTGGGATGA GCGTCCATGA TGTATTTGGC
CGAGCGATCG CTGCGGAAAA ACAAAAAGGG CTGCTCGAAG AAACACAAAC ACATATTCGA
TTGACTCATC GGGGGAAATT GCTAGGAAAC GAGGTGTTTC AAGCGTTTAT CGCGGAATCT
AAACATTGA
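
The gene sequence is printed in space-separated groups of ten bases, so the whitespace has to be removed before any computation. The sketch below shows one way to derive a GC percentage from such a block. The function name gc_percent is hypothetical, and how the database rounds the GC content field is not stated in this record, so the result may differ slightly from the reported value.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces, newlines) is removed first, so a block copied
    straight from the record can be passed in unchanged.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Tiny illustrative input, not taken from the record.
print(gc_percent("AT GC"))  # 50.0
```

Passing the full gene sequence block to gc_percent gives a value to compare with the GC content field.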
 
Protein sequence
MVKSAYFHIP FCAQICYYCD FNKVFFHGQP VDEYLEAMEN EMKWTVEAFP TDRLDTLFVG 
GGTPTVLEMK QLDFFLQSIY KHFRFSIHEV EFTFEANPNE LSKEKLQLLK EAGVNRLSFG
VQTFDDSLLK AIGRTHRYED VMKTIALAKE IGFENISIDL MYGLPQQTLA QFQADLEIAF
SLDIQHISAY SLIIEPKTIF YNLMRKGKLP LPTEEEEAQM YEEAMRQMEI HGYHQYEISN
YARPFFESRH NLTYWNNEEY YGIGAGAHSY VGGVRRANIK PINKYIETVQ ETGFPYLEVH
HVTVSEQMEE EMFLGLRKTE GVSKQRFLEK FGMSVHDVFG RAIAAEKQKG LLEETQTHIR
LTHRGKLLGN EVFQAFIAES KH
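
The gene begins with TTG, yet the protein begins with M. This follows from translation table 11 (the bacterial, archaeal and plant plastid code), in which TTG is an accepted start codon, and an initiating codon is translated as methionine. The following is a minimal pure-Python sketch of that rule, not the database's own pipeline. The helper name translate_cds is hypothetical. The codon table is the standard code, which table 11 shares for all non-initiating codons.

```python
# Standard genetic code in NCBI order (bases T, C, A, G); table 11
# uses the same amino-acid assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Start codons permitted by translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initial table-11 start codon as M.

    Whitespace is ignored; translation stops at the first stop codon.
    Only unambiguous bases (A, C, G, T) are supported in this sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS_TABLE_11:
            protein.append("M")  # initiator codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence from this record.
first_row = "TTGGTAAAAT CAGCTTATTT CCATATTCCG TTTTGTGCGC AAATATGTTA TTACTGCGAT"
print(translate_cds(first_row))  # MVKSAYFHIPFCAQICYYCD
```

The output matches the first 20 residues of the protein sequence above. Without the start-codon rule the first residue would be L (leucine) instead of M.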