Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0019 |
Symbol | |
ID | 4710197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 19722 |
End bp | 21116 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639854475 |
Product | oxygen-independent coproporphyrinogen III oxidase |
Protein accession | YP_001001616 |
Protein GI | 121996829 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | [TIGR00538] oxygen-independent coproporphyrinogen III oxidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACAAC GTCTTCAATT CGATCGCGAC CTGCTCCGCC GGTACGATGT AAGCGGGCCC CGTTACACCT CGTACCCGAC GGCGCCGCAA TTCCATGAAG GGTTTGATGC CCAGGCCTAC GCCGCGGCAG CCCGGGCGAC CAACGAGGCG GACCAGCCCA ATCCCCTGTC GGTCTACGTC CACGTACCGT TCTGCCGTAA CGTGTGCTTC TACTGCGCGT GCAACAAGAT CGTGACGGCG AACTACAGCC GCGCCCAGGA GTATCTTGAG CACGTCTTCA AGGAGATCGA GCTGCAGGCG CAGCTCTTCG GCGAACACCG TCGCGTTGAG CAGCTCCACC TCGGGGGCGG CACGCCGAAT TACCTCAAGA TCGACGATCT GGGGCGCCTG GTCAGCAAGC TGCGCGAGCA GTTCACCCTC GACGATACGG ACAACCGCGA GTTCTCTATC GAGATTGACC CGCGAGACGT CGAGCTTGAG GACATCGGTC GCCTGGCGGA ACTCGGTTTC AATCGTATGA GCGTCGGCGT GCAGGACTTC AACGAGGAGG TCCAGCACGC GGTCAACCGT GTCCAGAGCG CGGAACTGTG CCGCTCGATC ATTGAGGAGG GGCGCCGCCA CGGCTTCCGC TCGACCAACG TCGACCTCAT CTACGGTCTG CCGAAGCAGA CGGTGGAATC CTTCGAGCAG ACCCTCGACG AGGTCATCGA GCTGCGCCCC GAGCGCCTGG CCATCTATAA CTACGCTCAC CTCCCGCATC TATTCAAGAT CCAGCGGCAG ATCAACGAGG ATGAGCTCCC AGGCCCTGAG GACAAGCTCA CCATCTTCGG GCGTACCATC GAGAAGCTCA CCGATGCCGG CTACGTCTTC ATCGGTATGG ACCACTTCGC CCTGCCCGAT GACGAGCTGG CCGTTGCGCA GAAGAAGGGA ACCCTGCACC GGAACTTCCA GGGCTACTCC ACGCGCGCGG AGTGCGACCT CATCGCACTG GGATCCACCT CCATCGGCAA GATCGGCAAC ACCTACAGTC AGAACCTGCG GGATCCCGAG GAATACCAGC AGCGCATTGC CAACGGCGAT CTGGCGGTGT TCCGTGGTTA CGAGCTCAAC CAGGACGATC TGCTGCGCCG GGAAGTGATC ATCGAGGTCA TGTGCCACTC CCGCCTGAAC TTCGCCGACA TCGAGGCGCG GCACGGCATC GACTTCAATG AGTACTTCGC TGACGCCCTG GAGCGCCTCC AGCCCCTGGT GGAGGACGGT CTGATGGAGA TGGACGACCG CCACCTGCAG ATCCTGCCGC GGGGGCGCCT GATGCTCCGC CACGTCGCCA TGGCCTTCGA TGCCTACCTG GAGCGCGAAG CCAATGAAGG CAAGCGCTAT AGCCGGGTGT TGTAA
|
Protein sequence | MEQRLQFDRD LLRRYDVSGP RYTSYPTAPQ FHEGFDAQAY AAAARATNEA DQPNPLSVYV HVPFCRNVCF YCACNKIVTA NYSRAQEYLE HVFKEIELQA QLFGEHRRVE QLHLGGGTPN YLKIDDLGRL VSKLREQFTL DDTDNREFSI EIDPRDVELE DIGRLAELGF NRMSVGVQDF NEEVQHAVNR VQSAELCRSI IEEGRRHGFR STNVDLIYGL PKQTVESFEQ TLDEVIELRP ERLAIYNYAH LPHLFKIQRQ INEDELPGPE DKLTIFGRTI EKLTDAGYVF IGMDHFALPD DELAVAQKKG TLHRNFQGYS TRAECDLIAL GSTSIGKIGN TYSQNLRDPE EYQQRIANGD LAVFRGYELN QDDLLRREVI IEVMCHSRLN FADIEARHGI DFNEYFADAL ERLQPLVEDG LMEMDDRHLQ ILPRGRLMLR HVAMAFDAYL EREANEGKRY SRVL
|
| |