Gene Hhal_0019 details

Gene Information

Locus tag: Hhal_0019
Symbol:
ID: 4710197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 19722
End bp: 21116
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 62%
IMG OID: 639854475
Product: oxygen-independent coproporphyrinogen III oxidase
Protein accession: YP_001001616
Protein GI: 121996829
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00538] oxygen-independent coproporphyrinogen III oxidase
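
The start, end, gene length and protein length reported above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 19722
END_BP = 21116


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Amino-acid count, assuming the CDS ends with one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1395, matching "Gene Length"
    print(protein_length(length))  # 464, matching "Protein Length"
```

Under these assumptions, 1395 bp corresponds to 465 codons, of which 464 encode amino acids and one is the stop codon.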


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACAAC GTCTTCAATT CGATCGCGAC CTGCTCCGCC GGTACGATGT AAGCGGGCCC 
CGTTACACCT CGTACCCGAC GGCGCCGCAA TTCCATGAAG GGTTTGATGC CCAGGCCTAC
GCCGCGGCAG CCCGGGCGAC CAACGAGGCG GACCAGCCCA ATCCCCTGTC GGTCTACGTC
CACGTACCGT TCTGCCGTAA CGTGTGCTTC TACTGCGCGT GCAACAAGAT CGTGACGGCG
AACTACAGCC GCGCCCAGGA GTATCTTGAG CACGTCTTCA AGGAGATCGA GCTGCAGGCG
CAGCTCTTCG GCGAACACCG TCGCGTTGAG CAGCTCCACC TCGGGGGCGG CACGCCGAAT
TACCTCAAGA TCGACGATCT GGGGCGCCTG GTCAGCAAGC TGCGCGAGCA GTTCACCCTC
GACGATACGG ACAACCGCGA GTTCTCTATC GAGATTGACC CGCGAGACGT CGAGCTTGAG
GACATCGGTC GCCTGGCGGA ACTCGGTTTC AATCGTATGA GCGTCGGCGT GCAGGACTTC
AACGAGGAGG TCCAGCACGC GGTCAACCGT GTCCAGAGCG CGGAACTGTG CCGCTCGATC
ATTGAGGAGG GGCGCCGCCA CGGCTTCCGC TCGACCAACG TCGACCTCAT CTACGGTCTG
CCGAAGCAGA CGGTGGAATC CTTCGAGCAG ACCCTCGACG AGGTCATCGA GCTGCGCCCC
GAGCGCCTGG CCATCTATAA CTACGCTCAC CTCCCGCATC TATTCAAGAT CCAGCGGCAG
ATCAACGAGG ATGAGCTCCC AGGCCCTGAG GACAAGCTCA CCATCTTCGG GCGTACCATC
GAGAAGCTCA CCGATGCCGG CTACGTCTTC ATCGGTATGG ACCACTTCGC CCTGCCCGAT
GACGAGCTGG CCGTTGCGCA GAAGAAGGGA ACCCTGCACC GGAACTTCCA GGGCTACTCC
ACGCGCGCGG AGTGCGACCT CATCGCACTG GGATCCACCT CCATCGGCAA GATCGGCAAC
ACCTACAGTC AGAACCTGCG GGATCCCGAG GAATACCAGC AGCGCATTGC CAACGGCGAT
CTGGCGGTGT TCCGTGGTTA CGAGCTCAAC CAGGACGATC TGCTGCGCCG GGAAGTGATC
ATCGAGGTCA TGTGCCACTC CCGCCTGAAC TTCGCCGACA TCGAGGCGCG GCACGGCATC
GACTTCAATG AGTACTTCGC TGACGCCCTG GAGCGCCTCC AGCCCCTGGT GGAGGACGGT
CTGATGGAGA TGGACGACCG CCACCTGCAG ATCCTGCCGC GGGGGCGCCT GATGCTCCGC
CACGTCGCCA TGGCCTTCGA TGCCTACCTG GAGCGCGAAG CCAATGAAGG CAAGCGCTAT
AGCCGGGTGT TGTAA
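
The GC content reported in the gene information can, in principle, be recomputed from the sequence above. The sketch below is one way to do that, assuming the sequence is pasted as text in the 10-base blocks shown here; `clean_sequence` and `gc_fraction` are hypothetical helper names. Only the first line of the gene is used as sample input, so the printed value is not expected to equal the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, used here only as sample input.
    first_line = (
        "ATGGAACAAC GTCTTCAATT CGATCGCGAC "
        "CTGCTCCGCC GGTACGATGT AAGCGGGCCC"
    )
    print(f"{gc_fraction(first_line):.1%}")
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content listed in the record.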
 
Protein sequence
MEQRLQFDRD LLRRYDVSGP RYTSYPTAPQ FHEGFDAQAY AAAARATNEA DQPNPLSVYV 
HVPFCRNVCF YCACNKIVTA NYSRAQEYLE HVFKEIELQA QLFGEHRRVE QLHLGGGTPN
YLKIDDLGRL VSKLREQFTL DDTDNREFSI EIDPRDVELE DIGRLAELGF NRMSVGVQDF
NEEVQHAVNR VQSAELCRSI IEEGRRHGFR STNVDLIYGL PKQTVESFEQ TLDEVIELRP
ERLAIYNYAH LPHLFKIQRQ INEDELPGPE DKLTIFGRTI EKLTDAGYVF IGMDHFALPD
DELAVAQKKG TLHRNFQGYS TRAECDLIAL GSTSIGKIGN TYSQNLRDPE EYQQRIANGD
LAVFRGYELN QDDLLRREVI IEVMCHSRLN FADIEARHGI DFNEYFADAL ERLQPLVEDG
LMEMDDRHLQ ILPRGRLMLR HVAMAFDAYL EREANEGKRY SRVL
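
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes translation table 11, whose codon-to-amino-acid assignments match the standard genetic code (the tables differ in which codons may act as start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are hypothetical, and unknown codons are rendered as `X` by assumption.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence.
    print(translate("ATGGAACAAC GTCTTCAATT CGATCGCGAC"))  # MEQRLQFDRD
    # Last 15 bases of the gene sequence, including the TAA stop codon.
    print(translate("AGCCGGGTGT TGTAA"))  # SRVL
```

The two sample outputs correspond to the first ten and last four residues of the protein sequence listed above.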