Gene Hhal_0971 details


Gene Information

Locus tag: Hhal_0971
Symbol:
ID: 4709490
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1043301
End bp: 1044527
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 73%
IMG OID: 639855440
Product: putative oxygen-independent coproporphyrinogen III oxidase
Protein accession: YP_001002549
Protein GI: 121997762
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase
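
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names `span_length` and `coding_codons` are hypothetical helpers introduced for illustration, not part of any database API.

```python
# Values copied from the gene record.
START_BP = 1043301
END_BP = 1044527
GENE_LENGTH_BP = 1227
PROTEIN_LENGTH_AA = 408


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))    # 1227
print(coding_codons(GENE_LENGTH_BP))    # 408
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.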


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCTCG GCGCCGGGGC GCAGCTGCCG CCGCTGGCGG TCTATATCCA TCTGCCCTGG 
TGTCTGCAGC GCTGCGGCTA CTGCGACTTC AACGCCCACG CCCTGCGCGG AGAGCTCCCC
GCCCGGGCCT ACGTGGACGC CCTGGGGCGC GAGCTGCGCC GCCAGGCCCC GGCGGCGCTC
GGGCGCCGCG TGACTAGCGT CTTCATCGGG GGCGGCACAC CGAGTCTCTT CCCGCCCGGT
CCGATCGGCG AGCTGCTCGA CACCCTGGAC GCGGTGCTGG GCCTGGAGTC GGGGGCAGAG
ATCACCCTGG AGGCCAACCC AGGCGCCAGC GAGAACGCGC GGCTGCGGGG GTTCCGGCAG
GCCGGGGTGA CCCGTCTGTC CCTGGGGGTG CAGAGCCTGG ACGACACGCT GCTGGCCCGC
CTGGGGCGGA TCCACGACGC CGTCGCCGCG CGCGAGGCCG TGGCTGAGGC GTATGCCGTC
GGGTTCCGCG GTATCAACGC CGACGTCATG TACGGCCTGC CCGGGCAGGG GGTCACCGGG
GCCGAGGCCG ACGTGGCCGG AGTCATCGCG CTGGGGGCCG ACCACGTCAG CCATTATCAG
CTGACCATCG AGCCGGAGAC GCCCTTTGGG CGCCGCCCAC CTCCGGGTCT GCCCGACGAG
GAGACGGTGC TGGAGATGGA GGCGGTGTGC CGGCAGCGGC TGGCGCAGGC CGGTCTGGAG
CGTTACGAGG TCTCGGCCTT CGCCCGCCCC GGGCAGCGCA GCGTCCACAA TCTCGGTTAC
TGGACCTTCG GCGACTACCT GGGCCTGGGC GCCGGGGCGG CCGGCAAGCG GACCGGCGCC
GACGGCCGGG TGCTGCGCAC CCGCCAGCGC CGCTCGCCGC GGGCCTGGAT GGCCGCCGTC
GGCAGCGCCC GGGTCGAGGC TGAGTGTATT GAGCTGACAC CGTCGGAGCA GGCCTTTGAG
GTCCTGCTCA ACGGGCTGCG CCTGCGTGAG GGGCTTCCGG AACGACTGGC GGTGGCGCGC
AGTGGCTGCA GCCTGCCGGC ATTGCGCGAC TGGCTGGCGC CCCTGTGTGC CGGCGGCTGG
CTGGAGTGGC GTGGCGGGCG GATTCGGGCC AGCGCCGCCG GCTACGAGAT GCTCGATACC
CTGCTGCTCG AGCTGCTCCC CGGCCCGCCG GATCCGGGTT CCGGGGGCAG TGCACCTTCC
TCGCACGGCG GCAACGCGCT AAAATAA
 
Protein sequence
MSLGAGAQLP PLAVYIHLPW CLQRCGYCDF NAHALRGELP ARAYVDALGR ELRRQAPAAL 
GRRVTSVFIG GGTPSLFPPG PIGELLDTLD AVLGLESGAE ITLEANPGAS ENARLRGFRQ
AGVTRLSLGV QSLDDTLLAR LGRIHDAVAA REAVAEAYAV GFRGINADVM YGLPGQGVTG
AEADVAGVIA LGADHVSHYQ LTIEPETPFG RRPPPGLPDE ETVLEMEAVC RQRLAQAGLE
RYEVSAFARP GQRSVHNLGY WTFGDYLGLG AGAAGKRTGA DGRVLRTRQR RSPRAWMAAV
GSARVEAECI ELTPSEQAFE VLLNGLRLRE GLPERLAVAR SGCSLPALRD WLAPLCAGGW
LEWRGGRIRA SAAGYEMLDT LLLELLPGPP DPGSGGSAPS SHGGNALK
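
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the pipeline used to produce the record. It uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in the alternative start codons it permits, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks and the final line of the gene sequence.
print(translate("ATGAGCCTCG GCGCCGGGGC GCAGCTGCCG"))  # MSLGAGAQLP
print(translate("TCGCACGGCG GCAACGCGCT AAAATAA"))     # SHGGNALK
print(gc_fraction("ATGAGCCTCG"))                      # 0.6
```

The two translated fragments match the start and end of the protein sequence listed in the record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.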