Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0971 |
Symbol | |
ID | 4709490 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1043301 |
End bp | 1044527 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639855440 |
Product | putative oxygen-independent coproporphyrinogen III oxidase |
Protein accession | YP_001002549 |
Protein GI | 121997762 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCTCG GCGCCGGGGC GCAGCTGCCG CCGCTGGCGG TCTATATCCA TCTGCCCTGG TGTCTGCAGC GCTGCGGCTA CTGCGACTTC AACGCCCACG CCCTGCGCGG AGAGCTCCCC GCCCGGGCCT ACGTGGACGC CCTGGGGCGC GAGCTGCGCC GCCAGGCCCC GGCGGCGCTC GGGCGCCGCG TGACTAGCGT CTTCATCGGG GGCGGCACAC CGAGTCTCTT CCCGCCCGGT CCGATCGGCG AGCTGCTCGA CACCCTGGAC GCGGTGCTGG GCCTGGAGTC GGGGGCAGAG ATCACCCTGG AGGCCAACCC AGGCGCCAGC GAGAACGCGC GGCTGCGGGG GTTCCGGCAG GCCGGGGTGA CCCGTCTGTC CCTGGGGGTG CAGAGCCTGG ACGACACGCT GCTGGCCCGC CTGGGGCGGA TCCACGACGC CGTCGCCGCG CGCGAGGCCG TGGCTGAGGC GTATGCCGTC GGGTTCCGCG GTATCAACGC CGACGTCATG TACGGCCTGC CCGGGCAGGG GGTCACCGGG GCCGAGGCCG ACGTGGCCGG AGTCATCGCG CTGGGGGCCG ACCACGTCAG CCATTATCAG CTGACCATCG AGCCGGAGAC GCCCTTTGGG CGCCGCCCAC CTCCGGGTCT GCCCGACGAG GAGACGGTGC TGGAGATGGA GGCGGTGTGC CGGCAGCGGC TGGCGCAGGC CGGTCTGGAG CGTTACGAGG TCTCGGCCTT CGCCCGCCCC GGGCAGCGCA GCGTCCACAA TCTCGGTTAC TGGACCTTCG GCGACTACCT GGGCCTGGGC GCCGGGGCGG CCGGCAAGCG GACCGGCGCC GACGGCCGGG TGCTGCGCAC CCGCCAGCGC CGCTCGCCGC GGGCCTGGAT GGCCGCCGTC GGCAGCGCCC GGGTCGAGGC TGAGTGTATT GAGCTGACAC CGTCGGAGCA GGCCTTTGAG GTCCTGCTCA ACGGGCTGCG CCTGCGTGAG GGGCTTCCGG AACGACTGGC GGTGGCGCGC AGTGGCTGCA GCCTGCCGGC ATTGCGCGAC TGGCTGGCGC CCCTGTGTGC CGGCGGCTGG CTGGAGTGGC GTGGCGGGCG GATTCGGGCC AGCGCCGCCG GCTACGAGAT GCTCGATACC CTGCTGCTCG AGCTGCTCCC CGGCCCGCCG GATCCGGGTT CCGGGGGCAG TGCACCTTCC TCGCACGGCG GCAACGCGCT AAAATAA
|
Protein sequence | MSLGAGAQLP PLAVYIHLPW CLQRCGYCDF NAHALRGELP ARAYVDALGR ELRRQAPAAL GRRVTSVFIG GGTPSLFPPG PIGELLDTLD AVLGLESGAE ITLEANPGAS ENARLRGFRQ AGVTRLSLGV QSLDDTLLAR LGRIHDAVAA REAVAEAYAV GFRGINADVM YGLPGQGVTG AEADVAGVIA LGADHVSHYQ LTIEPETPFG RRPPPGLPDE ETVLEMEAVC RQRLAQAGLE RYEVSAFARP GQRSVHNLGY WTFGDYLGLG AGAAGKRTGA DGRVLRTRQR RSPRAWMAAV GSARVEAECI ELTPSEQAFE VLLNGLRLRE GLPERLAVAR SGCSLPALRD WLAPLCAGGW LEWRGGRIRA SAAGYEMLDT LLLELLPGPP DPGSGGSAPS SHGGNALK
|
| |