Gene Hhal_1933 details


Gene Information

Locus tag: Hhal_1933
Symbol:
ID: 4710778
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2130459
End bp: 2131946
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 73%
IMG OID: 639856406
Product: uroporphyrin-III C-methyltransferase
Protein accession: YP_001003499
Protein GI: 121998712
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0007] Uroporphyrinogen-III methylase
        [COG1648] Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain)
TIGRFAM ID: [TIGR01469] uroporphyrin-III C-methyltransferase
            [TIGR01470] siroheme synthase, N-terminal domain
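
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2130459
end_bp = 2131946
gene_length_bp = 1488
protein_length_aa = 495

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span)                     # 1488
print(expected_protein_length)  # 495
```

Under those assumptions, both derived values agree with the reported Gene Length and Protein Length.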


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCACT ACCCGATCTT CCTCAACCTC CACGGCCGGC ACTGCGTGGT GATCGGCGGC 
AACGAGACCG CCGCGCGCAA GGGCGAAGAC CTGCTCGACA GTGGCGCGAT CATCACCCTG
ATCGCCCCGG ATCTCGGCGG AGACTGTGAG GATTTGCTGC AGCGCTACCC CGACCGGGCC
CACCACCGCG CCGAGGACTA CAAGCCCGGC ATGGAGCAGG GGGCCGCGCT GGTGCTCAGT
GCCAGCGGCC ACGACGCCAC CGACCGGCTG GTGTACCGGC AGTGCACCCG GCTGGGCATC
CCGGTTAACA CCGTGGACCG CCCCGAGTAC TGCAGCTACA TCACCCCAGC GGTGGTCGAC
CGCTCTCCCC TACAGGTGGC CATCACCAGC GGGGGCGCCG CCCCGGTGCT GGCCCGTCAG
GTGCGCAGCC AGATAGAGAC GCTGCTGCCC ACCGCCTACG GGCGGCTGGC CGCCCTCGCC
GGGCGTCTGC GCGAGCGCGT GGCCGCCGTC CTGCCCACCG GCCGACAACG GCTGCGCTTC
TGGGAGCAGG TCTTCGACGG CCCGGCCGCC GAGTCGATGC TGGCCGGACG GGAACGGGAG
GCCGAACAGG CCATGCTGGA GCTACTGCGC CGGGAGCAGG CGCGCCGCGA CGAGCGCGGC
GAGGTCTATC TGGTCGGTGC CGGCCCCGGC GACCCCGACC TGCTGACTTT CCGCGCGCTG
CGCCTGATGC AGCGGGCCGA TGTGGTGCTC TACGACCACC TGGCCGCACC CGGCCTGCTG
CGCCTGGTGC GCAAGGATGC CGAGCGGATC CCCGTCGGCA AGCGCCGCGG TCAGCACACC
CTCCCCCAGG AGGCGATCAA CGACAAGCTC ATCGAACTGG CCGCCGCCGG CAAACGGGTA
CTGCGGCTCA AGGGCGGCGA TCCGTTCATC TTCGGTCGCG GCGGCGAGGA GATCGAGGGG
CTGATCGAGC ACGGCATCCC CTTCCAGGTC GTGCCCGCGG TGACCGCTGC CCAGGGCGCG
GCGGCCTACG CCGGCATCCC CCTGACCCAC CGGGACCACG CCCAGAGCTG TCGCTTCCTG
ACCGGCCACC GCCGCCACGG CGCCCTGGAA CTGGGCCAGT GGGCGCCGTT CCGCAGCGAC
GAGACGCTGG TGGTCTACAT GGGGCTGACC CACCTGGAGA CGGTGAGTGC CCAGCTGCAG
GCGGGGGGGC TACCGCCGGA CCAGCCGGCC GCCGCCGTCG ATCAGGCCAC CACCCCGGCC
CAGCGGGTGA TCACCGCCCC CCTGGCCGAG CTGCCGGAGC GGGTCCGCAC GGCCCGCCTC
CAGGGCCCGG CGCTGATCGT GGTCGGCGCC ACGGTCACCC TCCAGCCACA GCTGGGCTGG
TACCACAGCT CCCCCAACGC CGAACCCGCC TTCCCGGAGC ACGGCTGCCT GCGCGGCGAG
CCGCGGCCGA CCCGCCACCC GGCACCGGCA GACACCGAGC AGGCCTGA
 
Protein sequence
MDHYPIFLNL HGRHCVVIGG NETAARKGED LLDSGAIITL IAPDLGGDCE DLLQRYPDRA 
HHRAEDYKPG MEQGAALVLS ASGHDATDRL VYRQCTRLGI PVNTVDRPEY CSYITPAVVD
RSPLQVAITS GGAAPVLARQ VRSQIETLLP TAYGRLAALA GRLRERVAAV LPTGRQRLRF
WEQVFDGPAA ESMLAGRERE AEQAMLELLR REQARRDERG EVYLVGAGPG DPDLLTFRAL
RLMQRADVVL YDHLAAPGLL RLVRKDAERI PVGKRRGQHT LPQEAINDKL IELAAAGKRV
LRLKGGDPFI FGRGGEEIEG LIEHGIPFQV VPAVTAAQGA AAYAGIPLTH RDHAQSCRFL
TGHRRHGALE LGQWAPFRSD ETLVVYMGLT HLETVSAQLQ AGGLPPDQPA AAVDQATTPA
QRVITAPLAE LPERVRTARL QGPALIVVGA TVTLQPQLGW YHSSPNAEPA FPEHGCLRGE
PRPTRHPAPA DTEQA
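
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a listing could be processed, the Python below strips the whitespace, computes a GC fraction, and translates codons up to the first stop. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard genetic code written in NCBI's TCAG order; it is assumed here that translation table 11 assigns the same amino acids to each codon and differs only in which start codons are permitted.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
# Assumption: table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean("ATGGATCACT ACCCGATCTT CCTCAACCTC")
print(translate(first_blocks))    # MDHYPIFLNL
print(gc_fraction(first_blocks))  # 0.5
```

The translated fragment matches the first ten residues of the protein sequence listed above. Pasting the full gene sequence into `clean` would allow the same check against the whole protein and the reported GC content.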