Gene Hhal_1058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1058 
Symbol 
ID4709824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1146729 
End bp1147823 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content69% 
IMG OID639855529 
Producturoporphyrinogen decarboxylase 
Protein accessionYP_001002636 
Protein GI121997849 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID[TIGR01463] methyltransferase, MtaA/CmuA family
[TIGR01464] uroporphyrinogen decarboxylase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCAGCT CTGAGCTCAA GAACGATCGA ATCCTGCGCG CTTTCCAGCG CCAGCCCGTG 
GACCGGACCC CGGTCTGGAT GATGCGCCAG GCCGGCCGCT ATCTGGCCGA ATATCGAGAG
GTGCGGGCCC AGGCCGGCAG CTTCATGGGC CTGTGCCGCA GCCCCGAGCT GGCGGCCCGG
GTGACCATGC AGCCGCTGGA GCGCTACGAG CTCGACGCCG CCATTCTCTT CTCGGACATC
CTCACCATCC CCGAGGCCAT GGGGCTCGGC CTGAACTTCG TCACCGGCGA GGGGCCGGTC
TTCGAGCACC GGGTCAAGAC CGCCGCCGAC ATCGACCGGC TGCCCCAGCC CTCGGCCCAG
AAAGAGCTGC GCTACGTCAT GGATGCGGTG GCAGCCTGCC GCAAGGAGCT GAACGGACAG
GTGCCGCTGA TCGGCTTCAC CGGCAGCCCG TGGACCCTGG CCACTTACAT GATCGAAGGC
GGCTCGAGCA AGACCTTCGC CGCCAGCAAG AGTCTGCTCT TCAACGAGCC GGAGGCCGCG
CACCGGCTGA TGGCCAAGCT CGCCGACACC GTGGCCGACT ACCTCAACGG CCAGGTAGAG
GCTGGCGCGC AGGCGCTGAT GATCTTTGAC ACCTGGGGCG GGGCCCTGGA TCCGGTGCGT
TACCGGGAGT TCTCGCTGGC CTATATGCAG CGCATCCTCG AGCAACTCCC CCGCGAGCGC
GAGGGGCGTC GTATCCCGGT CACCCTGTTC ACCAAGGGCG GCGGCCAGTG GCTGGAGGAT
ATCGCCGACA CCGGCTGTGA CGGCGTCGGC CTCGACTGGA CGACCTCGCT GGCCGACGCC
CGGCGCCGGA TCGGCGGCCG GGTGGCCCTG CAGGGGAACC TCGATCCGTG CATGCTCCAC
GCCAACCCCG AGGTCATCCG CCGCGAGGTG GCCCGCTGCC TGGAAGAGTT CGGCCACGGT
CCGGGCCACG TGTTCAACCT TGGCCACGGC ATCCAGCCGG AGACGCCGCC GGAGAATGTC
GATGCCATGA TCCGGGCCCT CCACGAACTC TCGCCGGCCT ACCATGACGC AACGGCCACC
TCGGCCACGT CGTAG
 
Protein sequence
MSSSELKNDR ILRAFQRQPV DRTPVWMMRQ AGRYLAEYRE VRAQAGSFMG LCRSPELAAR 
VTMQPLERYE LDAAILFSDI LTIPEAMGLG LNFVTGEGPV FEHRVKTAAD IDRLPQPSAQ
KELRYVMDAV AACRKELNGQ VPLIGFTGSP WTLATYMIEG GSSKTFAASK SLLFNEPEAA
HRLMAKLADT VADYLNGQVE AGAQALMIFD TWGGALDPVR YREFSLAYMQ RILEQLPRER
EGRRIPVTLF TKGGGQWLED IADTGCDGVG LDWTTSLADA RRRIGGRVAL QGNLDPCMLH
ANPEVIRREV ARCLEEFGHG PGHVFNLGHG IQPETPPENV DAMIRALHEL SPAYHDATAT
SATS