Gene Hhal_0547 details


Gene Information

Locus tag: Hhal_0547
Symbol:
ID: 4709731
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 615383
End bp: 617557
Gene Length: 2175 bp
Protein Length: 724 aa
Translation table: 11
GC content: 65%
IMG OID: 639855005
Product: catalase/peroxidase HPI
Protein accession: YP_001002135
Protein GI: 121997348
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0376] Catalase (peroxidase I)
TIGRFAM ID: [TIGR00198] catalase/peroxidase HPI
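
The coordinate and length fields in this record agree with one another. The following is a minimal Python sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(615383, 617557)
print(length, protein_length(length))  # 2175 724
```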


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.474968
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGATC AACAGAAGGC GGGCGGTTGC CCGGTGATGC ACGGTGCGAT GACCCAGGTC 
GGCGAGTCCA ACCTGGACAT GTGGCCCAAC GCCCTGAACC TCGACATCCT CCATCAGCAT
GATCGCAAGC CCGATCCGAT GGGTGAGGGG TTCAACTACC GGGAAGAGGT CAAGAAGCTC
GATCTCGACG CGGTCAAGCA AGACCTCCAC CAGCTGATGA CCGACAGCCA GGCGTGGTGG
CCGGCAGACT GGGGGCACTA CGGCGGGCTG ATGATCCGCA TGGCCTGGCA CGCCGCCGGC
ACCTACCGGG TGGCGGATGG CCGTGGCGGT GGCGGTACCG GCAATCAGCG CTTCGCCCCG
ATCAACAGCT GGCCGGACAA CGTCAACCTC GACAAGGCGC GGCGGCTGCT CTGGCCGATC
AAGAAGAAGT ACGGCAACCG GCTCAGCTGG GCCGACCTGA TCATCCTCGC CGGCAACGTG
GCCTACGAGT CCATGGGGCT GAAGACCTTC GGCTTCTCCC TGGGGCGCGA GGACATCTGG
CACCCGGAAA AGGATATCTA CTGGGGCTCC GAGAAGGAGT GGCTGGCGCC GAGCGATAGC
GAAGACAGCC GCTACGGCGA GGACCGTGCC TCGTTGGAGA ATCCGCTGGC GGCGGTGATG
ATGGGGCTGA TCTACGTCAA CCCCGAAGGT GTGGACGGCA ATCCGGACCC GCTGCGGACC
GCCGAGGACG TGCGCATCAC CTTCGAGCGC ATGGCCATGA ACGACGAGGA GACCGTCGCC
CTCACCGCCG GCGGCCACAC GGTCGGCAAG TGCCACGGCA ATGGCGATGT CGAGAACCTC
GGGCCGGATC CGGAGTCGGC GGACGTCGAG GAGCAGGGGC TCGGCTGGAA CAACAAGGTG
ACCCGCGGAG TGGGCCGGGA CACCGTCAGC AGCGGCATCG AGGGGGCCTG GACCACCTAT
CCCACCCGCT GGGACAACGG CTACTTCCAC CTGCTGCTCA ACTACGAGTG GGAACTGACC
AAGAGCCCCG CCGGGGCCTG GCAGTGGGAG CCGGTGGATA TCAAGGAGGA AGACAAGCCG
GTCGATGTTG AAGACCCGTC CATCCGCCTC AATCCGATCA TGACCGATGC GGATATGGCG
ATGAAGATGG ATCCGGCCTA CCGCAAGATC TCCGAGCGGT TCTACAACGA TCCGGCCTAC
TTCGACGAGG TCTTCGCCCG AGCCTGGTTC AAGCTCACTC ACCGGGATCT GGGGCCACGG
ACCCGTTATA TCGGCCCGGA GGCCCCCCAG GAAGATCTGA TTTGGCAGGA TCCGGTGCCG
GCGGGTCGGA CCGATTACGA CGTAGAGGCG CTCAAGGCGA AGATCGCCGA TAGCGGGCTG
AGCATCGGCG AGATGGTCAG CACTGCCTGG GACAGTGCCC GCACCTTCCG CGGCTCGGAT
AACCGGGGCG GGGCCAATGG GGCCCGGATT CGCCTGGCGC CGCAGAAGGA CTGGGAGGGT
AATGAGCCCG AGCGGCTGTC GAAGGTCCTC GGCGTCCTCG AGGGCATCGC CGCCGATGCC
GGTGCCAGTC TGGCCGATAC CATCGTGCTC GCCGGCAACG TGGGGATCGA GCAGGCGGCC
CGGGCAGCCG GCCACGACAT CACCGTTCCC TTCGCCCCCG GCCGCGGTGA TGCCAGCCAG
GAGATGACCG ACGTGGACTC CTTCCAGTAC CTGGAGCCGC TCGCCGACGG CTACCGCAAC
TGGGTCAAGA AGGAGTATGC GGTGCAGCCC GAGGAGATGA TGCTCGATCG CACCCAGCTG
ATGGGGCTGA CCGCGCCGGA GATGACCGTC CTGGTCGGCG GCATGCGGGT GCTGGGGACC
AACCACGGTG GCACCAAGCA CGGTGTGCTC ACCGACCGCG AGGGCCAGCT GACCAACGAC
TTCTTCGTCA ATCTCACGGA CATGGCCTAC ACCTGGAAGC CGGTGGGCAG CAACCGTTAC
GAGATCCGCC AGCGCAGCAG CGATGCGGTG AAGTGGACGG CGACCCGTGT GGATCTCGTC
TTCGGGTCCA ACTCGATCCT GCGCTCGTAC GCCGAGGTCT ACGCCCAGGA CGACAACCGG
GAGAAGTTCG TCCACGACTT CGTGGCGGCC TGGACCAAGG TGATGAACGC CGACCGATTC
GACCTGGTGG CGTAA
 
Protein sequence
MADQQKAGGC PVMHGAMTQV GESNLDMWPN ALNLDILHQH DRKPDPMGEG FNYREEVKKL 
DLDAVKQDLH QLMTDSQAWW PADWGHYGGL MIRMAWHAAG TYRVADGRGG GGTGNQRFAP
INSWPDNVNL DKARRLLWPI KKKYGNRLSW ADLIILAGNV AYESMGLKTF GFSLGREDIW
HPEKDIYWGS EKEWLAPSDS EDSRYGEDRA SLENPLAAVM MGLIYVNPEG VDGNPDPLRT
AEDVRITFER MAMNDEETVA LTAGGHTVGK CHGNGDVENL GPDPESADVE EQGLGWNNKV
TRGVGRDTVS SGIEGAWTTY PTRWDNGYFH LLLNYEWELT KSPAGAWQWE PVDIKEEDKP
VDVEDPSIRL NPIMTDADMA MKMDPAYRKI SERFYNDPAY FDEVFARAWF KLTHRDLGPR
TRYIGPEAPQ EDLIWQDPVP AGRTDYDVEA LKAKIADSGL SIGEMVSTAW DSARTFRGSD
NRGGANGARI RLAPQKDWEG NEPERLSKVL GVLEGIAADA GASLADTIVL AGNVGIEQAA
RAAGHDITVP FAPGRGDASQ EMTDVDSFQY LEPLADGYRN WVKKEYAVQP EEMMLDRTQL
MGLTAPEMTV LVGGMRVLGT NHGGTKHGVL TDREGQLTND FFVNLTDMAY TWKPVGSNRY
EIRQRSSDAV KWTATRVDLV FGSNSILRSY AEVYAQDDNR EKFVHDFVAA WTKVMNADRF
DLVA
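
The gene and protein sequences above are related through translation table 11, as stated in the record. The sketch below shows one way to reproduce the protein from the gene sequence and to compute GC content from the 10-base blocks as printed. It is an illustration, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the code assumes an unspliced CDS that starts with ATG and contains only the bases A, C, G and T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, with codons enumerated in T, C, A, G order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked sequence layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 18 bases and final 15 bases of the gene sequence listed above.
print(translate("ATGGCAGATC AACAGAAG"))  # MADQQK
print(translate("GACCTGGTGG CGTAA"))     # DLVA
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the Protein sequence and GC content fields of this record.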