Gene Haur_0032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0032 
Symbol 
ID5731904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp40219 
End bp41676 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content51% 
IMG OID641277153 
Productcatalase 
Protein accessionYP_001542812 
Protein GI159896565 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0753] Catalase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00175553 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGAGC AAGAACAACT GACGACCGCC CATGGTGCGC CAATTGCCGA CAATCAAAAC 
TCGTTGACGG CTGGCCCCCG TGGCCCACTG TTGATGCAAG ATTATCAACT GTTGGAAAAA
ATGGCGACCT TCAATCGCGA ACGGGTGCCA GAACGGGTCG TCCATGCCAA AGGCTCAGGG
GCATTTGGGA CGTTTACCGT TACCAATGAT GTAACGGCCT ATACCAAGGC TTCTATCTTT
GGAGCAGTTG GCAAGCAAAC TCCAATGTTA TTGCGCTTCT CAACCGTTGC TGGCGAGCAC
GGCGCTGCCG ATGCCGAACG CGATGTGCGC GGTTTTGCCG TCAAATTCTA TACCGATGAA
GGCAACTGGG ATTTGGTCGG GAATAATACG CCAGTCTTTT TTGTGCGCGA TCCCTACAAA
TTCAGCGATT TTATTCATAC CCAAAAGCGT GACCCCAAAA CCAACATGCG CTCGCCCCAA
GCCATGTGGG ATTTCTGGTC GCTCTCACCC GAAAGCTTAC ACCAAGTAAC GATCTTGTTC
AGCGATCGCG GTTTGCCAAT TAGCTATCGG TTTGTGCATG GCTTTGGCAG CCACACCTAT
AGCTTTATCA ACGCTCAAGG CGAACGCTTC TGGATCAAGT TCCACTTCCG CTGCCAACAA
GGCATCAAGA ACTGGACGAA TGCCGAAGCC GCCGAAGTGG TTGGGGTTGA TCGCGAAAGC
TCACAACGCG ATTTGTTCGA TGCAATCGAA CGCGGCGAAT ATCCAAGCTG GAAGCTTTGC
GTCCAAGTGA TGCCCGAAGC TGATGCTGAA ACCTACCACC TCAACCCATT CGATTTGACC
AAAGTATGGC CACACGGCGA TTATCCGTTG ATCGAAGTTG GCACGATGGA GTTGAATCGT
AACCCCGAAA ACTATTTTGC TGAAATCGAG CAAGCTGCAT TTGAACCATC AAACATTGTG
CCAGGCATTG GCTTCTCGCC TGATAAGATG TTGCAAGCGC GGATTATGTC GTATGCTGAT
GCCCACCGCT ATCGCATTGG CGTAAATTAT GCTGCACTAC CAGTCAACAA ACCGCATTCA
CCAGTCAACA CCTATCATCG CGATGGCCAA ATGCGTTTCG ATGGTAATGG TGGTGGCTCG
GTTAACTACG AGCCAAACAG CTTTGGCGGC CCGGTGCAAA ACGAACGCTA CGCTGAACCA
GCCCTCAAAA TCAGCGGCGA TGCCGATCGC TACAATCATC GCGATGGCAA CGACGATTAC
ACCCAACCAG GCAATTTGTT CCGCTTGATG AATGCTGATC AACAACAACA GTTATTCAAC
AACATTGCAG CGGCGATGCA AGGCGTGCCT GAATTTATCC AATTGCGCCA AATCGGCCAC
TTCTTGAAAG CAGATCCTGC TTATGGTCGC GGAGTTGCCG CCGCCCTGGG CCTCGATATC
AGCAGCCTCG AAGCCTAG
 
Protein sequence
MDEQEQLTTA HGAPIADNQN SLTAGPRGPL LMQDYQLLEK MATFNRERVP ERVVHAKGSG 
AFGTFTVTND VTAYTKASIF GAVGKQTPML LRFSTVAGEH GAADAERDVR GFAVKFYTDE
GNWDLVGNNT PVFFVRDPYK FSDFIHTQKR DPKTNMRSPQ AMWDFWSLSP ESLHQVTILF
SDRGLPISYR FVHGFGSHTY SFINAQGERF WIKFHFRCQQ GIKNWTNAEA AEVVGVDRES
SQRDLFDAIE RGEYPSWKLC VQVMPEADAE TYHLNPFDLT KVWPHGDYPL IEVGTMELNR
NPENYFAEIE QAAFEPSNIV PGIGFSPDKM LQARIMSYAD AHRYRIGVNY AALPVNKPHS
PVNTYHRDGQ MRFDGNGGGS VNYEPNSFGG PVQNERYAEP ALKISGDADR YNHRDGNDDY
TQPGNLFRLM NADQQQQLFN NIAAAMQGVP EFIQLRQIGH FLKADPAYGR GVAAALGLDI
SSLEA