Gene Information |
Locus tag | Haur_0032 |
Symbol | |
ID | 5731904 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 40219 |
End bp | 41676 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277153 |
Product | catalase |
Protein accession | YP_001542812 |
Protein GI | 159896565 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0753] Catalase |
TIGRFAM ID | |
|
|
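The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the record: the constant and function names are illustrative, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length of an unspliced CDS includes one stop codon.

```python
# Values copied from the record above; all names are illustrative only.
START_BP = 40219
END_BP = 41676
GENE_LENGTH_BP = 1458
PROTEIN_LENGTH_AA = 485


def span_length(start: int, end: int) -> int:
    """Length of a span given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 41676 - 40219 + 1 gives 1458 bp, and 1458 / 3 gives 486 codons, of which one is the stop codon, leaving 485 residues.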
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00175553 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGAGC AAGAACAACT GACGACCGCC CATGGTGCGC CAATTGCCGA CAATCAAAAC TCGTTGACGG CTGGCCCCCG TGGCCCACTG TTGATGCAAG ATTATCAACT GTTGGAAAAA ATGGCGACCT TCAATCGCGA ACGGGTGCCA GAACGGGTCG TCCATGCCAA AGGCTCAGGG GCATTTGGGA CGTTTACCGT TACCAATGAT GTAACGGCCT ATACCAAGGC TTCTATCTTT GGAGCAGTTG GCAAGCAAAC TCCAATGTTA TTGCGCTTCT CAACCGTTGC TGGCGAGCAC GGCGCTGCCG ATGCCGAACG CGATGTGCGC GGTTTTGCCG TCAAATTCTA TACCGATGAA GGCAACTGGG ATTTGGTCGG GAATAATACG CCAGTCTTTT TTGTGCGCGA TCCCTACAAA TTCAGCGATT TTATTCATAC CCAAAAGCGT GACCCCAAAA CCAACATGCG CTCGCCCCAA GCCATGTGGG ATTTCTGGTC GCTCTCACCC GAAAGCTTAC ACCAAGTAAC GATCTTGTTC AGCGATCGCG GTTTGCCAAT TAGCTATCGG TTTGTGCATG GCTTTGGCAG CCACACCTAT AGCTTTATCA ACGCTCAAGG CGAACGCTTC TGGATCAAGT TCCACTTCCG CTGCCAACAA GGCATCAAGA ACTGGACGAA TGCCGAAGCC GCCGAAGTGG TTGGGGTTGA TCGCGAAAGC TCACAACGCG ATTTGTTCGA TGCAATCGAA CGCGGCGAAT ATCCAAGCTG GAAGCTTTGC GTCCAAGTGA TGCCCGAAGC TGATGCTGAA ACCTACCACC TCAACCCATT CGATTTGACC AAAGTATGGC CACACGGCGA TTATCCGTTG ATCGAAGTTG GCACGATGGA GTTGAATCGT AACCCCGAAA ACTATTTTGC TGAAATCGAG CAAGCTGCAT TTGAACCATC AAACATTGTG CCAGGCATTG GCTTCTCGCC TGATAAGATG TTGCAAGCGC GGATTATGTC GTATGCTGAT GCCCACCGCT ATCGCATTGG CGTAAATTAT GCTGCACTAC CAGTCAACAA ACCGCATTCA CCAGTCAACA CCTATCATCG CGATGGCCAA ATGCGTTTCG ATGGTAATGG TGGTGGCTCG GTTAACTACG AGCCAAACAG CTTTGGCGGC CCGGTGCAAA ACGAACGCTA CGCTGAACCA GCCCTCAAAA TCAGCGGCGA TGCCGATCGC TACAATCATC GCGATGGCAA CGACGATTAC ACCCAACCAG GCAATTTGTT CCGCTTGATG AATGCTGATC AACAACAACA GTTATTCAAC AACATTGCAG CGGCGATGCA AGGCGTGCCT GAATTTATCC AATTGCGCCA AATCGGCCAC TTCTTGAAAG CAGATCCTGC TTATGGTCGC GGAGTTGCCG CCGCCCTGGG CCTCGATATC AGCAGCCTCG AAGCCTAG
|
Protein sequence | MDEQEQLTTA HGAPIADNQN SLTAGPRGPL LMQDYQLLEK MATFNRERVP ERVVHAKGSG AFGTFTVTND VTAYTKASIF GAVGKQTPML LRFSTVAGEH GAADAERDVR GFAVKFYTDE GNWDLVGNNT PVFFVRDPYK FSDFIHTQKR DPKTNMRSPQ AMWDFWSLSP ESLHQVTILF SDRGLPISYR FVHGFGSHTY SFINAQGERF WIKFHFRCQQ GIKNWTNAEA AEVVGVDRES SQRDLFDAIE RGEYPSWKLC VQVMPEADAE TYHLNPFDLT KVWPHGDYPL IEVGTMELNR NPENYFAEIE QAAFEPSNIV PGIGFSPDKM LQARIMSYAD AHRYRIGVNY AALPVNKPHS PVNTYHRDGQ MRFDGNGGGS VNYEPNSFGG PVQNERYAEP ALKISGDADR YNHRDGNDDY TQPGNLFRLM NADQQQQLFN NIAAAMQGVP EFIQLRQIGH FLKADPAYGR GVAAALGLDI SSLEA
|
| |
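The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch of how the gene sequence could be cleaned, summarized, and translated. The function names are illustrative. The codon assignments are hardcoded from NCBI translation table 11, whose codon-to-amino-acid assignments match the standard code. Since the displayed gene sequence begins with ATG and ends with TAG, the sketch assumes it is already shown in coding orientation, although the gene lies on the minus strand.

```python
BASES = "TCAG"
# Amino acids for translation table 11, in the order TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence (0.0 if empty)."""
    s = clean(seq)
    if not s:
        return 0.0
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[pos:pos + 3], "X")  # "X" for codons with ambiguous bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown in the record.
print(translate("ATGGATGAGC AAGAACAACT GACGACCGCC"))  # MDEQEQLTTA
```

The ten residues printed here match the first block of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the Protein Length and GC content fields.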