Gene Information |
Locus tag | SeHA_C1209 |
Symbol | hpaB |
ID | 6492290 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 1190452 |
End bp | 1192014 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642741448 |
Product | 4-hydroxyphenylacetate 3-monooxygenase, oxygenase component |
Protein accession | YP_002045099 |
Protein GI | 194450127 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2368] Aromatic ring hydroxylase |
TIGRFAM ID | [TIGR02310] 4-hydroxyphenylacetate 3-monooxygenase, oxygenase component |
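The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; both assumptions match the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(1190452, 1192014)
print(length)                     # 1563
print(protein_length_aa(length))  # 520
```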
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.75215 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAACCTG AAGATTTTCG TACTGATAAC AAGCGTCCGT TAACGGGCGA AGAGTATTTA AAAAGCCTGC AGGACGGGCG GGAAATTTAT ATTTACGGCG AACGCGTTAA AGATGTTACG ACACATCCGG CATTCCGCAA TGCCGCAGCC TCTGTCGCAC AACTGTATGA CGCATTACAT AAACCGTCGA TGCAAGATAC CCTGTGCTGG AATACCGATA CCGGCAGCGG CGGTTATACG CATAAATTTT TCCGCGTGGC GAAAAGCGCA GACGATCTGC GCCAACAGCG TGATGCTATC GCCGAGTGGT CACGCCTGAG TTACGGCTGG ATGGGACGCA CACCGGATTA CAAAGCCGCC TTTGGCTGCG CTCTGGGCGC TAACCCAGCC TTCTACGGCC AGTTTGAGCA GAACGCCCGC AACTGGTACA CCCGCATTCA GGAGACTGGC CTGTACTTTA ACCATGCTAT CGTCAACCCG CCCATTGACC GCCACAAACC TGCCGACGAA GTGAAAGACG TCTATATCAA GCTGGAGAAA GAGACGGACG CCGGGATTAT TGTCAGCGGG GCGAAAGTTG TCGCCACTAA CTCCGCCCTG ACTCACTACA ACATGATTGG TTTCGGCTCA GCCCAGGTGA TGGGCGAAAA CCCGGATTTC GCTCTGATGT TTGTCGCGCC AATGGATGCC GAAGGCGTAA AACTTATTTC GCGCGCCTCG TATGAAATGG TCGCGGGCGC GACGGGCTCG CCGTTTGATT ATCCACTCTC CAGCCGCTTT GATGAAAACG ATGCCATTCT GGTGATGGAC AAGGTGTTGA TCCCGTGGGA AAACGTGTTG ATTTACCGTG ATTTCGATCG TTGCCGTCGC TGGACGATGG AAGGCGGCTT TGCCCGTATG TATCCACTGC AAGCCTGTGT TCGTCTGGCG GTTAAACTTG ATTTCATTAC CGCGCTGCTG AAAAAATCGC TCGAATGTAC GGGTACCGTA GAGTTCCGGG GCGTGCAGGC CGATCTCGGC GAAGTCGTGG CCTGGCGCAA TATGTTCTGG GCATTGAGCG ATTCTATGTG TTCCGAAGCA ACCCCGTGGG TAAACGGCGC CTGGCTACCG GACCACGCCG CGCTGCAAAC CTATCGTGTG ATGGCCCCAA TGGCCTACGC GAAAATTAAA AATATTATTG AACGTAACGT TACCAGCGGC CTGATTTACC TGCCTTCCAG CGCCCGCGAT CTGAATAATC CGCAAATCGA CCAGTACCTG GCGAAATACG TACGCGGCTC TAACGGAATG GACCATGTTG AACGTATCAA AATTCTTAAA TTGATGTGGG ATGCCATCGG CAGCGAGTTT GGCGGTCGCC ATGAGCTGTA CGAGATTAAC TACTCGGGCA GCCAGGATGA AATTCGTCTG CAGTGCCTGC GTCAGGCCCA GAGCTCCGGC AATATGGATA AGATGATGGC AATGGTCGAT CGCTGCCTCT CCGAATACGA TCAGAATGGC TGGACGGTTT CGCATTTGCA CAATAACGAC GACATCAATC AACTGGATAA GCTGCTGAAA TAA
|
Protein sequence | MKPEDFRTDN KRPLTGEEYL KSLQDGREIY IYGERVKDVT THPAFRNAAA SVAQLYDALH KPSMQDTLCW NTDTGSGGYT HKFFRVAKSA DDLRQQRDAI AEWSRLSYGW MGRTPDYKAA FGCALGANPA FYGQFEQNAR NWYTRIQETG LYFNHAIVNP PIDRHKPADE VKDVYIKLEK ETDAGIIVSG AKVVATNSAL THYNMIGFGS AQVMGENPDF ALMFVAPMDA EGVKLISRAS YEMVAGATGS PFDYPLSSRF DENDAILVMD KVLIPWENVL IYRDFDRCRR WTMEGGFARM YPLQACVRLA VKLDFITALL KKSLECTGTV EFRGVQADLG EVVAWRNMFW ALSDSMCSEA TPWVNGAWLP DHAALQTYRV MAPMAYAKIK NIIERNVTSG LIYLPSSARD LNNPQIDQYL AKYVRGSNGM DHVERIKILK LMWDAIGSEF GGRHELYEIN YSGSQDEIRL QCLRQAQSSG NMDKMMAMVD RCLSEYDQNG WTVSHLHNND DINQLDKLLK
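The relationship between the two sequence fields can be reproduced with a short sketch. It strips the 10-base grouping used in this record, computes a GC fraction, and translates codon by codon. The codon assignments of translation table 11 are the same as those of the standard code (the tables differ only in permitted start codons), so the standard amino-acid string is used here. The gene sequence is already listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), so no reverse complement is applied. The helper names `clean`, `gc_fraction`, and `translate` are illustrative, not part of any library.

```python
# Codon table built in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence from this record.
print(translate("ATGAAACCTG AAGATTTTCG"))  # MKPEDF
```

Passing the full gene sequence to `translate` and `gc_fraction` should allow the protein sequence and GC content fields to be compared against the record.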