Gene Information |
Locus tag | SeSA_A1163 |
Symbol | hpaB |
ID | 6517128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | - |
Start bp | 1144397 |
End bp | 1145959 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642746288 |
Product | 4-hydroxyphenylacetate 3-monooxygenase, oxygenase component |
Protein accession | YP_002114097 |
Protein GI | 194734102 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2368] Aromatic ring hydroxylase |
TIGRFAM ID | [TIGR02310] 4-hydroxyphenylacetate 3-monooxygenase, oxygenase component |
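
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions match the numbers in this record but are not stated by it. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the "Start bp" and "End bp" fields of this record.
length = gene_length_bp(1144397, 1145959)
print(length, protein_length_aa(length))  # prints "1563 520"
```

The result agrees with the listed gene length of 1563 bp and protein length of 520 aa.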
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.152392 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCTG AAGATTTTCG TACTGATAAC AAGCGTCCGT TAACGGGCGA AGAGTATTTA AAAAGCCTGC AGGACGGGCG GGAAATTTAT ATTTACGGCG AACGCGTTAA AGATGTTACG ACACATCCGG CATTCCGCAA TGCCGCAGCC TCTGTCGCAC AACTGTATGA CGCATTACAT AAACCGTCGA TGCAAGATAC CCTGTGCTGG AATACCGATA CCGGCAGCGG CGGTTATACG CATAAATTTT TCCGCGTGGC GAAAAGCGCA GACGATCTGC GCCAACAGCG TGATGCTATC GCCGAGTGGT CACGCCTGAG TTACGGCTGG ATGGGACGCA CACCGGATTA CAAAGCCGCC TTTGGCTGCG CTCTGGGCGC TAACCCAGCC TTCTACGGCC AGTTTGAGCA GAACGCCCGC AACTGGTACA CCCGTATTCA GGAGACCGGC CTGTACTTTA ACCATGCTAT CGTCAACCCG CCTATCGACC GCCACAAACC TGCCGACGAA GTGAAAGACG TCTATATCAA GCTGGAGAAA GAGACGGACG CCGGGATTAT TGTCAGCGGG GCGAAAGTCG TCGCCACTAA CTCCGCCCTG ACTCACTACA ACATGATTGG TTTCGGCTCA GCCCAGGTGA TGGGCGAAAA CCCGGATTTT GCGCTGATGT TTGTCGCGCC AATGGATGCC GAAGGCGTAA AACTTATTTC GCGCGCCTCG TATGAAATGG TCGCGGGCGC GACGGGCTCG CCGTTTGATT ATCCACTCTC CAGCCGCTTT GATGAAAACG ATGCCATTCT GGTGATGGAT AAGGTGCTGA TCCCGTGGGA AAACGTGTTG ATTTACCGTG ATTTCGATCG TTGCCGTCGC TGGACGATGG AAGGCGGCTT TGCCCGTATG TATCCACTGC AAGCCTGTGT TCGTCTGGCG GTTAAACTTG ATTTCATTAC CGCGCTGCTG AAAAAATCGC TCGAATGTAC GGGTACCGTA GAGTTCCGGG GCGTGCAGGC CGATCTCGGC GAAGTCGTGG CCTGGCGCAA TATGTTCTGG GCATTGAGCG ATTCTATGTG TTCTGAAGCA ACCCCGTGGG TAAACGGCGC CTGGCTACCG GACCACGCTG CGCTGCAAAC CTATCGTGTG ATGGCCCCAA TGGCCTACGC GAAAATTAAA AATATTATTG AACGTAACGT TACCAGCGGC CTGATTTACC TGCCTTCCAG CGCCCGCGAT CTGAATAATC CGCAAATCGA CCAGTACCTG GCGAAATACG TACGTGGCTC TAACGGAATG GACCATGTTG AACGTATCAA AATTCTTAAA TTGATGTGGG ATGCCATCGG CAGCGAGTTT GGCGGTCGCC ATGAGCTGTA CGAGATTAAC TACTCGGGCA GCCAGGATGA AATTCGTCTG CAGTGCCTGC GTCAGGCCCA GAGCTCCGGC AATATGGATA AGATGATGGC AATGGTCGAT CGCTGCCTCT CCGAATACGA TCAGAATGGC TGGACGGTTT CGCATTTGCA CAATAACGAC GACATCAATC AACTGGATAA GCTGCTGAAA TAA
|
Protein sequence | MKPEDFRTDN KRPLTGEEYL KSLQDGREIY IYGERVKDVT THPAFRNAAA SVAQLYDALH KPSMQDTLCW NTDTGSGGYT HKFFRVAKSA DDLRQQRDAI AEWSRLSYGW MGRTPDYKAA FGCALGANPA FYGQFEQNAR NWYTRIQETG LYFNHAIVNP PIDRHKPADE VKDVYIKLEK ETDAGIIVSG AKVVATNSAL THYNMIGFGS AQVMGENPDF ALMFVAPMDA EGVKLISRAS YEMVAGATGS PFDYPLSSRF DENDAILVMD KVLIPWENVL IYRDFDRCRR WTMEGGFARM YPLQACVRLA VKLDFITALL KKSLECTGTV EFRGVQADLG EVVAWRNMFW ALSDSMCSEA TPWVNGAWLP DHAALQTYRV MAPMAYAKIK NIIERNVTSG LIYLPSSARD LNNPQIDQYL AKYVRGSNGM DHVERIKILK LMWDAIGSEF GGRHELYEIN YSGSQDEIRL QCLRQAQSSG NMDKMMAMVD RCLSEYDQNG WTVSHLHNND DINQLDKLLK
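
Both sequences are printed in space-separated blocks of ten characters. Although the gene lies on the minus strand, the listed gene sequence begins with ATG and ends with TAA, so it appears to be given in coding orientation already. The sketch below, written under that assumption, strips the block spacing, computes GC content, and translates the leading codons. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built in the NCBI base order TCAG; translation table 11 uses the same amino acid assignments as the standard code and differs only in which codons may serve as starts, which this sketch does not model.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
fragment = "ATGAAACCTG AAGATTTTCG TACTGATAAC"
print(translate(fragment))  # prints "MKPEDFRTDN"
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` is one way to cross-check the GC content and protein length fields.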