Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_04188 |
Symbol | hpaB |
ID | 8115084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 4497407 |
End bp | 4498969 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644850328 |
Product | hypothetical protein |
Protein accession | YP_003001901 |
Protein GI | 251787597 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2368] Aromatic ring hydroxylase |
TIGRFAM ID | [TIGR02310] 4-hydroxyphenylacetate 3-monooxygenase, oxygenase component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCAG AAGATTTCCG CGCCAGTACC CAACGTCCTT TCACCGGGGA AGAGTATCTG AAAAGCCTGC AGGATGGTCG CGAGATCTAT ATCTATGGCG AGCGAGTGAA AGACGTCACC ACTCATCCGG CATTTCGTAA TGCGGCAGCG TCTGTTGCCC AGCTGTACGA CGCACTGCAC AAACCGGAGA TGCAGGACTC TCTGTGTTGG AACACCGACA CCGGCAGCGG CGGCTATACC CATAAATTCT TCCGCGTGGC GAAAAGTGCC GACGACCTGC GCCAGCAACG CGACGCCATC GCTGAGTGGT CACGCCTGAG CTATGGCTGG ATGGGCCGTA CCCCAGACTA CAAAGCCGCT TTCGGTTGCG CACTGGGCGC GAATCCGGGC TTTTACGGTC AGTTCGAGCA GAACGCCCGT AACTGGTACA CCCGTATTCA GGAAACTGGC CTCTACTTTA ACCACGCGAT TGTTAACCCA CCGATCGATC GTCATTTGCC GACCGATAAA GTGAAAGACG TTTACATCAA GCTGGAAAAA GAGACTGACG CCGGGATTAT CGTCAGCGGT GCGAAAGTGG TTGCCACCAA CTCGGCGCTG ACTCACTACA ACATGATTGG CTTCGGCTCG GCACAAGTGA TGGGCGAAAA CCCGGACTTC GCACTGATGT TCGTTGCGCC AATGGATGCC GATGGCGTGA AATTAATCTC CCGCGCCTCT TATGAGATGG TCGCGGGTGC TACCGGCTCG CCATACGACT ACCCGCTCTC CAGCCGCTTC GATGAGAACG ATGCGATTCT GGTGATGGAT AACGTGCTGA TTCCATGGGA AAACGTGCTG ATCTACCGCG ATTTTGATCG CTGCCGTCGC TGGACGATGG AAGGCGGTTT TGCCCGTATG TATCCGCTGC AAGCCTGTGT GCGCCTGGCA GTGAAATTAG ACTTCATTAC GGCACTGCTG AAAAAATCAC TCGAATGTAC CGGCACCCTG GAGTTCCGTG GTGTGCAGGC CGATCTCGGT GAAGTGGTAG CGTGGCGCAA CACCTTCTGG GCATTGAGTG ACTCGATGTG TTCAGAAGCA ACGCCGTGGG TCAACGGGGC TTATTTACCG GATCATGCCG CACTGCAAAC CTATCGCGTA CTGGCACCAA TGGCCTACGC GAAGATCAAA AACATTATCG AACGCAACGT TACCAGTGGC CTGATCTATC TCCCTTCCAG TGCCCGTGAC CTGAATAATC CGCAGATCGA CCAGTATCTG GCGAAGTATG TGCGCGGTTC GAACGGTATG GATCACGTCC AGCGCATCAA GATCCTCAAA CTGATGTGGG ATGCTATTGG CAGCGAATTT GGTGGTCGTC ACGAACTGTA TGAAATCAAC TACTCCGGTA GCCAGGATGA GATTCGCCTG CAGTGTCTGC GCCAGGCACA AAACTCCGGC AATATGGACA AGATGATGGC GATGGTTGAT CGCTGCCTGT CGGAATACGA CCAGGACGGC TGGACTGTGC CGCACCTGCA CAACAACGAC GATATCAACA TGCTGGATAA GCTGCTGAAA TAA
|
Protein sequence | MKPEDFRAST QRPFTGEEYL KSLQDGREIY IYGERVKDVT THPAFRNAAA SVAQLYDALH KPEMQDSLCW NTDTGSGGYT HKFFRVAKSA DDLRQQRDAI AEWSRLSYGW MGRTPDYKAA FGCALGANPG FYGQFEQNAR NWYTRIQETG LYFNHAIVNP PIDRHLPTDK VKDVYIKLEK ETDAGIIVSG AKVVATNSAL THYNMIGFGS AQVMGENPDF ALMFVAPMDA DGVKLISRAS YEMVAGATGS PYDYPLSSRF DENDAILVMD NVLIPWENVL IYRDFDRCRR WTMEGGFARM YPLQACVRLA VKLDFITALL KKSLECTGTL EFRGVQADLG EVVAWRNTFW ALSDSMCSEA TPWVNGAYLP DHAALQTYRV LAPMAYAKIK NIIERNVTSG LIYLPSSARD LNNPQIDQYL AKYVRGSNGM DHVQRIKILK LMWDAIGSEF GGRHELYEIN YSGSQDEIRL QCLRQAQNSG NMDKMMAMVD RCLSEYDQDG WTVPHLHNND DINMLDKLLK
|
| |