Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bpro_4213 |
Symbol | |
ID | 4013117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas sp. JS666 |
Kingdom | Bacteria |
Replicon accession | NC_007948 |
Strand | + |
Start bp | 4431761 |
End bp | 4432891 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637943860 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_551003 |
Protein GI | 91790051 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCAC CCCTATCTTT CGAAACAGGC GCCGCCTGGG ACAACCCCAT GGGCACCGAC GGTTTCGAGT TCATCGAATA CGCCGCGCCG GACCCCCAGG CCATGGGCGC GCTGTTCGAG CGCATGGGCT TCAAGCCGAT TGCCAGGCAT CGCCATAAAG ATGTGACGCT GTACCGCCAG GGCGGCATCA ACTTCATCCT CAATGCCGAG CCCGATTCAT TTGCGCAGCG CTTTGCACGC CAGCACGGTC CCAGCGTCTG CGCCATTGCG TTCAGGGTGC GGGATGCCAA GGCCGCTTAC GAGCGCGCGA TTGCGCTGGG CGCCTGGGGT TATGCCCACA CCGCCGGCCC GGGCGAGCTG AACATCCCGG CCATCAAGGG CATTGGCGAC TCCATCATCT ATTTCATTGA CCGCTGGCGC GGCAAGAACG GCGCCCGGGA AGGCGATATC GGCAACATCG GTTTTTACGA TGTTGACTTC GAGCCCCTGC CGGGCGTGAG CGGCGCCGAG GCGCTGAATC CCACGGGCCA TGGGCTGACC TACATTGACC ACCTGACGCA CAACGTGCAC CGCGGCCGCA TGGACGAGTG GGCCGGCTTC TACGAGCGCC TGTTCAACTT CCGCGAGATC CGCTACTTCG ACATCGAAGG CCTGGTGACC GGCGTGAAAA GCAAGGCCAT GACCAGCCCC TGCGGCAAGA TCCGCATCCC GATCAACGAG GAAGGCAATG AGAAAGCCGG CCAGATCCAG GAGTACCTGG ACCGTTACCA GGGCGAGGGC ATCCAGCACA TCGCCATGGG CAGCGGCAAT TTGCCAGCCA CCGTGGACAA GCTACGCGCC AGCGGCATCA AGCTGCTGGA CACGGTAGAC ACCTACTACG AACTGATCGA CAAGCGCATT CCAGGCCATG GCGAAAATGT GGCGGAACTG CACAAGCGAA AAATTTTGGT GGACGGCAAG AAAGGCGCGC TTCTGCTGCA GATCTTCAGT GAAAACCAGC TCGGCCCGAT CTTCTTTGAA TTCATCCAGC GCAAGGGCGA CGAGGGTTTT GGCGAAGGCA ACTTCAAGGC ACTGTTCGAA AGCATCGAGC TGGACCAGAT GCGCCGAGGG GTTTTGGCGA GTGCACAATA A
|
Protein sequence | MNAPLSFETG AAWDNPMGTD GFEFIEYAAP DPQAMGALFE RMGFKPIARH RHKDVTLYRQ GGINFILNAE PDSFAQRFAR QHGPSVCAIA FRVRDAKAAY ERAIALGAWG YAHTAGPGEL NIPAIKGIGD SIIYFIDRWR GKNGAREGDI GNIGFYDVDF EPLPGVSGAE ALNPTGHGLT YIDHLTHNVH RGRMDEWAGF YERLFNFREI RYFDIEGLVT GVKSKAMTSP CGKIRIPINE EGNEKAGQIQ EYLDRYQGEG IQHIAMGSGN LPATVDKLRA SGIKLLDTVD TYYELIDKRI PGHGENVAEL HKRKILVDGK KGALLLQIFS ENQLGPIFFE FIQRKGDEGF GEGNFKALFE SIELDQMRRG VLASAQ
|
| |