Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1935 |
Symbol | |
ID | 4445554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 2178913 |
End bp | 2180529 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639689745 |
Product | 4-hydroxyphenylacetate 3-hydroxylase |
Protein accession | YP_831417 |
Protein GI | 116670484 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2368] Aromatic ring hydroxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAAA TTGTTGACCA GTCAGTTCCC GCTACCGATT CCTCGGCGTC CTCCACGCCG TCGATGAGTG AGAGAACAAC GCCGGCGAAT CCGAAGATGC CGATGACCGG GGATGAGTAC ATTAAGTCTT TACAGGATGA CCGCGAGGTC TGGATTTACG GTGAGCGTGT CAAGGACGTG ACTACCCACC CGGCGTTCCG GAACTCGATC AGGATGGCCG CCCGGCTGTA CGACGCAATG CACAAGCCCG AGATGCAGGA CAAGCTGATG GTGCCCACGG ACACCGGCTC CGGTGGCAAG ACGATGTCCT TCTTCCGCAC CCCGCACAGC GTCGAGGATC TGAAGAAGGA CCGCTTAGCG ATCGAGACCT GGTCCCGCAT GAGCTACGGA TGGCTGGGCC GCTCACCGGA TTACAAGGCC GCGTTCCTGG GCACCTTGGG CGGCAACACG GACTTCTACG ATCCGTTCCA GGAGAACGCC AAGCGCTGGT ACAAGGAATC TCAGGAGAAG GTCCTGTACT GGAACCATGC GATCATCAAC CCGCCGGTGG ACCGGCACCT CCCGGCGGAC CAGGTCGGCG ATGTGTTCAT GAAGGTCGAG AAGGAAACCG GCTCGGGCCT GATCGTCTCC GGTGCAAAGG TCGTTGCCAC CGGGTCCGCG ATCACCAACT ACAATTTCAT CGCGCACTAC GGCCTGCCGA TCAAGCGCAA GGAATTCGCA CTGATCTGCA CCGTGCCGAT GGACGCCCCG GGTGTGAAGC TGATCAGCCG GGCCTCCTAC GCCCACCAGG CCGCTGTGAT GGGCACCCCG TTCGACTACC CGCTGTCGAG CCGGATGGAC GAGAACGACT CTGTCTTCAT CTTCGACAAG GTCCTAATCC CGTGGGAGAA CGTCTTCGCC TACGGGGACG TGGAGAAGAT CAACAACTTC TTCCCGCAGA CCGGCTTCAT CAACCGCTTC ACCTTCCAGG GTGTGATCCG TCTCGCCACC AAGCTCGACT TCATCGCCGG GCTGCTGATG AAGGCCCTGG AAGTCGCCGG AACGCAGGAC TTCCGCGGCG TCCAGACCCG GGTGGGCGAG GTCCTGGGCT GGCGCAACAT GTTCCATGCG TTGATCGACG GCATGACCCT GAACCCCGAC CAGGGCCCCA ACGGCACCGT GCTGCCCAAG CTCGACTACG GACTGTCCTA CCGGATGTTC ATGGCCATCG GCTACCCGCG GATCAAGGAA ATCATCGAAC AAGACGTCGC ATCCGGCCTG ATCTACCTGA ACTCCTCAGC CCTGGACTTC AAGACCCCGG AAATCCGGCC CTACCTGGAC AAGTACATCC GCGGCTCTGA CGGCGTAGAG GCCGTGGATC GGGTCAAGTT GATGAAGCTG CTCTGGGACT CCATCGGCAC CGAGTTTGGC GGCCGGCATG AACTCTACGA GCGGAACTAC TCCGGCAACC ACGAGAATGT GAAGGCCGAG ATCCTCTTCG CCGCGCAGGC CCAGGGCACC ACGGACTACA TGAAGGGCTT CGCTGACCAG TGCCTCGCCG AATACGACCT GGACGGCTGG ACCGTGCCCG ACCTGATCAG CAACGACGAC GTCAGCCTCT TCCTCAAGCG CAGCTAG
|
Protein sequence | MTEIVDQSVP ATDSSASSTP SMSERTTPAN PKMPMTGDEY IKSLQDDREV WIYGERVKDV TTHPAFRNSI RMAARLYDAM HKPEMQDKLM VPTDTGSGGK TMSFFRTPHS VEDLKKDRLA IETWSRMSYG WLGRSPDYKA AFLGTLGGNT DFYDPFQENA KRWYKESQEK VLYWNHAIIN PPVDRHLPAD QVGDVFMKVE KETGSGLIVS GAKVVATGSA ITNYNFIAHY GLPIKRKEFA LICTVPMDAP GVKLISRASY AHQAAVMGTP FDYPLSSRMD ENDSVFIFDK VLIPWENVFA YGDVEKINNF FPQTGFINRF TFQGVIRLAT KLDFIAGLLM KALEVAGTQD FRGVQTRVGE VLGWRNMFHA LIDGMTLNPD QGPNGTVLPK LDYGLSYRMF MAIGYPRIKE IIEQDVASGL IYLNSSALDF KTPEIRPYLD KYIRGSDGVE AVDRVKLMKL LWDSIGTEFG GRHELYERNY SGNHENVKAE ILFAAQAQGT TDYMKGFADQ CLAEYDLDGW TVPDLISNDD VSLFLKRS
|
| |