Gene Information |
Locus tag | Arth_3519 |
Symbol | |
ID | 4443829 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 3956228 |
End bp | 3958144 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639691343 |
Product | phenol 2-monooxygenase |
Protein accession | YP_832994 |
Protein GI | 116672061 |
COG category | [C] Energy production and conversion; [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
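The coordinate and length fields above are consistent with one another, which a short calculation can confirm. The sketch below assumes that Start bp and End bp are inclusive positions on the replicon and that the CDS ends with a single stop codon that is not counted in the protein length. Both assumptions match the values in this record, but the record does not state them. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3956228
end_bp = 3958144

# Assumption: both coordinates are inclusive, so the span needs a +1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon is not translated into a residue.
stop_codon_count = 1
protein_length_aa = gene_length_bp // 3 - stop_codon_count

print(gene_length_bp)     # 1917
print(protein_length_aa)  # 638
```

Both results agree with the Gene Length (1917 bp) and Protein Length (638 aa) rows.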
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.777145 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAATTCC ACCATCACGG TTACGTATCC GGTGACCCGC GAGTCCAGCC GGCAGCAGGC GTAGGCATCA ATCGGCCTAC TGAGCTTCCT GATGAAGTGG ACGTCCTGAT TGTCGGCACA GGCCCTGCCG GCATGCTGGC CGCCGCCCAG CTGTCCCAGT TCCCGGGCGT CACCACGCGG ATCGTGGAAC GCCGTGCCGG CAGGCTGCCC ATCGGCCAGG CAGACGGCAT CCAGGCGAGG AGCGTCGAGA CCTTCCAGGC CTTCGGCTTC GCCGAGCGGA TCATCGCCGA GGCGTACCAC ATCACCGAAA TGGCGTTCTG GAAGCCGGAC CCGGCCGACC ACTCGCGCAT CATCCGGGGA GCCCGCGCGG TGGACGACGA GATGGGGATC AGTGAGTTCC CGCACCTCAT CGTCAACCAG GCCCGCGTGC TGGACTACTT CGCTGAATTC ATGGCGAACT CGCCTACCCG CATGGCGCCT GACTACGGCT TTGAGTTCCG GAGCCTGGAG GTCACCGGCG AGGGCGAGTA TCCGGTCACG GTGACCTTGC TTCACACTGC AGGCCCCCAG GAAGGCCGGG AGAAGGTTGT CCGGGCCAAA TATGTGATTG GTGCAGACGG CGCGCGCAGC AAGGTACGCG ATGCGATTGG CTGCACCCTG GCCGGCGACG CCGCCAACCA CGCCTGGGGT GTTATGGACG CCCTGGCCGT CACCGACTTC CCCGATATCC GCACAAAGTG CGCCATCCAG TCCGGGTCGG GCGGAAGCAT CCTGCTCATC CCGCGCGAAG GCGGCTTCCT GTTCCGCATG TATGTTGACC TCGGCGAGGT GGACCCGAAC GACAAGGGCG CCGTGCGCAA CACTTCCATC GAAGAGATCA TCCGCAAGGC GAACGAGATC CTCCACCCGT ACACGCTCGA CGTCCGAAAT GTCGCGTGGC ACAGCGTGTA CGAGGTGGGC CACCGGCTCA CTGACCGGTT CGACGACGTC CTCCCGGACC AGCGGGGCAC CCGCACGCCG CGCGTATTCA TCACCGGCGA CGCCTGCCAC ACGCACAGCG CCAAGGCCGG CCAGGGCATG AACGTCTCCA TGCAGGACGG TTTCAACCTG GCCTGGAAGC TTGGCCACGT CCTCGAGGGC CGCAGCCCGG AAAGCCTGCT GACCACGTAC TCGGACGAAC GTCAGGTCAT CGCCAAGAAC CTCATCGACT TCGACAAAGA GTGGTCCACG ATGATGGCGA AGAAGCCTGA AGAGTTCGAG AGCCCTTCCG AGCTTGAGGA CTTCTACGTC AGCACCGCCG AGTTCCCGGC CGGATTCATG ACCCAGTACG CCCCGTCGAT GCTCACCGGC GGCACCGGAC ACCAGGACCT GGCCGCCGGT TTCCCCGTCG GCAAGCGCTT CAAGTCAGCG CCCGTCGTGC GGGTCTGCGA TACCAACCCC ATGCAACTCG GACACCACGC CACAGCCGAC GGACGGTGGC GCATCTATGT CTTCGCCGAC GCCGCCGCGC CGGCAGCGGG ACAGCAGGGT GTCCCCTCAG CAGTGGCCGA CTTTGCCGAG TGGATTGCGC AGGCGCCGGA CTCGCCGCTG GCCGCCACGC CGTCGGGCGC CGACCTCGAC GCATGGTTCG ACGTGAAGGT GATCTACCAG CAGCCCCACA CGGACATCGA TATCAACGCA GTGCCGGCGG TGTTCAAGCC GCAGGTTGGC CCGTTCCAGC TGACGGATTA CGAGAAGGTG TACGCCACCG ATCCGAAGGC TGACATCTTC GAGCTGCGCG GCCTGGACCG CGGCGGCGTG 
ATCGTGGTGG TTCGCCCGGA CCAATACGTG GCCAACGTCC TGCCCCTGGC TGCGACGGCG GAACTCGGTG CGTTCTTCGC ACCCCTCCTG GCTACGGGAC GGGCCGCGGC AGTCTAG
|
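The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any calculation. As a minimal sketch, the hypothetical helper below (`gc_content` is not part of the source database) shows one way to compute a GC fraction like the one in the GC content row. It is run only on the first two blocks of the sequence to keep the example short.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two blocks of the gene sequence, copied from the record.
first_blocks = "GTGCAATTCC ACCATCACGG"
print(gc_content(first_blocks))  # 0.55 for these 20 bases only
```

Passing the complete gene sequence instead of the two-block excerpt gives the fraction for the whole gene, which the record reports as 66%.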
Protein sequence | MQFHHHGYVS GDPRVQPAAG VGINRPTELP DEVDVLIVGT GPAGMLAAAQ LSQFPGVTTR IVERRAGRLP IGQADGIQAR SVETFQAFGF AERIIAEAYH ITEMAFWKPD PADHSRIIRG ARAVDDEMGI SEFPHLIVNQ ARVLDYFAEF MANSPTRMAP DYGFEFRSLE VTGEGEYPVT VTLLHTAGPQ EGREKVVRAK YVIGADGARS KVRDAIGCTL AGDAANHAWG VMDALAVTDF PDIRTKCAIQ SGSGGSILLI PREGGFLFRM YVDLGEVDPN DKGAVRNTSI EEIIRKANEI LHPYTLDVRN VAWHSVYEVG HRLTDRFDDV LPDQRGTRTP RVFITGDACH THSAKAGQGM NVSMQDGFNL AWKLGHVLEG RSPESLLTTY SDERQVIAKN LIDFDKEWST MMAKKPEEFE SPSELEDFYV STAEFPAGFM TQYAPSMLTG GTGHQDLAAG FPVGKRFKSA PVVRVCDTNP MQLGHHATAD GRWRIYVFAD AAAPAAGQQG VPSAVADFAE WIAQAPDSPL AATPSGADLD AWFDVKVIYQ QPHTDIDINA VPAVFKPQVG PFQLTDYEKV YATDPKADIF ELRGLDRGGV IVVVRPDQYV ANVLPLAATA ELGAFFAPLL ATGRAAAV
|
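The gene begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG is one of the alternative start codons and an initiating codon is read as methionine. The sketch below illustrates this on the first 30 bases only. The codon table is built from the standard amino-acid string used by table 11, and the start-codon set is the table 11 list as commonly published. The function and constant names are illustrative and not taken from the source database.

```python
from itertools import product

# Codon order T, C, A, G with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: start codons recognised by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(seq: str) -> str:
    """Translate a DNA string, reading a valid first codon as M."""
    dna = "".join(seq.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
first_30_bases = "GTGCAATTCC ACCATCACGG TTACGTATCC"
print(translate(first_30_bases))  # MQFHHHGYVS
```

The output matches the first block of the protein sequence in the record.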
| |