Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_4043 |
Symbol | |
ID | 4447879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 4562840 |
End bp | 4564030 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639691874 |
Product | 4-hydroxybenzoate 3-monooxygenase |
Protein accession | YP_833518 |
Protein GI | 116672585 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | [TIGR02360] 4-hydroxybenzoate 3-monooxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGCCC GCAAAGTACT CACCACCCAA GTGGCCATTC TGGGTGCAGG ACCGGCAGGC CTGATGCTCT CGCATCTGCT CGCCAAGTCC GGCATAGAAT CCACCGTCAT CGAAATCCGG AGCCGCCAGG AAATCCAGGA GACAGTCCGC GCCGGCATCC TGGAGCACGG CACCGTGAAC ATGTTGGTGG ACAGCGGCGT CTCGGACCGC GTGCTGCGCG ACGGTGACCG GCACGACGGC ATTGAACTGC GCTTCAACGG CGAAAGCCAC CGGATCGATT TCCAGGACCT GGTGGGGGAG TCGGTGTGGC TCTACCCGCA GACCGACGTG TTCATGGACC TGGCCGCTCG CCGCGAGGCC GACGGCGGAG ACGTCCGCTA CAGCGTAACG GATACCACCA TCCACGACAT CGAAGGATCC CCAAAGGTCT GGTTCACGGA CGCCGACGGT GTGGAATATG AGCTGCAGGC CGATTTCATC ACCGGCGCGG ACGGCTCCCG CAGCCACTGC CGCTTCCAGA TCCCTGAGGC GGACCGCAAA TGGTACTTCC ACGAGTACCC CTTCGCGTGG TTCGGCATCC TGGCTGAGGC GCCCCGGAGC TCCGACGAGC TGATCTACGC CAACTCGGAC AACGGCTTTG CGCTGATCAG CCAGCGGACC GAAACCGTTC AGCGGATGTA CTTCCAATGC GATCCCAACG AGGACGTGAA CAACTGGAGT GAGGACCGCA TCTGGGACGC CTTCCGCAGC CGGGTCAACG GCAACGGCTA CGAGCTCAAG GAAGGCCCGG TCATCGACAA GATGGTGCTG AAGTTCCGCA GCTTCGTCCA CGCCCCCATG CGCCACGGAA AGCTCTTCCT GGCCGGCGAC GCCGCCCACA CCGTCCCGCC CACCGGCGCC AAGGGCTTGA ACCTGGCCAT CCACGACGTC AAGGTCCTTT TCGAAGGGCT GGAAAGCTAC TACAAGGGCG GTTCAACGGT GCTGTTGGAC GCCTACAGCG ACCGCGCCCT GGAGCGCGTC TGGAAGGCAC AGCAGTTCTC CTACTGGATG ACCTCGATGC TCCACACGCC GGTGGACGCC GACGACTTCT CCCGGGCCCG CCAGCTGGGT GAACTCAACT CGGTGGTCTC CTCCCGCCAC GGCCGGGCCT ACCTGGCCGA GGCCTACACC GGCTGGCCCG GGGCCCACTA G
|
Protein sequence | MAARKVLTTQ VAILGAGPAG LMLSHLLAKS GIESTVIEIR SRQEIQETVR AGILEHGTVN MLVDSGVSDR VLRDGDRHDG IELRFNGESH RIDFQDLVGE SVWLYPQTDV FMDLAARREA DGGDVRYSVT DTTIHDIEGS PKVWFTDADG VEYELQADFI TGADGSRSHC RFQIPEADRK WYFHEYPFAW FGILAEAPRS SDELIYANSD NGFALISQRT ETVQRMYFQC DPNEDVNNWS EDRIWDAFRS RVNGNGYELK EGPVIDKMVL KFRSFVHAPM RHGKLFLAGD AAHTVPPTGA KGLNLAIHDV KVLFEGLESY YKGGSTVLLD AYSDRALERV WKAQQFSYWM TSMLHTPVDA DDFSRARQLG ELNSVVSSRH GRAYLAEAYT GWPGAH
|
| |