Gene Arth_3519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3519 
Symbol 
ID4443829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3956228 
End bp3958144 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content66% 
IMG OID639691343 
Productphenol 2-monooxygenase 
Protein accessionYP_832994 
Protein GI116672061 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.777145 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCAATTCC ACCATCACGG TTACGTATCC GGTGACCCGC GAGTCCAGCC GGCAGCAGGC 
GTAGGCATCA ATCGGCCTAC TGAGCTTCCT GATGAAGTGG ACGTCCTGAT TGTCGGCACA
GGCCCTGCCG GCATGCTGGC CGCCGCCCAG CTGTCCCAGT TCCCGGGCGT CACCACGCGG
ATCGTGGAAC GCCGTGCCGG CAGGCTGCCC ATCGGCCAGG CAGACGGCAT CCAGGCGAGG
AGCGTCGAGA CCTTCCAGGC CTTCGGCTTC GCCGAGCGGA TCATCGCCGA GGCGTACCAC
ATCACCGAAA TGGCGTTCTG GAAGCCGGAC CCGGCCGACC ACTCGCGCAT CATCCGGGGA
GCCCGCGCGG TGGACGACGA GATGGGGATC AGTGAGTTCC CGCACCTCAT CGTCAACCAG
GCCCGCGTGC TGGACTACTT CGCTGAATTC ATGGCGAACT CGCCTACCCG CATGGCGCCT
GACTACGGCT TTGAGTTCCG GAGCCTGGAG GTCACCGGCG AGGGCGAGTA TCCGGTCACG
GTGACCTTGC TTCACACTGC AGGCCCCCAG GAAGGCCGGG AGAAGGTTGT CCGGGCCAAA
TATGTGATTG GTGCAGACGG CGCGCGCAGC AAGGTACGCG ATGCGATTGG CTGCACCCTG
GCCGGCGACG CCGCCAACCA CGCCTGGGGT GTTATGGACG CCCTGGCCGT CACCGACTTC
CCCGATATCC GCACAAAGTG CGCCATCCAG TCCGGGTCGG GCGGAAGCAT CCTGCTCATC
CCGCGCGAAG GCGGCTTCCT GTTCCGCATG TATGTTGACC TCGGCGAGGT GGACCCGAAC
GACAAGGGCG CCGTGCGCAA CACTTCCATC GAAGAGATCA TCCGCAAGGC GAACGAGATC
CTCCACCCGT ACACGCTCGA CGTCCGAAAT GTCGCGTGGC ACAGCGTGTA CGAGGTGGGC
CACCGGCTCA CTGACCGGTT CGACGACGTC CTCCCGGACC AGCGGGGCAC CCGCACGCCG
CGCGTATTCA TCACCGGCGA CGCCTGCCAC ACGCACAGCG CCAAGGCCGG CCAGGGCATG
AACGTCTCCA TGCAGGACGG TTTCAACCTG GCCTGGAAGC TTGGCCACGT CCTCGAGGGC
CGCAGCCCGG AAAGCCTGCT GACCACGTAC TCGGACGAAC GTCAGGTCAT CGCCAAGAAC
CTCATCGACT TCGACAAAGA GTGGTCCACG ATGATGGCGA AGAAGCCTGA AGAGTTCGAG
AGCCCTTCCG AGCTTGAGGA CTTCTACGTC AGCACCGCCG AGTTCCCGGC CGGATTCATG
ACCCAGTACG CCCCGTCGAT GCTCACCGGC GGCACCGGAC ACCAGGACCT GGCCGCCGGT
TTCCCCGTCG GCAAGCGCTT CAAGTCAGCG CCCGTCGTGC GGGTCTGCGA TACCAACCCC
ATGCAACTCG GACACCACGC CACAGCCGAC GGACGGTGGC GCATCTATGT CTTCGCCGAC
GCCGCCGCGC CGGCAGCGGG ACAGCAGGGT GTCCCCTCAG CAGTGGCCGA CTTTGCCGAG
TGGATTGCGC AGGCGCCGGA CTCGCCGCTG GCCGCCACGC CGTCGGGCGC CGACCTCGAC
GCATGGTTCG ACGTGAAGGT GATCTACCAG CAGCCCCACA CGGACATCGA TATCAACGCA
GTGCCGGCGG TGTTCAAGCC GCAGGTTGGC CCGTTCCAGC TGACGGATTA CGAGAAGGTG
TACGCCACCG ATCCGAAGGC TGACATCTTC GAGCTGCGCG GCCTGGACCG CGGCGGCGTG
ATCGTGGTGG TTCGCCCGGA CCAATACGTG GCCAACGTCC TGCCCCTGGC TGCGACGGCG
GAACTCGGTG CGTTCTTCGC ACCCCTCCTG GCTACGGGAC GGGCCGCGGC AGTCTAG
 
Protein sequence
MQFHHHGYVS GDPRVQPAAG VGINRPTELP DEVDVLIVGT GPAGMLAAAQ LSQFPGVTTR 
IVERRAGRLP IGQADGIQAR SVETFQAFGF AERIIAEAYH ITEMAFWKPD PADHSRIIRG
ARAVDDEMGI SEFPHLIVNQ ARVLDYFAEF MANSPTRMAP DYGFEFRSLE VTGEGEYPVT
VTLLHTAGPQ EGREKVVRAK YVIGADGARS KVRDAIGCTL AGDAANHAWG VMDALAVTDF
PDIRTKCAIQ SGSGGSILLI PREGGFLFRM YVDLGEVDPN DKGAVRNTSI EEIIRKANEI
LHPYTLDVRN VAWHSVYEVG HRLTDRFDDV LPDQRGTRTP RVFITGDACH THSAKAGQGM
NVSMQDGFNL AWKLGHVLEG RSPESLLTTY SDERQVIAKN LIDFDKEWST MMAKKPEEFE
SPSELEDFYV STAEFPAGFM TQYAPSMLTG GTGHQDLAAG FPVGKRFKSA PVVRVCDTNP
MQLGHHATAD GRWRIYVFAD AAAPAAGQQG VPSAVADFAE WIAQAPDSPL AATPSGADLD
AWFDVKVIYQ QPHTDIDINA VPAVFKPQVG PFQLTDYEKV YATDPKADIF ELRGLDRGGV
IVVVRPDQYV ANVLPLAATA ELGAFFAPLL ATGRAAAV