Gene Arth_2569 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2569 
Symbol 
ID4444895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2882880 
End bp2884334 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content63% 
IMG OID639690388 
Product4-hydroxyphenylacetate 3-monooxygenase, oxygenase subunit 
Protein accessionYP_832048 
Protein GI116671115 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2368] Aromatic ring hydroxylase 
TIGRFAM ID[TIGR02309] 4-hydroxyphenylacetate 3-monooxygenase, oxygenase component 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.488052 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCATCC GCACCGGCCG CCAGTACCTG GACAAACTTA ACGCCATGAC GCCCCACGTG 
GTGATCGACG GGGAAGTCGT CAGTGAAAAA ATCGCCGAGC ATCCCGCCTT CCGGAATGTG
GCCAGGTCCT ACGCCAAGCT CTTCGACATG CAGCACGACC CCGCCCACCA GGAAGCGCTG
ACGTACACCT CGCCCACCAC CGGCGACCTG GTGAATGCGT CCTTCCTGGT TCCCAGGACC
ATCGAGGACC TGCAGCGCCG GCGCAGGGCC ATTTCCACCT GGGCCGAATC CTCCCACGGC
TTCCTTGGCC GCTCCGGGGA CTACATGAAT TCCTCCCTGA CCGCGCTCAG CACCGCGGAA
AAGTGGTTCT CGCAGGCCGA CCCCAAATTC GGCGAGAACA TCCGCAGATA CTACGAATGG
GCGCGGGAAA ATGATGTCCT GGCCACGCAC ACGCTCATCC CGCCCCAGGT CAACCGCTCC
GTCTCGGGCT CCGAGCAGAT GGGTGGACAG TTGTCGGCGC GCATCGTCGA GGAGCGGGAT
GACGGCATTG TGATCAGCGG CGCCCGCATG CTGGCCACGA TCGCTCCCAT TGCCGACGAG
CTCCTGGTGT TTCCGTCCAC CGTGTTGCGG GGCACTCCGG AGGATGCCCC CTATTCCTAC
GCGTTCGCCA TCCCCAATGA TGCGCCGGGC CTGCGCTACC TCTGCCGCGC TTCGCTGTAC
AACGGCGGCA GCACGCATGA CGAACCACTT GCCTCGCGCT ATGAGGAAAT GGACGCGGTG
GCCATCTTCG ACAACGTCTT CGTGCCAAGC GAGCGGATTT TCATGCTTGG CCATCCGCAG
CTCTGTAACG CCTTTTATAC GGAAACCGGC GCGGGCGCCC TCATGACTCA CCAGGTGGTG
ACCCGGACCA TCGCCAAAAG CGAGTTCTTC CTTGGCCTGG CATCAGAGCT GGCAGAGTCC
ATCGGAATCG ACGGCTTCCA GCACATCCAG GAGGACATCG CGGAACTGAT CATCGACGTC
GAAATCGGCA AGGCGCTGGT CCGTGCGTCG GAGGCGGACG CCGGCCTCAA CGAGGCCGGC
GTCATGCTGC CCAAGTGGAC CACGCTGAAT GCCGCCAGGA ACTGGTACCC CAAAATTGCC
CAGCGCTTCC CGCAGATCAT CCGGAAGTTT TCGGCGTCCG GGCTGATGGC ATTACCCGGT
GAGGCCGATG TCAACAGCGA GGCCAGGGCG GACATCGAGA TGTACCTGCA AGGTAAGACA
CTCACCGGCC CGGAACGTGT CCGCCTGTTC AAGCTTGCCT TTGACGCCTC CATCTCAGGC
TTCTCGGGCC GGCAGTCCCT TTACGAGTAC TTTTTCTTCG GTGATCCCGT CCGAATGGCC
GGTGCTCTGG TCAACAGCTA CGACCGCGAG CCGGTACGGG CCAGGGTCCG CGAATTCCTC
AATAGGCAGG ACTGA
 
Protein sequence
MGIRTGRQYL DKLNAMTPHV VIDGEVVSEK IAEHPAFRNV ARSYAKLFDM QHDPAHQEAL 
TYTSPTTGDL VNASFLVPRT IEDLQRRRRA ISTWAESSHG FLGRSGDYMN SSLTALSTAE
KWFSQADPKF GENIRRYYEW ARENDVLATH TLIPPQVNRS VSGSEQMGGQ LSARIVEERD
DGIVISGARM LATIAPIADE LLVFPSTVLR GTPEDAPYSY AFAIPNDAPG LRYLCRASLY
NGGSTHDEPL ASRYEEMDAV AIFDNVFVPS ERIFMLGHPQ LCNAFYTETG AGALMTHQVV
TRTIAKSEFF LGLASELAES IGIDGFQHIQ EDIAELIIDV EIGKALVRAS EADAGLNEAG
VMLPKWTTLN AARNWYPKIA QRFPQIIRKF SASGLMALPG EADVNSEARA DIEMYLQGKT
LTGPERVRLF KLAFDASISG FSGRQSLYEY FFFGDPVRMA GALVNSYDRE PVRARVREFL
NRQD