Gene Arth_1935 details


Gene Information

Locus tag: Arth_1935
Symbol:
ID: 4445554
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2178913
End bp: 2180529
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 61%
IMG OID: 639689745
Product: 4-hydroxyphenylacetate 3-hydroxylase
Protein accession: YP_831417
Protein GI: 116670484
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2368] Aromatic ring hydroxylase
TIGRFAM ID:
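
The start, end, and length fields in this record agree with each other if the coordinates are read as 1-based and inclusive, and if the gene length counts the stop codon. Both readings are assumptions here, not something the record states. The sketch below shows that arithmetic; the function names are illustrative only.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
gene_len = feature_length(2178913, 2180529)
print(gene_len)                  # 1617
print(protein_length(gene_len))  # 538
```

Under those assumptions the computed values match the listed gene length of 1617 bp and protein length of 538 aa.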


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGAAA TTGTTGACCA GTCAGTTCCC GCTACCGATT CCTCGGCGTC CTCCACGCCG 
TCGATGAGTG AGAGAACAAC GCCGGCGAAT CCGAAGATGC CGATGACCGG GGATGAGTAC
ATTAAGTCTT TACAGGATGA CCGCGAGGTC TGGATTTACG GTGAGCGTGT CAAGGACGTG
ACTACCCACC CGGCGTTCCG GAACTCGATC AGGATGGCCG CCCGGCTGTA CGACGCAATG
CACAAGCCCG AGATGCAGGA CAAGCTGATG GTGCCCACGG ACACCGGCTC CGGTGGCAAG
ACGATGTCCT TCTTCCGCAC CCCGCACAGC GTCGAGGATC TGAAGAAGGA CCGCTTAGCG
ATCGAGACCT GGTCCCGCAT GAGCTACGGA TGGCTGGGCC GCTCACCGGA TTACAAGGCC
GCGTTCCTGG GCACCTTGGG CGGCAACACG GACTTCTACG ATCCGTTCCA GGAGAACGCC
AAGCGCTGGT ACAAGGAATC TCAGGAGAAG GTCCTGTACT GGAACCATGC GATCATCAAC
CCGCCGGTGG ACCGGCACCT CCCGGCGGAC CAGGTCGGCG ATGTGTTCAT GAAGGTCGAG
AAGGAAACCG GCTCGGGCCT GATCGTCTCC GGTGCAAAGG TCGTTGCCAC CGGGTCCGCG
ATCACCAACT ACAATTTCAT CGCGCACTAC GGCCTGCCGA TCAAGCGCAA GGAATTCGCA
CTGATCTGCA CCGTGCCGAT GGACGCCCCG GGTGTGAAGC TGATCAGCCG GGCCTCCTAC
GCCCACCAGG CCGCTGTGAT GGGCACCCCG TTCGACTACC CGCTGTCGAG CCGGATGGAC
GAGAACGACT CTGTCTTCAT CTTCGACAAG GTCCTAATCC CGTGGGAGAA CGTCTTCGCC
TACGGGGACG TGGAGAAGAT CAACAACTTC TTCCCGCAGA CCGGCTTCAT CAACCGCTTC
ACCTTCCAGG GTGTGATCCG TCTCGCCACC AAGCTCGACT TCATCGCCGG GCTGCTGATG
AAGGCCCTGG AAGTCGCCGG AACGCAGGAC TTCCGCGGCG TCCAGACCCG GGTGGGCGAG
GTCCTGGGCT GGCGCAACAT GTTCCATGCG TTGATCGACG GCATGACCCT GAACCCCGAC
CAGGGCCCCA ACGGCACCGT GCTGCCCAAG CTCGACTACG GACTGTCCTA CCGGATGTTC
ATGGCCATCG GCTACCCGCG GATCAAGGAA ATCATCGAAC AAGACGTCGC ATCCGGCCTG
ATCTACCTGA ACTCCTCAGC CCTGGACTTC AAGACCCCGG AAATCCGGCC CTACCTGGAC
AAGTACATCC GCGGCTCTGA CGGCGTAGAG GCCGTGGATC GGGTCAAGTT GATGAAGCTG
CTCTGGGACT CCATCGGCAC CGAGTTTGGC GGCCGGCATG AACTCTACGA GCGGAACTAC
TCCGGCAACC ACGAGAATGT GAAGGCCGAG ATCCTCTTCG CCGCGCAGGC CCAGGGCACC
ACGGACTACA TGAAGGGCTT CGCTGACCAG TGCCTCGCCG AATACGACCT GGACGGCTGG
ACCGTGCCCG ACCTGATCAG CAACGACGAC GTCAGCCTCT TCCTCAAGCG CAGCTAG
 
Protein sequence
MTEIVDQSVP ATDSSASSTP SMSERTTPAN PKMPMTGDEY IKSLQDDREV WIYGERVKDV 
TTHPAFRNSI RMAARLYDAM HKPEMQDKLM VPTDTGSGGK TMSFFRTPHS VEDLKKDRLA
IETWSRMSYG WLGRSPDYKA AFLGTLGGNT DFYDPFQENA KRWYKESQEK VLYWNHAIIN
PPVDRHLPAD QVGDVFMKVE KETGSGLIVS GAKVVATGSA ITNYNFIAHY GLPIKRKEFA
LICTVPMDAP GVKLISRASY AHQAAVMGTP FDYPLSSRMD ENDSVFIFDK VLIPWENVFA
YGDVEKINNF FPQTGFINRF TFQGVIRLAT KLDFIAGLLM KALEVAGTQD FRGVQTRVGE
VLGWRNMFHA LIDGMTLNPD QGPNGTVLPK LDYGLSYRMF MAIGYPRIKE IIEQDVASGL
IYLNSSALDF KTPEIRPYLD KYIRGSDGVE AVDRVKLMKL LWDSIGTEFG GRHELYERNY
SGNHENVKAE ILFAAQAQGT TDYMKGFADQ CLAEYDLDGW TVPDLISNDD VSLFLKRS
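
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be stripped before any processing. The following is a minimal sketch of how the gene sequence could be translated and its GC fraction computed. It assumes the elongation codon assignments of translation table 11 match the standard genetic code, and it does not handle the alternative start codons that table 11 allows (this gene starts with ATG). All names are hypothetical helpers, not part of any database tool.

```python
BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
print(translate("ATGACCGAAA TTGTTGACCA GTCAGTTCCC"))  # MTEIVDQSVP
```

The ten residues printed match the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein listing and the reported GC content.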