Gene RoseRS_2569 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2569 
Symbol 
ID5209538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3184847 
End bp3186412 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content60% 
IMG OID640596173 
Product4-hydroxyphenylacetate 3-hydroxylase 
Protein accessionYP_001276895 
Protein GI148656690 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2368] Aromatic ring hydroxylase 
TIGRFAM ID[TIGR02310] 4-hydroxyphenylacetate 3-monooxygenase, oxygenase component 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTCG AAACAGTAGA GCGAACGACA CTGCCCCTCA CCGGCGAGGA GTATCTGGAA 
AGTCTGCGGG ATGGGCGCGA AATCTGGATC TATGGCGAGC GAGTCAAAGA TATCACCACC
CATCCGGCGT TCCGCAACGC AACCCGTATG GTTGCCCGCC TCTACGATGC ACTGCATGAC
CCTGAGAAGC AGGCGGTGCT GACCTGTCCG ACCGATACCG GCAACGGCGG CTTTACGCAC
AAGTTTTTCC GCGCGTCGCG CAGCGCGGAG GAATTGGTTG GCGCACGCGA TGCAATTGCC
GAGTGGGCGC GGTTGACGTA TGGCTGGATG GGGCGCAGTC CTGATTACAA AGCCGCCTTT
CTGGCGACGC TCGGCGCGAA TGCGGAGTTC TACACCCCCT ATCAGGAAAA TGCACGCCGC
TGGTACCGCG AGTCGCAGGA GCGAGTGCTC TACTTCAACC ATGCGATTGT CAACCCGCCG
ATTGATCGCA ACCGCTCCCC GGATGAAGTC CGTGACGTGT ACATGCACGT CGAGCGCGAG
ACCGATGCCG GATTGATCGT CAGCGGCGCC AAGGTGGTCG CCACCGGCTC GGCGCTGACC
CACTACAATT TCATTGCCCA CTACGGTCCA CTGCCGATCA AGAGCAAAGA GTTCGCCCTG
ATCTTCATCG TGCCGATGGA TGCGCCGGGC GTCAAACTGA TCGCCCGCCC GTCGTATGAG
ATGGCGGCGG AGGTGATGGG CAGCCCATTC GATTATCCGC TTTCGAGTCG CCTCGATGAG
AATGACTCGG TGATGGTCTT CGACCAGGTG CTGATCCCCT GGGAGAATGT CTTTGTCTAC
GGCGATGTTG AGAAGGTGAA TGCCTTCTTC CCGCTCTCCG GCTTTATTCC GCGCTTCACG
TTCCACGGCT GCACCCGCAT GGCGGTGAAA CTCGACTTTA TCGCCGGTCT GTTCCTGAAG
GCGATCGACG CAACAGGGGC GAAGGATTTT CGCGGCGTTC AGGCGCGCGT CGGCGAGGTG
CTTGCCTGGC GGAACCTGTT CTGGGCGATC AGCGACGCCA TGGCGCGCAC GCCGATTCCC
TGGAACGAGG GGGCGGTGCT GCCGAATCTG GATTACGGTC TGGCGTATCG CGTCTTCGCC
ACCGTCGCGT ATCCGCGCAT CAAGGAACTG ATCGAGAATG ATGTCGCCAG CGCGCTCATC
TATCTCAACT CGCACGCCGT CGATTTCAAG ACGCCGGAAA TACGCGGCTA CCTGGATAAG
TATCTGCGCG GGTCGAACGG CTACTCGTCG CTGGATCGCG TCAAACTGAT GAAGTTGTTG
TGGGATGCGA TCGGTTCGGA GTTTGGCGGA CGGCACGAAC TCTATGAGCG GAACTATGCC
GGCAACCACG AGAACATTCG CCTGGAAGTG TTGCTGACGG CGATGGCAAC CGGCGCTGCC
GATCGCTACA AAGGATTCGC CGATCAGTGT CTCAACGAGT ACGATCTCGA CGGCTGGACG
GTTCCCGATC TGATCAACCC CGACGATGTG AATATCGTGA TGCAACGGTT CGGCGCCAGA
CAGTAG
 
Protein sequence
MTVETVERTT LPLTGEEYLE SLRDGREIWI YGERVKDITT HPAFRNATRM VARLYDALHD 
PEKQAVLTCP TDTGNGGFTH KFFRASRSAE ELVGARDAIA EWARLTYGWM GRSPDYKAAF
LATLGANAEF YTPYQENARR WYRESQERVL YFNHAIVNPP IDRNRSPDEV RDVYMHVERE
TDAGLIVSGA KVVATGSALT HYNFIAHYGP LPIKSKEFAL IFIVPMDAPG VKLIARPSYE
MAAEVMGSPF DYPLSSRLDE NDSVMVFDQV LIPWENVFVY GDVEKVNAFF PLSGFIPRFT
FHGCTRMAVK LDFIAGLFLK AIDATGAKDF RGVQARVGEV LAWRNLFWAI SDAMARTPIP
WNEGAVLPNL DYGLAYRVFA TVAYPRIKEL IENDVASALI YLNSHAVDFK TPEIRGYLDK
YLRGSNGYSS LDRVKLMKLL WDAIGSEFGG RHELYERNYA GNHENIRLEV LLTAMATGAA
DRYKGFADQC LNEYDLDGWT VPDLINPDDV NIVMQRFGAR Q