Gene Sare_1268 details


Gene Information

Locus tag: Sare_1268
Symbol: mhpA
ID: 5704481
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1468568
End bp: 1470160
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 72%
IMG OID: 641270783
Product: 3-(3-hydroxyphenyl)propionate hydroxylase
Protein accession: YP_001536164
Protein GI: 159036911
COG category: [C] Energy production and conversion; [H] Coenzyme transport and metabolism
COG ID: [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases
TIGRFAM ID:
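
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one untranslated terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1468568, 1470160)
print(length, protein_length(length))  # prints: 1593 530
```

Both values agree with the Gene Length and Protein Length fields of the record.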


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000150974
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGAGCGACG CGCACGTCGT CATCGTCGGC AACGGGCCCG TCGGTGCCAC CCTGTCGGTG 
CTGCTCGCCC AACGCGGCTG GCGGGTGACC GTGATCGAAC GCCGCCCCCG GCCGTACCGG
CTTCCTCGGG CGACCAGCTT CGACGGTGAG ACCGCCCGCC TCCTGGCCGA CACCGGGATC
GGCCCGGACC TCGGCCGGAT CACGGCTCCG GCCAACGGCT ACCAGTGGCG CACGGCCGCC
GGTGAGACCC TGCTGGATAT CGCGTTCAGC ACGCCCGGCG CGTACGGCTG GCCGGACGTG
AACACGATGC ACCAGCCGGC CCTGGAGGAG CTCATCGCCA CCCGGGCGAC GTCGTTGCCC
AGTCTCACCC TGCTGCGCGG ACACGAGGTC GTGGGCATCA GCGAGCACCA CGGCCGGGTG
GAGGTGATCG CCACCGACGA CGACGGCACG ACGCGGCGGG TCTCCGCGCA CTGGGTCGTG
GGATGCGACG GCGCCAACAG CTTCGTCCGC CACCACCTCG ACGTTCCCGT GACGGACCTC
GGGTTCTCCT ACGAGTGGCT GCTCTGCGAC GTCGAACTCC ACGAGCCGCG CGAGTTCGTC
CCCACCAATG TGCAGATCTG CGATCCGGCG CGGCCCACGA CCCAGGTGGG CGGCGGCCCT
GGACGACGGC GCTGGGAATT CATGCGCCTG CCCGGCGAAA GCACGGCCGA GCTGAACCGG
GACGAGACGG CGTGGCGTCT GCTGGCACCG TTCGGCGTCA GACCCGACAC CGCCACCCTC
CTGCGCCACA CCACGTACAT CTTCCAGGCC CGGTGGGCCG AGCGGTGGCG GGTGGGTCAC
GTCCTGCTGG CCGGCGACGC GGCCCACCTC ATGCCACCGT TCGCCGGTCA GGGCATGTGC
GCCGGCATCC GGGACGTCGT CAACCTGGCG TGGAAGCTCG ACCTCACCCT GCGCGGACTC
GCGGCAGAGT CCCTGATCGA CTCCTACCAA CAGGAACGCC GCGCACAGGC CAAGGAGGCG
ATCCTGGCGT CGGTCCAGCT GGGCCGAGTG ATCTGCGTGA CCGATCCGGT GGCCGCCGCC
GAGCGCGACG CCACGGTGCT GGCCAACCGC CGAGGGCAGC CCCCGGTCCG GCCGGATCCG
GCGAAGCCGC TCTCCGACGG GCTGCTGCAC CGGCGGCCCG GGGCGGACGC GGCCGAGCCG
CCGGCCGGTG CGGTCGCGCC GCAGGGCCGA GTGGCCCTGG GCCGCGACAT CGGGCTGTTC
GACGACGTCG TCGGCCGGGG CTTCGTCCTG CTGACCACCG AGGATCCGCA CACCGCACTC
GACGACGATC GGCTGTCGTT CCTCGGCACG CTGGACACCC ACATGGTGCG GCTGCTGCCC
CCCGGTAACG CGGTGGAGCA GGGCGCGGTG GACGTGGACG ACGTCTACCG CCCGTATCTG
ACGCGGTGCG GGGCGACCGC CCTGCTCATC CGGCCCGATC ACCACGTCTT CGGTGCTGGA
AGCGGCCCGA GCGGCATCCG GGACCTCGTC GACGACCTGC GACACCAGCT ACGAGCTCCG
GCGCCGGTGA GTGCGCTAGG GACCGTCGGC TGA
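
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be joined before any calculation. The sketch below, in Python, strips the whitespace and computes the G+C fraction. The helper names are hypothetical, and the example call uses only the first two blocks of the sequence rather than the full gene; applying it to the complete sequence would allow a comparison with the GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, used only as a small example.
fragment = clean_sequence("TTGAGCGACG CGCACGTCGT")
print(len(fragment), gc_fraction(fragment))
```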
 
Protein sequence
MSDAHVVIVG NGPVGATLSV LLAQRGWRVT VIERRPRPYR LPRATSFDGE TARLLADTGI 
GPDLGRITAP ANGYQWRTAA GETLLDIAFS TPGAYGWPDV NTMHQPALEE LIATRATSLP
SLTLLRGHEV VGISEHHGRV EVIATDDDGT TRRVSAHWVV GCDGANSFVR HHLDVPVTDL
GFSYEWLLCD VELHEPREFV PTNVQICDPA RPTTQVGGGP GRRRWEFMRL PGESTAELNR
DETAWRLLAP FGVRPDTATL LRHTTYIFQA RWAERWRVGH VLLAGDAAHL MPPFAGQGMC
AGIRDVVNLA WKLDLTLRGL AAESLIDSYQ QERRAQAKEA ILASVQLGRV ICVTDPVAAA
ERDATVLANR RGQPPVRPDP AKPLSDGLLH RRPGADAAEP PAGAVAPQGR VALGRDIGLF
DDVVGRGFVL LTTEDPHTAL DDDRLSFLGT LDTHMVRLLP PGNAVEQGAV DVDDVYRPYL
TRCGATALLI RPDHHVFGAG SGPSGIRDLV DDLRHQLRAP APVSALGTVG
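
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following Python sketch is written under the assumption that translation table 11 (the value listed in the record) uses the standard codon assignments and reads an alternative start codon such as TTG as methionine at the first position. The function and constant names are illustrative.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; table 11 is assumed to share them.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate_cds("TTGAGCGACG CGCACGTCGT CATCGTCGGC"))  # prints: MSDAHVVIVG
```

The output matches the first block of the protein sequence above.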