Gene Information |
Locus tag | Sare_2098 |
Symbol | |
ID | 5704712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2416783 |
End bp | 2418360 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641271583 |
Product | 4-hydroxyphenylacetate 3-hydroxylase |
Protein accession | YP_001536954 |
Protein GI | 159037701 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2368] Aromatic ring hydroxylase |
TIGRFAM ID | |
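The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration; they are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2416783
END_BP = 2418360


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp)                  # 1578, matching "Gene Length"
    print(protein_length(length_bp))  # 525, matching "Protein Length"
```

Under these assumptions the computed values agree with the listed Gene Length (1578 bp) and Protein Length (525 aa).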
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0285855 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGACC CACGGGATTC CGACACCGAA CCGTCCCGGC GGCGGGTGAC CCGCCCGCTG ACCGGCGACG AGTACGTCGA GTCGCTGCGC GACGACCGGC AGGTCTACGT CTACGGCGAC CGGGTGCGCG ACGTCACCGC GCATCCTGCC TTTCGCAACC CGGTCCGGAT GACCGCGCGA CTGTACGACG CGCTGCACGA CCCGCAAACC CGGCCCGTGC TCACCGCACC CACCGACACC GGCAGCGACG GGTATACCCA CCGGTTCTTC ACGACCGCGC GCAGCGTCGC CGATCTCGTC GCCGACCAGC GCGCCATCGC GGCCTGGGCT CGCCTCAGCT ACGGCTGGAT GGGACGCGCC CCGGACTACA AGGCGGCGTT CCTCGGCACC CTCGGTGCGA ACGCCGAGTA CTACGCGCCG TTCGCCGACA ATGCCCGGCG CTGGTATCGG GAGTCTCAGG AAAAGGTGCT GTACTGGAAC CACGCGATCG TGCACCCGCC GGTGGACCGG TCCCGACCGC CGGACGAGGT GCGCGATGTC TTCATCCACG TGGAGCGGGA AACCGACGCC GGTCTCGTGG TCAGCGGTGC CAAGGTGGTC GCGACCGCAT CCGCGCTCAC CCACTACAAC TTCCTCGCCC ACTACGGTCT GCCGGTGCGC AAACGCGAGT TCGCGCTCGT CGCCACCGTC CCGATGGACG CGCCCGGTAT GAAGCTGATC TGCCGTCCGT CGTACGCGGC CACGGCCGCG GTGATGGGCA GCCCCTTCGA CTATCCGCTG TCCTCCCGTC TGGACGAGAA CGACAGCATC CTGGTGCTGG ACCGGGTGCT GATCCCGTGG GAAAACGTGT TCATCTACGG CGACCTGGCC AAGGTGCAGA TGTTCGCCGG GCAGTCCGGG TTCACCGAGC GGTTTACCTT TCACGGATGT ACTCGCCTGG CGGTTAAGCT GGAGTTCCTG GCCGGGCTGC TCGCCAAGGC CGTGGAGTTG ACCGGCACCG CCGAATTCCG GGGGGTGCAG AGCCGGCTTG GCGAGGTATT GGCCTGGCGG AACCTCTTCT GGGGCTTGTC TGATGCCGCC GCCCGCAACC CGGTGCCCTG GAAGAACGGC GCGCTGCTGC CCAACCCGCA ATACGGGATG GCGTACCGGT GGTTCATGCA GATCGGCTAT CCCCGGATCC GGGAGATCGC GATGCAGGAT GTCGCCAGTG GTCTGATCTA CGTCAACTCC AGCGCTGATG ACTTCCATAA CCCGGACATC CGCCCCTATC TCGACACGTA CCTGCGGGGG TCGGGCGGCG CGGACGCGGT CGAGCGGGTC AAGCTGATGA AGCTGCTGTG GGACGCCATC GGCACCGAGT TCGGCGGGAG GCACGAGCTG TACGAGCGCA ACTACGCCGG TAACCACGAG AATGTGCGGG TCGAGCTGTT CAACGCACAG ACCCTCGGTG GCGAGGTGGA CGATTACAAG GCATTCGTCG ACGAGTGCCT GCGCGAGTAC GACCTGGACG GCTGGCGGGT ACCGGACCTT GCGTCCTTTC CCGACCTTCG TCGCCTGCAG GACTTGAACG ACGCCTGA
|
Protein sequence | MTDPRDSDTE PSRRRVTRPL TGDEYVESLR DDRQVYVYGD RVRDVTAHPA FRNPVRMTAR LYDALHDPQT RPVLTAPTDT GSDGYTHRFF TTARSVADLV ADQRAIAAWA RLSYGWMGRA PDYKAAFLGT LGANAEYYAP FADNARRWYR ESQEKVLYWN HAIVHPPVDR SRPPDEVRDV FIHVERETDA GLVVSGAKVV ATASALTHYN FLAHYGLPVR KREFALVATV PMDAPGMKLI CRPSYAATAA VMGSPFDYPL SSRLDENDSI LVLDRVLIPW ENVFIYGDLA KVQMFAGQSG FTERFTFHGC TRLAVKLEFL AGLLAKAVEL TGTAEFRGVQ SRLGEVLAWR NLFWGLSDAA ARNPVPWKNG ALLPNPQYGM AYRWFMQIGY PRIREIAMQD VASGLIYVNS SADDFHNPDI RPYLDTYLRG SGGADAVERV KLMKLLWDAI GTEFGGRHEL YERNYAGNHE NVRVELFNAQ TLGGEVDDYK AFVDECLREY DLDGWRVPDL ASFPDLRRLQ DLNDA
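The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction that could be compared with the GC content field of this record. The names `normalize_sequence` and `gc_fraction` are hypothetical helpers, and the sketch assumes the input contains only the letters A, C, G and T plus whitespace.

```python
def normalize_sequence(raw: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = normalize_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence shown above, used as a small example.
    first_blocks = "ATGACTGACC CACGGGATTC"
    print(normalize_sequence(first_blocks))  # ATGACTGACCCACGGGATTC
    print(gc_fraction(first_blocks))         # 0.55 for these 20 bases only
```

Applied to the full gene sequence, `len(normalize_sequence(...))` would be expected to equal the listed gene length, and `gc_fraction(...)` to be close to the listed GC content; neither result is asserted here.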