Gene Information |
Locus tag | Sare_0574 |
Symbol | |
ID | 5705560 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 653419 |
End bp | 654639 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641270099 |
Product | phthalate 4,5-dioxygenase |
Protein accession | YP_001535493 |
Protein GI | 159036240 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
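The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 653419
END_BP = 654639
GENE_LENGTH_BP = 1221
PROTEIN_LENGTH_AA = 406


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)        # True
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

With the values in this record, 654639 - 653419 + 1 gives 1221 bp, and 1221 / 3 - 1 gives 406 aa.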
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.105836 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.217979 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCACCA GCAAAATGAA TGATCAGCTC ACCCGGGTGG GGCCCGGGAC GCCGATGGGT CGGCTCCTAC GGGAGTACTG GATGCCCGTC ATGCGCTCCG GGCGGCTGAC CGAGCCGGGC GGAGCGCCCA TGGCAGTCGA GCTGCTCGGC GAGAAGTTCG TCGTCTTCCG TGCCGACGAC GGCACCCTCG GCTGCTTCGC CGAGGCGTGC CCGCACCGGG GTGCCTCGCT CACGTTGGCC CGCAACGAGG ACTGCGCCCT GCGCTGCATC TACCACGGCT GGAAGTTCTC GGTCACCGGC CAAGTCCTGG AGACCCCCTC AGAGCCCGCG GAACTGACCA GGTTCGCCTC CCGGGTGAAG CTTCGGCACC ACCCCGTGAT CGAATCCGGC GGCGTGATCT GGGTGTGGAT CGGCGGCACC GGCCGGTCCG CTCCTGCGCC GCCGGCCTTT ACCTTCACCA AGCTGCCCCC AGAGCAGGTC TTCGGCCTGG TGGCCGTGGT CGAGTGCAAC TGGCTGCAAG GGCTGGAGGC GGACATCGAC TCGGCGCACG TCTCGCTGTT GCACGAGACC GAGGCGCGGG CCGGCGCCTT GCGCGACCTG CTCGACGACC GGACGCCCCG CGACGAGATC GACGAGCAGC CGTACGGCCT GCGCTTCGGC TCTGTCCGGA CGCTGTCCTC CGGCGCAGAG TTGGTCCGGG TCAAACCGTT CGCGATGCCC TGGTACACAG TGGTGCCCGA GCTACCCAGC GGCGACCGGC TCTGGCACGC CTGGGTGCCG ATCAACGACC ATCGCACCAT CATGTGGTAC CTCTGGTACA ACGAGGAGCG TCCGGTCGAT CCGGCCGTCT TCGCCGATCA GTTCGGCCTC AACCTCGACA CGATGAATCC GGACAACATC CGGGAGGGGT TCACCCGGGA GAACAACTGG GGCCAGGACC GCCGGCAGAT GCGGGAGAAC CAAAGCTTCT CAGGCATTCG AGGGCTGGTG CTACAAGACA TCGCCGTGCA GGAGAGCATG GGCCCCATCG TCGACCGGAC CGGGGAGAAC CCGGGCCGCA GCGACACGGC CATCGTCGCG ACCCGCCGCT ACCTGCTCGA CGCGATCAAG CGGCACGAGC GCGGCGAGAC CCCGCCCGGC CTGGGTCCGG AGGCCGACTA CGACCGGGTC CGCTCCGCGG AGATCACGCT GGCTCCGGGC GTCGACTGGC GTTCGGCATG A
|
Protein sequence | MTTSKMNDQL TRVGPGTPMG RLLREYWMPV MRSGRLTEPG GAPMAVELLG EKFVVFRADD GTLGCFAEAC PHRGASLTLA RNEDCALRCI YHGWKFSVTG QVLETPSEPA ELTRFASRVK LRHHPVIESG GVIWVWIGGT GRSAPAPPAF TFTKLPPEQV FGLVAVVECN WLQGLEADID SAHVSLLHET EARAGALRDL LDDRTPRDEI DEQPYGLRFG SVRTLSSGAE LVRVKPFAMP WYTVVPELPS GDRLWHAWVP INDHRTIMWY LWYNEERPVD PAVFADQFGL NLDTMNPDNI REGFTRENNW GQDRRQMREN QSFSGIRGLV LQDIAVQESM GPIVDRTGEN PGRSDTAIVA TRRYLLDAIK RHERGETPPG LGPEADYDRV RSAEITLAPG VDWRSA
|
| |