Gene Sare_0574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0574 
Symbol 
ID5705560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp653419 
End bp654639 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content69% 
IMG OID641270099 
Productphthalate 4,5-dioxygenase 
Protein accessionYP_001535493 
Protein GI159036240 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.105836 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.217979 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCACCA GCAAAATGAA TGATCAGCTC ACCCGGGTGG GGCCCGGGAC GCCGATGGGT 
CGGCTCCTAC GGGAGTACTG GATGCCCGTC ATGCGCTCCG GGCGGCTGAC CGAGCCGGGC
GGAGCGCCCA TGGCAGTCGA GCTGCTCGGC GAGAAGTTCG TCGTCTTCCG TGCCGACGAC
GGCACCCTCG GCTGCTTCGC CGAGGCGTGC CCGCACCGGG GTGCCTCGCT CACGTTGGCC
CGCAACGAGG ACTGCGCCCT GCGCTGCATC TACCACGGCT GGAAGTTCTC GGTCACCGGC
CAAGTCCTGG AGACCCCCTC AGAGCCCGCG GAACTGACCA GGTTCGCCTC CCGGGTGAAG
CTTCGGCACC ACCCCGTGAT CGAATCCGGC GGCGTGATCT GGGTGTGGAT CGGCGGCACC
GGCCGGTCCG CTCCTGCGCC GCCGGCCTTT ACCTTCACCA AGCTGCCCCC AGAGCAGGTC
TTCGGCCTGG TGGCCGTGGT CGAGTGCAAC TGGCTGCAAG GGCTGGAGGC GGACATCGAC
TCGGCGCACG TCTCGCTGTT GCACGAGACC GAGGCGCGGG CCGGCGCCTT GCGCGACCTG
CTCGACGACC GGACGCCCCG CGACGAGATC GACGAGCAGC CGTACGGCCT GCGCTTCGGC
TCTGTCCGGA CGCTGTCCTC CGGCGCAGAG TTGGTCCGGG TCAAACCGTT CGCGATGCCC
TGGTACACAG TGGTGCCCGA GCTACCCAGC GGCGACCGGC TCTGGCACGC CTGGGTGCCG
ATCAACGACC ATCGCACCAT CATGTGGTAC CTCTGGTACA ACGAGGAGCG TCCGGTCGAT
CCGGCCGTCT TCGCCGATCA GTTCGGCCTC AACCTCGACA CGATGAATCC GGACAACATC
CGGGAGGGGT TCACCCGGGA GAACAACTGG GGCCAGGACC GCCGGCAGAT GCGGGAGAAC
CAAAGCTTCT CAGGCATTCG AGGGCTGGTG CTACAAGACA TCGCCGTGCA GGAGAGCATG
GGCCCCATCG TCGACCGGAC CGGGGAGAAC CCGGGCCGCA GCGACACGGC CATCGTCGCG
ACCCGCCGCT ACCTGCTCGA CGCGATCAAG CGGCACGAGC GCGGCGAGAC CCCGCCCGGC
CTGGGTCCGG AGGCCGACTA CGACCGGGTC CGCTCCGCGG AGATCACGCT GGCTCCGGGC
GTCGACTGGC GTTCGGCATG A
 
Protein sequence
MTTSKMNDQL TRVGPGTPMG RLLREYWMPV MRSGRLTEPG GAPMAVELLG EKFVVFRADD 
GTLGCFAEAC PHRGASLTLA RNEDCALRCI YHGWKFSVTG QVLETPSEPA ELTRFASRVK
LRHHPVIESG GVIWVWIGGT GRSAPAPPAF TFTKLPPEQV FGLVAVVECN WLQGLEADID
SAHVSLLHET EARAGALRDL LDDRTPRDEI DEQPYGLRFG SVRTLSSGAE LVRVKPFAMP
WYTVVPELPS GDRLWHAWVP INDHRTIMWY LWYNEERPVD PAVFADQFGL NLDTMNPDNI
REGFTRENNW GQDRRQMREN QSFSGIRGLV LQDIAVQESM GPIVDRTGEN PGRSDTAIVA
TRRYLLDAIK RHERGETPPG LGPEADYDRV RSAEITLAPG VDWRSA