Gene Dshi_3822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3822 
Symbol 
ID5714351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009956 
Strand
Start bp28782 
End bp29852 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content67% 
IMG OID641276737 
Productphenylacetate-CoA oxygenase/reductase, PaaK subunit 
Protein accessionYP_001542033 
Protein GI159046362 
COG category[C] Energy production and conversion 
COG ID[COG1018] Flavodoxin reductases (ferredoxin-NADPH reductases) family 1 
TIGRFAM ID[TIGR02160] phenylacetate-CoA oxygenase/reductase, PaaK subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0510949 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.275552 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCGTT TTCATCCCCT GTCAGTCACG GATGTCCGCA AGACCATTCG CGACGCGGTC 
GTGGTGACCC TGAAACCCGT CGACGGGGGC GATTTCGGGT TCATCCAGGG GCAGTACCTG
ACCTTCCGGC GGTCCTTCGA CGGCACCGAG CTGCGCCGCT CCTACTCGAT CTGCGCCGGA
CGGGATGACG GCGTGCTTCA GGTCGGCATC AAGCGGGTCG AGGGCGGCGC GTTCTCGACC
TGGGCCAATG ACAGCCTCGC GCCCGGCATG ACCCTGGAGG CGATGGCGCC GATGGGCAGC
TTCCACACGC CCCTCGACCC GCACACGCCC CGCAACTATC TCGCCTTCGC CGGGGGGTCG
GGCATCACGC CGATCCTGTC GATCCTGAAG ACCGTGCTCG CCCGGGAGCC CGGCAGCCGC
TTGACGCTGG TCTATGCCAA CCGGGGTGTG AACACGATCA TGTTCCGCGA GGAGCTGGAG
GATCTCAAGA ACCTGCACAT GGGGCGGCTG ACCGTGATCC ATGTGCTGGA AAGCGACGCG
CAGGAGATCG ACCTGTTCAC CGGCCGGGTC GACGGGGCGA AATGCGACGC CCTGTTCGCC
CACTGGATCG ACATCGACAG CATCGACACC GCCTTCATCT GCGGGCCCGA ACCGATGATG
CTGGGCATCG CGGCGGCGCT GCGGGCGCAC GGGATGACCG ACGACCGGAT CAAGTTCGAA
CTTTTCGCCA GCGGCCAGCC CGGGCGCCTG CCGCGCAAAC CCGGCGCCGC CGCCGGTCAC
GACCCCGAGG CCCGGGCGAC CGCGGCCACC GTCACCATGG ACGGCGCGGC GCGCAGCTTT
GCCATGGACA AGGACCAGTC GATCCTCGAC GCGGCGCTCG CGAACGCACT GGATGCGCCC
TATGCCTGCA AGGCCGGGGT CTGCTCCACC TGCAAGTGCA AGGTGCTGGA GGGCGAGGTG
GAGATGATCG CCAACCACGC GCTGGAAGAT TACGAGGTGG CAAGGGGGTA CGTGCTGTCC
TGCCAGTCCT ACCCGGTGAC GGACCGGGTT GTGGTGACCT ACGACCACTA G
 
Protein sequence
MARFHPLSVT DVRKTIRDAV VVTLKPVDGG DFGFIQGQYL TFRRSFDGTE LRRSYSICAG 
RDDGVLQVGI KRVEGGAFST WANDSLAPGM TLEAMAPMGS FHTPLDPHTP RNYLAFAGGS
GITPILSILK TVLAREPGSR LTLVYANRGV NTIMFREELE DLKNLHMGRL TVIHVLESDA
QEIDLFTGRV DGAKCDALFA HWIDIDSIDT AFICGPEPMM LGIAAALRAH GMTDDRIKFE
LFASGQPGRL PRKPGAAAGH DPEARATAAT VTMDGAARSF AMDKDQSILD AALANALDAP
YACKAGVCST CKCKVLEGEV EMIANHALED YEVARGYVLS CQSYPVTDRV VVTYDH