Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_3822 |
Symbol | |
ID | 5714351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009956 |
Strand | + |
Start bp | 28782 |
End bp | 29852 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641276737 |
Product | phenylacetate-CoA oxygenase/reductase, PaaK subunit |
Protein accession | YP_001542033 |
Protein GI | 159046362 |
COG category | [C] Energy production and conversion |
COG ID | [COG1018] Flavodoxin reductases (ferredoxin-NADPH reductases) family 1 |
TIGRFAM ID | [TIGR02160] phenylacetate-CoA oxygenase/reductase, PaaK subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0510949 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.275552 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCGTT TTCATCCCCT GTCAGTCACG GATGTCCGCA AGACCATTCG CGACGCGGTC GTGGTGACCC TGAAACCCGT CGACGGGGGC GATTTCGGGT TCATCCAGGG GCAGTACCTG ACCTTCCGGC GGTCCTTCGA CGGCACCGAG CTGCGCCGCT CCTACTCGAT CTGCGCCGGA CGGGATGACG GCGTGCTTCA GGTCGGCATC AAGCGGGTCG AGGGCGGCGC GTTCTCGACC TGGGCCAATG ACAGCCTCGC GCCCGGCATG ACCCTGGAGG CGATGGCGCC GATGGGCAGC TTCCACACGC CCCTCGACCC GCACACGCCC CGCAACTATC TCGCCTTCGC CGGGGGGTCG GGCATCACGC CGATCCTGTC GATCCTGAAG ACCGTGCTCG CCCGGGAGCC CGGCAGCCGC TTGACGCTGG TCTATGCCAA CCGGGGTGTG AACACGATCA TGTTCCGCGA GGAGCTGGAG GATCTCAAGA ACCTGCACAT GGGGCGGCTG ACCGTGATCC ATGTGCTGGA AAGCGACGCG CAGGAGATCG ACCTGTTCAC CGGCCGGGTC GACGGGGCGA AATGCGACGC CCTGTTCGCC CACTGGATCG ACATCGACAG CATCGACACC GCCTTCATCT GCGGGCCCGA ACCGATGATG CTGGGCATCG CGGCGGCGCT GCGGGCGCAC GGGATGACCG ACGACCGGAT CAAGTTCGAA CTTTTCGCCA GCGGCCAGCC CGGGCGCCTG CCGCGCAAAC CCGGCGCCGC CGCCGGTCAC GACCCCGAGG CCCGGGCGAC CGCGGCCACC GTCACCATGG ACGGCGCGGC GCGCAGCTTT GCCATGGACA AGGACCAGTC GATCCTCGAC GCGGCGCTCG CGAACGCACT GGATGCGCCC TATGCCTGCA AGGCCGGGGT CTGCTCCACC TGCAAGTGCA AGGTGCTGGA GGGCGAGGTG GAGATGATCG CCAACCACGC GCTGGAAGAT TACGAGGTGG CAAGGGGGTA CGTGCTGTCC TGCCAGTCCT ACCCGGTGAC GGACCGGGTT GTGGTGACCT ACGACCACTA G
|
Protein sequence | MARFHPLSVT DVRKTIRDAV VVTLKPVDGG DFGFIQGQYL TFRRSFDGTE LRRSYSICAG RDDGVLQVGI KRVEGGAFST WANDSLAPGM TLEAMAPMGS FHTPLDPHTP RNYLAFAGGS GITPILSILK TVLAREPGSR LTLVYANRGV NTIMFREELE DLKNLHMGRL TVIHVLESDA QEIDLFTGRV DGAKCDALFA HWIDIDSIDT AFICGPEPMM LGIAAALRAH GMTDDRIKFE LFASGQPGRL PRKPGAAAGH DPEARATAAT VTMDGAARSF AMDKDQSILD AALANALDAP YACKAGVCST CKCKVLEGEV EMIANHALED YEVARGYVLS CQSYPVTDRV VVTYDH
|
| |