Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_0623 |
Symbol | yedY |
ID | 5712077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 615171 |
End bp | 616088 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641266526 |
Product | putative sulfite oxidase subunit YedY |
Protein accession | YP_001531970 |
Protein GI | 159043176 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.689313 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACGCA CAACCGATCT GAACACGGGC CTGACACATG CGGACATCAC GCCCAGATCC GTTTTCCTCA ATCGCCGGAG CTTCATGGCA AGCGCCGCGG CTGCGGGGAT CTTCGCCGGC AGCCAGGCCA CCGCGCGCGA TTTCGCGGGC GGCGAGACGC CCAACACGCT GGAAGAGATC ACCAGCTACA ACAATTTCTA CGAGTTCGGC TTCGACAAGG GCGATCCGGC GAAATACGCC CATGCGCTGA CCACCGATCC CTGGTCGATC ACGGTGGACG GTTTGGTCGA CAACCCCGGA ACCTATGACA TCGCCGACTT GGTGGATGCG GCAGAGATCG AAGAACGCAT CTACCGCTTC CGCTGTGTCG AGGCCTGGTC GATGGTCATT CCCTGGAACG GGATCGAGTT GAACAAGGTG CTGGCCAAAC TCGGTGTGCA GCCCGGCGCG AAATACGTGG CCTTCGAGAC GCTCGTGCGC CCGTCCGAGA TGCCGGCGCA GCGGGGCGGG GGCAATATCG AATGGCCGTA TCGCGAAGGT CTGCGCCTGG ACGAGGCGAT GCATCCGCTG ACCCTGTTGG CGACGGGCAT TTACGGCGCG CCCATGCCGA AACAGAACGG GGCCCCGATC CGGCTGGTGG TGCCGTGGAA ATACGGGTTC AAGTCGATCA AGTCGATCGT GCGGATCTCG CTTGTGGACC AGGAGCCGCC GACCACCTGG AACATGCTGC AGCCGCGCGA GTACGGTTTC TATTCCAACG TGAATCCCGA GGTCGACCAC CCCCGCTGGT CCCAGGCCAC GGAGCGGCGC ATCGGCGACG GGCTATTCGC ACGCCGCCGC GAGACCTTGA TGTTCAATGG CTATGCCGAC CAGGTCGCAA GCCTTTATGA CGGCATGGAC CTGACCGAGT TCTACTGA
|
Protein sequence | MRRTTDLNTG LTHADITPRS VFLNRRSFMA SAAAAGIFAG SQATARDFAG GETPNTLEEI TSYNNFYEFG FDKGDPAKYA HALTTDPWSI TVDGLVDNPG TYDIADLVDA AEIEERIYRF RCVEAWSMVI PWNGIELNKV LAKLGVQPGA KYVAFETLVR PSEMPAQRGG GNIEWPYREG LRLDEAMHPL TLLATGIYGA PMPKQNGAPI RLVVPWKYGF KSIKSIVRIS LVDQEPPTTW NMLQPREYGF YSNVNPEVDH PRWSQATERR IGDGLFARRR ETLMFNGYAD QVASLYDGMD LTEFY
|
| |