Gene Information |
Locus tag | Dshi_3016 |
Symbol | deoC |
ID | 5710868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 3184761 |
End bp | 3185780 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641268943 |
Product | deoxyribose-phosphate aldolase |
Protein accession | YP_001534350 |
Protein GI | 159045556 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0274] Deoxyribose-phosphate aldolase |
TIGRFAM ID | [TIGR00126] deoxyribose-phosphate aldolase |
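
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(3184761, 3185780)
print(length_bp)                  # 1020
print(protein_length(length_bp))  # 339
```

Under these assumptions the results agree with the Gene Length (1020 bp) and Protein Length (339 aa) fields of the record.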
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0930267 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.304024 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCAGCA CCCCCACGGA CACCACGACC ACGGGCCAAC AGATCGACCT GCACCGGGGG CACCTGCCCG ACACGGTTCT GCCCCGCAAT CCGGGGCTTC CGCTGGACCT CGACTGGGTG CGGTCGGCGG CAGTGAACAC CTCGGCCGTG GAACGCCGGG CCGCAAGCCT GCCCGGGCGG AGATCGGTCA AGAAGGACCA TCAGGCGGCC TGGCTGCTGA AGGCGGTCAC GTTGATCGAC CTGACCACGC TGGCGGGCGA CGACACGGCT GGCCGGGTGC GGCGGCTATG TGCCAAGGCG CGCCAGCCCG TCGCACCGGA GGTTCTGGCC GCGCTCGGCA TGGGGCCGGT CACCACCGGG GCGGTCTGCG TCTATCACGA CATGGTCCAC GTGGCGGTCG AGGCGCTGGA GGGCTCCGGC ATCCCCGTCG CCGCCGTCTC CACCGGGTTT CCCGCCGGCC TGTCGCCCTT CCACCTGCGC GTGGCCGAGA TCGAGGAAAG CGTTGCGGCG GGGGCTGCCG AGATCGACAT CGTGATCTCG CGCCGCCATG TCCTGACCGG CAATTGGCAA GCGCTCTATG ACGAGATGCG GGCGTTTCGC GCGGCCTGCG GCGACGCCCA TGTGAAGGCG ATCCTGGCCA CGGGCGAGTT GGGGGACCTG GGCAACGTCG CCCGCGCGAG CCTTGTCTGC ATGATGGCGG GCGCGGATTT CATCAAGACC TCCACCGGCA AGGAGAGCGT GAATGCCACC CTGCCCGTGA GCCTCGTGAT GATCCGGGCG ATCCGGGACT ATGAGGCCGC GACGGGGGTG AAGGTCGGCT ACAAGCCTGC GGGCGGGATC TCCAAGGCCA AGGATGCGCT GGTCTACCTC AGCCTGATCA AGGAAGAGCT GGGCGATCGG TGGCTGCAGC CGGATCTGTT CCGGTTCGGG GCCTCGTCTC TCTTGGGCGA TATCGAGCGG CAGTTGGAGC ACCACGTCAC CGGCGCCTAT TCCGCGACGC ACCGGCACGC GCTGGGCTGA
|
Protein sequence | MSSTPTDTTT TGQQIDLHRG HLPDTVLPRN PGLPLDLDWV RSAAVNTSAV ERRAASLPGR RSVKKDHQAA WLLKAVTLID LTTLAGDDTA GRVRRLCAKA RQPVAPEVLA ALGMGPVTTG AVCVYHDMVH VAVEALEGSG IPVAAVSTGF PAGLSPFHLR VAEIEESVAA GAAEIDIVIS RRHVLTGNWQ ALYDEMRAFR AACGDAHVKA ILATGELGDL GNVARASLVC MMAGADFIKT STGKESVNAT LPVSLVMIRA IRDYEAATGV KVGYKPAGGI SKAKDALVYL SLIKEELGDR WLQPDLFRFG ASSLLGDIER QLEHHVTGAY SATHRHALG