Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1960 |
Symbol | |
ID | 5712954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2056060 |
End bp | 2057664 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641267884 |
Product | phosphodiesterase/nucleotide pyrophosphatase |
Protein accession | YP_001533301 |
Protein GI | 159044507 |
COG category | [S] Function unknown |
COG ID | [COG3379] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.151896 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.00177875 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCACAGA CGAAAATTCA GACGGTTATC ATTGGCCTTG ATGGGGCGAC CTACGACATG CTCGACCACC TGGTCGCGGA GGGGGTCATG CCCAATTTGG GGCGTATCAT GGCCGAAGGC GCGCGCGGGA TCCTGGCCTC GACGATCCAC CCGCTGACCC CGCCGGCCTG GGCCACGCTG ATGACCGGGC GCAGCCCCGG CAATCACGGT GTGTTCGACT TCATCCGGGT CGACCGCGAA GGGTCCAAGC CCAGCTACAC GCTGGCGACC TCGGCGGATG TGAAGGTGCC GACCATCTGG CAGATCGCGA GCGCCGCCGG CAAGCGGGCG ACGACCCTGA ACTACCCGGT GATGTTCCCG GCCAAGCCGA TCGACGGGGT GGTGATCCCG GGCTATGTGC CCTGGTCCTA TCTCGGCCGC GCCATCCACC CGCGCGAGAC CTTCAAGATG CTGAAGGCCA AGGGGGTCTT CAAGGCCTCC GAGATGTCCA CCGACTGGCA GCATGAGCGC AAGGCCGTGC AGGGCCTGTC GGAGAACCAG CTGACCGATT GGGTGCAGTT CCACATCACG CGGGAACAGC GCTGGCAGGA CATCCTGCTG ACCCTGATGG AGGAAGAGCC GTCAGAGCTG ACAGCCGTTC TGTTCGACGG CGTCGACCGG ATCCAGCACC TCTGCTGGCA CCTGATCGAC CCGGTCAGCC GGGACGACTA CACCACGCCG GAGTCGGTGG CGGCACGGGA GCTGGTGCTG CAATATTTCC GCAACGTGGA CGACTACCTG GTGCAGATCA TCGACAAGGC CGGGCCCCAG GCGCAGGTCT TCATCGTGTC CGATCACGGC TTTACCCGCT CGGGCACGCG GATCTTCTAT GCCAATACCT TCCTGGAACA TGCGGGGCTC CTGACCTGGA ACGCGGGCGT GGCAATGGAC GACCAGGGCC GCGTGGCGCT CGACGAGAAT ACCGAGGCCA GCACCCTGAT CGACTGGGCC GAGACCAAGG CTTATTCGCT CAGTTCCTCC TCCAACGCGA TCTTCATCCG CCGCGCGGCC AAGCCGGGCG ATCCCGGCGT CACCGATGCG GCGTACGAGG CGTTCCGCGA CGACCTGATC GCGCAGCTTC TGGCCTTCAC CGACCCCGAG ACCGGCAAGC CGGTGATCAA GTCGGTGTTC AAGCGCGAAG ATGCCTTCCC CGGCACCCAG ACCGAGCGCG CCCCGGACCT GACCCTGCAG CTCCATGATT ACAGCTTCCT GTCGGTGCTG CGCGCGGACC AGCCGATCAA GGACCGGCGC GTGCCCTATG CCACCCACCA CCCGGACGGC ATCTTCGTGG CCACCGGGCC GGGGATCGCG GCGGGCACGG CGCTCGACCG GCTGCAGATC GCCGACGTGG CGCCCACGGC GCTCTATTCC TGCGGGGTCG AGGTGCCCTC GGAGATGGAG GGCAAGGTGG CCGAGCAGGC CTTCGCCGAG GCCTACAAGG CTGACAACCC GATCCGTTAT ACCGCGGGCG AGGGGGCGGC GGCGGGCGAT ACCGACGACG CGGCCCTGAC CGGCGACGCC GAAGAGCAGA TCCGCGAACG CCTGAAATCC CTCGGGTATC TCTGA
|
Protein sequence | MSQTKIQTVI IGLDGATYDM LDHLVAEGVM PNLGRIMAEG ARGILASTIH PLTPPAWATL MTGRSPGNHG VFDFIRVDRE GSKPSYTLAT SADVKVPTIW QIASAAGKRA TTLNYPVMFP AKPIDGVVIP GYVPWSYLGR AIHPRETFKM LKAKGVFKAS EMSTDWQHER KAVQGLSENQ LTDWVQFHIT REQRWQDILL TLMEEEPSEL TAVLFDGVDR IQHLCWHLID PVSRDDYTTP ESVAARELVL QYFRNVDDYL VQIIDKAGPQ AQVFIVSDHG FTRSGTRIFY ANTFLEHAGL LTWNAGVAMD DQGRVALDEN TEASTLIDWA ETKAYSLSSS SNAIFIRRAA KPGDPGVTDA AYEAFRDDLI AQLLAFTDPE TGKPVIKSVF KREDAFPGTQ TERAPDLTLQ LHDYSFLSVL RADQPIKDRR VPYATHHPDG IFVATGPGIA AGTALDRLQI ADVAPTALYS CGVEVPSEME GKVAEQAFAE AYKADNPIRY TAGEGAAAGD TDDAALTGDA EEQIRERLKS LGYL
|
| |