Gene Information |
Locus tag | Dshi_1962 |
Symbol | |
ID | 5712956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2057989 |
End bp | 2062884 |
Gene Length | 4896 bp |
Protein Length | 1631 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641267886 |
Product | putative modular PKS system |
Protein accession | YP_001533303 |
Protein GI | 159044509 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | [TIGR00556] phosphopantetheine--protein transferase domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0314751 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.000910515 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACAGCAG ACGGTGCCCA CACGCGCCAC GCTGACAACG AACCGATTGC AATCGTAGGC ATCGCCTCGA TCTATCCCGA GGCGCCGGAC ACGGCCACCT ATTGGCGCAA CATCACCGAC AAGGTGGATG CGGTGGCCGA TGCGCCGGCC GATTGGTGCG CGGATCTGTT CTTTGACCCC GAGAGCGCGG ATAACGACCG GATCTACACC AAGCGCGGTG GGTTCCTGCG CGATCTCGCG GTGTTCGACC CGCTGCGCCA CGGGGTCATG CCGACCTCCA TCGACGGGGG GGAGCCGGAC CAGTTCCTCG CGTTGCAGGT GGCCGCCGAC GCGGTGGAGG ATGCCGGTTT CGCCGCGCGC CCCGTCCCCG GCGAACGGGT GGCGGTGATC CTGGGCCGCG GCACCTATAT CAACCGCGGC TTCACCAGCG TGGTGCAGCA CGGGATCATG GTGGATCGCT TCCTGTCGAT CCTGAAAAAG CTCCATCCCG ACACCGACGC GGAAGAGCTG GCCGAGGTCA AGGCGCAGCT CAAGGCCTCC CTGCCGCCCT TCAATGCGGA AATGGCCCCG GCCCTGGTGC CGAACCTGGT GACGGGGCGG ATCTCCAACC GGCTGGACTT CAAGGGCGCC AATTACATCG TGGACGCGGC CTGCGCCTCG TCGCTCATCG CCATGGATCG CGGGGTTGCG GACCTGCGGG CGGGGCGGTG CGACATGGCG GTGGTGGGCG GTGTGCATGC CTCCACCCCC GCACCGATCT ACCAGATCTT CTGCCAGCTC GAAGCGCTGT CGCGGAAGGG GTCGATCAAA CCGTTCTCGG CCAATGCCGA TGGCACGCTT CTGGGCGAGG GCGTCGGGCT GATGTGCATC AAGCGGCTTT CCGATGCGGA GGCCGCGGGC GACCGGATCT ACGCGCTGGT GCGCGGGGTC GGCGTGGCCA GCGACGGCAA GGGGCTGGGC ATTCTCGCCC CCCGGATAGA GGGGGAGACA CTCGCGCTCC AGCGCGCCTA TGAGGATGCC GGGGTCGATC CGGCCACCGT GGGTCTGATC GAGGCCCATG GCACGGCGAC CGCCCTGGGC GACGCGACGG AGGTCGAGGC GCTCGTGGGC CTGATGGGGA CGCGCGACAG CGACGCGCCG TCCTGCGCGC TGGGCTCGGT GAAGTCGATG ATCGGGCATT GCCTGCCCGC CTCGGGCTCG GCCGGAATCA TCAAGACCGC GCTGGCGCTC TATCACAAGA CCCTGCCGCC CACCTTGCTG GACGAGGCGA ACCCTGAGTT GGGGATCGAA GGCTCGCCGC TCTACCTGAA CACCGAAACC CGACCCTGGA TCCACGGGCT GGACACGCCC CGGCGGGCGG GGGTGAACGC CTTCGGCTTC GGCGGGATCA ATGCCCATAC GGTGCTGGAA GAATATCGCG GCGGGGCGGA GCCTGCACCG AACCTCACCC GGTCGAGCGA GCTTCTGGTG GCGGGCGCGC CGGATGCGTC GGGACTGAGC GCACGGCTGG AGGCGCTGAA AACCGCGCTA TCTGGCGACA TGCCCCCGGC AGAGGTCGCG CGGGTCGAGA ATGCCGCCCC GTGCCCCGGC CCCTGGCGCG CGGCCATCGT GGCCCGGGAC GCGGGCGATG CCATGGCCAA GATCGACAAG ATCCTCGGCC GGATCGCCAG CGGCAAGGAC CGGCTCAGGG ATCGCAAGGG GGCGTATCTG AACGCGGCAC CGCTGGCCGC CACGGGCAAG GTCGCCTTCA TGTTCCCCGG GGAGGGGAGC CAGCATCTGG GCATGCTGCG CGAGATCATG 
CTGCATTACC CCCGGATGCG GCAGTGGTTC GACCTGGTGG ACGGGGCCTT CGCGGGCCAT GGCCGCAACC TGCTGCCGAG CCAGGTGATC TTCCCGCCCC CCGGCACCGC CGAAGAGGGC GGCGCGCTGT GGCGGATGGA TATCGGGCCA GAGGCGATTT TCGCCGCAAA TCAATCGGTT GCGGCGCTGT ATGAAAGCAT CGGCTTGCGC CCGGACATGC TGGTCGGCCA TTCCACGGGG GAATACTCGG CGCTCTTTGC CGCCGGGGTG ACGCGGCGCG AAGATCCGGC CGTTCTCCAG GCGGAGATGC GCGCGCTCAA CGCGGAATAC GAGGCGATGG CGGCCGAGGG GCTGATCGCC GAGGGCGCGC TGGTGGCGCT CGGCGCGGTG GATGGCGCGG CGCTGGAGGA CAAGCTGGCC GGGCGCGACG ATGTTTATCT GGCCATGGAC AACTGCCCCA ACCAGAAGGT CGTGGCGGCG TTTTCCGACG CGGGCCGCGA CTATGCCCTG AGCCTCGCGC CCGCGCTGGG CGGGTTCGAC GAGCTCTTGC CGTTCTCGCG CGCCTATCAC AGCCCGGCCT TCGCACCGTT CAGCGCGCGG CTCGAAGGGT TCCTCGCGGG GATGCAGGTC GCGGCCCCCG AACGCCCGGT CTATTCCTGC CTGAGCACCG CGCCCTTTCC AGACTCGCCC GACGCGATCC GCAAGCTGAC CGCCGACCAA TGGTCCGGGC GGGTGCGTTT CGTCGAGACG GTGGAGCGGA TGTATGCGGA CGGGGCGCGG GTCTTCGTCG AATGCGGGCC GCGCAACAAC CTGTCGGCCT TCGTGAACGA CATCCTGCGC GGGCGCGATC ATGTGGCGGT GGCGGTGGAC ACGCCGTCCT TGCAGGGGCT CGACCAGCTG CACCACATGG CCGCCCAGCT GGTGGTGGAG GGGGTGACGC TCGACCTCGG CGCGCTGGCG GCCCCGGTGC GGGCGGTGGC GAAGCCGGCC GGTCCGAGCC GTCCGCAGGT GCTCAAGATG GGGGTGCAGC CCATGGATAT CGCGTCGGTT CCCAAGCGGG CGCAGAGCGC GCCGTCCAAG GCACCGCCCG CGCCGAAACC CGCCCCCGCG GCACCGCCCC CCGCCGCGTC GCAAACCCGT CCCACCGCCA ACTCGCCCGC CGCCCCGACA CCGCCGAAGA TGCCGGTGAC GGTCGCCGCG GCACCCCGGC CGCAGCCCGC GCCCGCCGCG CCCCCGGCAG AGGCCGAAGC GGTGGTCGAC GCCTATTTCG ACACGCTGGA GGGCTTCCTC GCCTCCGAGA CCGAGATCAT GGCCGCCTAT CTCGGCACGG TGCCGGGACA GGCGGACGCG GCCCCCACGC CCATGCCCAC TCCACAGACC CAAGCGCCCC GGCGGTTCCC GATGCTCGGG GCGGTGACCC CGGCCCCCGA CGGTCGCAGC CTGACCGCCC GGATCGAGAT CGACCCGGCC CGCGCGCCCT ACCTGCGCGA TCACAGCTTC GGCCACGGCA TTTCCCAGCG CGACCCGGAG CTGACCGGCC TGCCGGTGGT GCCGCTGACC TTCTCGATGG AGGCGCTGGC CGAGGCCGCC GCGGCCCTGC GCCCGGACCT GGTGGTGACG GGGATGCGGG AGGTGCGCGC CTCGCGCTGG TTCGCCCTGG ATCGCCCGCC CCTGGTGGTG GAGGCTGCGG CGGAACTGCG CGAAGAGGGG CCCGTGACCG TGGTGCGCGT GCGGATCCGC GCCGCCGATG GCCCGGCGCT GCGCCCGACC CTGATCGAGG CGGATGTGAC CCTGGCCCAA GCGCGCCCGG 
AGGCGCCCCT GGCCCAGCCC CAGAGCTATG GCGCCCTGCG CGACGCGCCC TGGTCGCTTG CGGATGTCTA CGGCAAGATC ATGTTCCACG GTCCCAAGCT GCAGGCGGTG GAGGCCATGG ATGTGGTCGG CGCAGGCGGG GTGGAAGGCA CCCTGATCGG GCTGCCCCAT GACGCGCTCT ATGCCGGGGT CGCCGACCCG GGCTTCGAGA CCGACGCGAT CACCCTTGAC GCGGTGGGCC AGCTGGTCGG CGTCTGGTCG GCGGAAATGC TGCCCGAGGC GTTCCACATC TTCCCCTTCC GGGTCGAGGC GCTGGAGATC TTCCGCGACC GCCTCGCCCC CGGCGAGCGC GCGCGCTGCC GCGCGACCAT CGATCTGGTG GGCGCGGACG AGATGCGCGC CGATATCGAC GTGGTCTCGG CCGACGGGCG GCTGCAATGC CGGATCGCGG GCTGGTGGGA CAAGCGGTTC AACCTGCCCG AGCGGTTCTT CCGGGCCCGG CTCGACCCGC CGCGCAATCC GCTCTCGGCG CCCATCGTGC TGCCGGGGGG CGCGCCGTTG CCCGAGGGCT GCGCGCTGCA GCTCTGTGAC GAGATCGGTG ACGATCTGCT GGACGGCTCC GGTGGGATCT GGGCCAAGGT GCTGGCGGGG CTTGTCCTGT CCCGGGCCGA GCGGGCGGAA TTCGCCGCCA TGGACGGGGC CACGGACAAG CGGCGCTGGG AATGGCTGCG CGGGCGCGCG GCGGCCAAGG ATGCGGCCCG CGCGGTGCTG GGCCGCGCGC TCGCGCCCGC GGACATTCCC CTGGTCGTGC CGGGGGAGGA TGCAGGCCCG GTGATCGACC CGACCCTGCC GCCCGCCTGG GGGCCTGCGC CGCTTATCTC GATCGCCCAC AAGGGTGCGC GCGCCCTGGG CCTGGCGGCC GATCCGCGCC GCTATGCCGG GGTGGGCGTG GATCTCGAAG AGATCGCGCC GCGCACGCCC GCGTTCCTGC AGACCGCCTT CACCCCCGCC GAGCGCGCCT CGATCATGGC CCTGCCCGAA GGACAGCCGC GCGATACGGC AGCGACCCGC TATTGGTGCG CAAAGGAAGC AGTCGGCAAA AGCTTCGGGC TCGGCCTCGC CCAGGCCCTT GATCGTTTCG AGGTTCAGGG TGATGGAACG GACGCCCCGC CCGTGATTGT TCGGGATCTG TCAACACACA CCGATATTGC GGTGACCGCG ATGGCCGAGC TTGCGCCGGG CCTGATCGCC GCCGTCGTGC TCAGACCGCA GGACGCAGGA AATTGA
|
Protein sequence | MTADGAHTRH ADNEPIAIVG IASIYPEAPD TATYWRNITD KVDAVADAPA DWCADLFFDP ESADNDRIYT KRGGFLRDLA VFDPLRHGVM PTSIDGGEPD QFLALQVAAD AVEDAGFAAR PVPGERVAVI LGRGTYINRG FTSVVQHGIM VDRFLSILKK LHPDTDAEEL AEVKAQLKAS LPPFNAEMAP ALVPNLVTGR ISNRLDFKGA NYIVDAACAS SLIAMDRGVA DLRAGRCDMA VVGGVHASTP APIYQIFCQL EALSRKGSIK PFSANADGTL LGEGVGLMCI KRLSDAEAAG DRIYALVRGV GVASDGKGLG ILAPRIEGET LALQRAYEDA GVDPATVGLI EAHGTATALG DATEVEALVG LMGTRDSDAP SCALGSVKSM IGHCLPASGS AGIIKTALAL YHKTLPPTLL DEANPELGIE GSPLYLNTET RPWIHGLDTP RRAGVNAFGF GGINAHTVLE EYRGGAEPAP NLTRSSELLV AGAPDASGLS ARLEALKTAL SGDMPPAEVA RVENAAPCPG PWRAAIVARD AGDAMAKIDK ILGRIASGKD RLRDRKGAYL NAAPLAATGK VAFMFPGEGS QHLGMLREIM LHYPRMRQWF DLVDGAFAGH GRNLLPSQVI FPPPGTAEEG GALWRMDIGP EAIFAANQSV AALYESIGLR PDMLVGHSTG EYSALFAAGV TRREDPAVLQ AEMRALNAEY EAMAAEGLIA EGALVALGAV DGAALEDKLA GRDDVYLAMD NCPNQKVVAA FSDAGRDYAL SLAPALGGFD ELLPFSRAYH SPAFAPFSAR LEGFLAGMQV AAPERPVYSC LSTAPFPDSP DAIRKLTADQ WSGRVRFVET VERMYADGAR VFVECGPRNN LSAFVNDILR GRDHVAVAVD TPSLQGLDQL HHMAAQLVVE GVTLDLGALA APVRAVAKPA GPSRPQVLKM GVQPMDIASV PKRAQSAPSK APPAPKPAPA APPPAASQTR PTANSPAAPT PPKMPVTVAA APRPQPAPAA PPAEAEAVVD AYFDTLEGFL ASETEIMAAY LGTVPGQADA APTPMPTPQT QAPRRFPMLG AVTPAPDGRS LTARIEIDPA RAPYLRDHSF GHGISQRDPE LTGLPVVPLT FSMEALAEAA AALRPDLVVT GMREVRASRW FALDRPPLVV EAAAELREEG PVTVVRVRIR AADGPALRPT LIEADVTLAQ ARPEAPLAQP QSYGALRDAP WSLADVYGKI MFHGPKLQAV EAMDVVGAGG VEGTLIGLPH DALYAGVADP GFETDAITLD AVGQLVGVWS AEMLPEAFHI FPFRVEALEI FRDRLAPGER ARCRATIDLV GADEMRADID VVSADGRLQC RIAGWWDKRF NLPERFFRAR LDPPRNPLSA PIVLPGGAPL PEGCALQLCD EIGDDLLDGS GGIWAKVLAG LVLSRAERAE FAAMDGATDK RRWEWLRGRA AAKDAARAVL GRALAPADIP LVVPGEDAGP VIDPTLPPAW GPAPLISIAH KGARALGLAA DPRRYAGVGV DLEEIAPRTP AFLQTAFTPA ERASIMALPE GQPRDTAATR YWCAKEAVGK SFGLGLAQAL DRFEVQGDGT DAPPVIVRDL STHTDIAVTA MAELAPGLIA AVVLRPQDAG N
|
| |
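The coordinate, length, and GC fields in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table of this record.
START_BP = 2057989
END_BP = 2062884
GENE_LENGTH_BP = 4896
PROTEIN_LENGTH_AA = 1631


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Both checks agree with the values listed in the record.
    assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    # GC fraction of the first 10-base block of the gene sequence.
    print(gc_fraction("ATGACAGCAG"))
```

Passing the full Gene sequence field to `gc_fraction` should give a value comparable to the listed GC content, provided the three sequence lines are concatenated first.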