Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_3858 |
Symbol | |
ID | 5714387 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009956 |
Strand | + |
Start bp | 66007 |
End bp | 68247 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641276771 |
Product | protein of unknown function DUF940 membrane lipoprotein putative |
Protein accession | YP_001542067 |
Protein GI | 159046396 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00089637 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCGG GTCGTTTCGC GGCGAAATAC CGCCTGGCGC GGGGCGGTCT GCTGGCGCTG TTGCTGTCGG TCTCCGCCGC GGCGCTGCAG GCCCAGAGCA CCGAGCAGCT GCTTGCGGTG CAGGGCCCGT CCGAGCGCGG GGCCAGCGCC GCCGCTGCCG GGCCCGAGGG CGCGTTCCGC CCCAGCCCCG AGACCATCTT TCGCCAGCCG AGCCTGAACT TCTACGGGGT GCCGGGGTTG ATCGACCTGC CCTCCGGCGA GGCCATGCCC GACGGCCAGC TCGCGGTGGG CGTGTCGACC TTCGGCGGCA CCACCCGGAC CACGCTCAGC TTCCAGCTCA CCCCGCGCAT CTCGGGCAGT TTCCGCTATT CGGCGATCCG CGACTGGGAC AGCGACGGGT TCGATACCTA TTACGATCGC AGCTTCGATC TGCGCTTTCT GGCCCTTGAA GAGAGCCGCT TTCTGCCCTC GGTGACCATC GGGTTGCAGG ATTTCGCGGG CACCGGGATT TATGCCGGGG AATACATCGC CGCCACCAAG ACTTTTGCCG GGGGCCTCAA GGGCACGGTG GGCCTCGGCT GGGGCCGGTT CGGCAGCGCG AGCTCCTTCG GCGGGCTGAT TTCCGACGAG CGGCCGGCCT TCGATCCCAA CGATACCGGC GGGGAGCCGA GCACCGACCA GTGGTTTCGC GGCCCGCAAT CGGTCTTTGC CGGGATCGAG TGGCAGCCCA CGGACCGGCT GGGTCTGAAG CTGGAATATT CCACCGATGC CTATGAGGCC GAGACCGTCG ACCGCGACGT GTTCGAGCGC GAGTCGGACT GGAATTTCGG GCTGGAATAC CAGGTCAGCG AGGACTGGCG GCTGGGGGGG TATTACCTCT ATGGGGCCGA GTTGGGCGTG ATGGCGCAAT TCCAGCTCAA CCCGCGGCGC CCGGCGGTGC CGATGCGGGT GGCGGCCCCC GACCCGGTGG ACCCGCGCCC GGACCGGGCG GCCAACCCGA CCCTCTGGTC GGCGGACTGG ATCACCATCC CCGGCGCGCA GGAGACCCTG CGCGACGCCC TGGAAGCGCC GCTCGCCGCG GAAGGGATCG AGTTGCAGGC GCTGGCGGCC ACGGGCACCA GCATCGACCT GCGCTATCGC AACGCGCGCT ACCTGTCCTC GGCCAACGCG GTGGGGCGGG TGGCGCGGGT GCTGGCCCGG GTGCTGCCGC CCTCGATCGA GACCTTCCAC CTGACCCCGG AGGTGTCGGG CATGCCGGCG AGCCGGATCA CCCTGCGCCG GACGGCGCTG GAGGAGCTGG AATTCACGCC CCAGGCCGGG GCGCGGCTGC TGGCGCAGAG CGAGATTTCC GAAGCGCCCC CGCTGCCGGA GACGGCGGTG GCCTCCGAGA CGGTCGCGCC GCGCTTCTCC TGGTCCCTGG GGCCCTACCT GGAGCAGAGC TTCTTCGATC CGGACGAGCC CTGGCGGTTC GAGATCGGGC TGGATGCCAG CGCGTCCTAT CAGATCACGC CGAACCTGAG CCTGTCGGGG TCGATCACCA AGGAAATCGT CGGCACCATC GCCGACAGCA CGCGCGTGTC CAACAGCCAG TTGCCCCCGG TCCGTTCGAA CGGGGTGCTC TATGCGCGGG AGGGCGATCC GGGGCTCGAC AACCTGGTGC TGGCCTATAC GTTCCGCCCC GGTCAGGATC TTTATGGGCG GGTCAGCGCG GGCTACCTGG AATCCATGTT CGGCGGGGTC TCGGCGGAGC TGCTGTGGAA ACCCGTGGAC AGCCGCCTGG CCCTGGGGGT GGAGCTGAAC TATGCCCGCC AGCGGGATTT CGACCAGCGC CTCGGCTTCC AGGATTATGA CGTGATCACC GGGCATGCCT CGGCCTATTA CGCGTTCGGG GATGGCTACC TGGGGCAGGT GGATGTGGGC CAGTACCTGG CGGGCGACAA GGGCGCGACC TTCACCCTGT CGCGGGAATT CGGCAATGGC TGGAAGCTGG GCGGGTTCTT CACGCTGACC GATGTCTCGG CCGAGGAATT CGGCGAGGGG TCGTTCGACA AGGGGATCAT GCTGACGATC CCGGCGGGCT GGATCCTGGG CCAGCCGAAC CGCACCGCCC TGTCGACCAC GATCCGGCCC CTGCAGCGCG ACGGCGGCCA GCGGCTGGAG GTGCCGGGGC GGCTTTACGA CCCGGTCCGC GCGCAACATG CCCGCGCCCT GACGCGGCAA TGGGAACGGG TATGGCAATG A
|
Protein sequence | MRAGRFAAKY RLARGGLLAL LLSVSAAALQ AQSTEQLLAV QGPSERGASA AAAGPEGAFR PSPETIFRQP SLNFYGVPGL IDLPSGEAMP DGQLAVGVST FGGTTRTTLS FQLTPRISGS FRYSAIRDWD SDGFDTYYDR SFDLRFLALE ESRFLPSVTI GLQDFAGTGI YAGEYIAATK TFAGGLKGTV GLGWGRFGSA SSFGGLISDE RPAFDPNDTG GEPSTDQWFR GPQSVFAGIE WQPTDRLGLK LEYSTDAYEA ETVDRDVFER ESDWNFGLEY QVSEDWRLGG YYLYGAELGV MAQFQLNPRR PAVPMRVAAP DPVDPRPDRA ANPTLWSADW ITIPGAQETL RDALEAPLAA EGIELQALAA TGTSIDLRYR NARYLSSANA VGRVARVLAR VLPPSIETFH LTPEVSGMPA SRITLRRTAL EELEFTPQAG ARLLAQSEIS EAPPLPETAV ASETVAPRFS WSLGPYLEQS FFDPDEPWRF EIGLDASASY QITPNLSLSG SITKEIVGTI ADSTRVSNSQ LPPVRSNGVL YAREGDPGLD NLVLAYTFRP GQDLYGRVSA GYLESMFGGV SAELLWKPVD SRLALGVELN YARQRDFDQR LGFQDYDVIT GHASAYYAFG DGYLGQVDVG QYLAGDKGAT FTLSREFGNG WKLGGFFTLT DVSAEEFGEG SFDKGIMLTI PAGWILGQPN RTALSTTIRP LQRDGGQRLE VPGRLYDPVR AQHARALTRQ WERVWQ
|
| |