Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_3654 |
Symbol | |
ID | 5714184 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009955 |
Strand | - |
Start bp | 53399 |
End bp | 55099 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641276572 |
Product | hypothetical protein |
Protein accession | YP_001541868 |
Protein GI | 159046196 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.773483 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0000658966 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTCAGATT ACAGTCAACA AGGTGATGAC GCCGAATATA CACTAAGTGC TGAATTTTGG GATAAGGGGC AAGCCCTTGG CTTTCCGATC CAGTTTGATC TCGTGCTCAA GCACCTACGA CAGGACATGA GGGACGACTG GTACTTCGAC TGCCTGCAAT ATGACGATAT ATTTAAGAAT CCGGACGAGG CCAAACGCAT CGTCCTTTCT CTACTGCAGG AATGGAACGG TGTGTACCTC GGCACCCGCA GCGTTGTCCG TAACATTCCC AAGAAAGGCT ATGGGGAACG CTACGGTCTG GAAACCGATT TCTTTGATAG GTTTGTCTAT CAGGCGATCT GCACCTTCCT TATTCCTTAC TATGACAGTC TGCTCAGCCA CCGAGTGCTA AGCTACCGCC ACGACTCTGC GCCGCAGAAC TCCAAATACT TGTTCAAGAA CAAGATCGAT CGATGGTTCA CATTTGAGGG GATAACTCTT ACGTTCGCCC GATCGAACCA GCATCTTCTG GTGACCGATC TCAGCAATTT TTTCGAGAAC ATCTCGCGAG AGCAGATCAT CGCAGCCTTG GAGAAGGCTA TACCAGAGAT CGTGGCGACG GGCCCTGAAA AGCTCCAAAT TCGCAATGCT ATCCGTACGT TGGACAGGCT GCTCGAACAA TGGACGTTCA GCCGCGACCA TGGTCTACCA CAGAACCGGG ACGCATCTTC GTTTCTTTCC AACATCCTGC TCTCTTCCGT TGATCGCGAA ATGGCCAAGA AGGGTTACGA TTACTACAGA TACGTGGACG ACATCCGCGT TTTGGCCGAC ACGGAGATCC ACGCGCGCCG CGCACTCCAG GATATTATCA GGGAGCTACG GAAGGTCGGC CTGAACATCA ACGCGAGTAA GACCGAGATA TTGCCGCCTA ACGCTTCACT AGAGAAGTTG GTCGCCCACT TCCCTTCGCA AGACAGCGCG ACCACCGCCA TTAATCAGAT GTGGCAATCT CGGAGCCGCA GAATTGTGAC CCGTTCTGTT GAATACATCT TTGGTATCCT GACCAGCTGC ATTGAAGCTG GTGACACTCA AACGCGACAG TTCCGCTTCG CGGTAAACCG CGTGGCCCAG ATTGTGGAAT CTGGCCTCTT CGACGTCGGC GACGCACTAT CAGCTTCACT GCTCGATACG CTTTCCCGCT CGCTTTCGGA GCATGCTGTA TCCACAGATC AATACTGCCG GCTTATTTCA ACACTAGACC GAGATGGCCA ATGCCTTCCG GCGCTGGAGG GGTTTCTGTT GGCGGAGGAC CGAGCCATCC ACGATTGGCA GAACTACAAC ATTTGGATGC TGCTTGGGTC GAAGAGACAC CGATCGGATA GGTTGGTTGA TCTCGCCGCT AGGAAGCTCC ACGAGGACAT CAGGTCCGGC GAGGCTGCTG CCATCTTAAT CTGGCTGCGG TGTGTCGGTG AGACGGCACT TATCCGCGGA TGCATCGAAA AGTTTAGCGA ACTGCCTTAT CAGAACGCCC GCTATCTCTT GATTGCCTCC TCGGTGCTTC ACAAAGATGA CCTCAAGCCG CTCTATGGCC TTGTGCCTAT CTGCCTCAAG GGCACAGGGC CAAGAGCCGA GCGTTACACC AGCGAGGAAG GTCTTCCCTT TGCAAAACGG GAAGCGCCCG ATCTGCTAAA CCTTGTCGAC GAGGTCAGTG AGTATGATTG A
|
Protein sequence | MSDYSQQGDD AEYTLSAEFW DKGQALGFPI QFDLVLKHLR QDMRDDWYFD CLQYDDIFKN PDEAKRIVLS LLQEWNGVYL GTRSVVRNIP KKGYGERYGL ETDFFDRFVY QAICTFLIPY YDSLLSHRVL SYRHDSAPQN SKYLFKNKID RWFTFEGITL TFARSNQHLL VTDLSNFFEN ISREQIIAAL EKAIPEIVAT GPEKLQIRNA IRTLDRLLEQ WTFSRDHGLP QNRDASSFLS NILLSSVDRE MAKKGYDYYR YVDDIRVLAD TEIHARRALQ DIIRELRKVG LNINASKTEI LPPNASLEKL VAHFPSQDSA TTAINQMWQS RSRRIVTRSV EYIFGILTSC IEAGDTQTRQ FRFAVNRVAQ IVESGLFDVG DALSASLLDT LSRSLSEHAV STDQYCRLIS TLDRDGQCLP ALEGFLLAED RAIHDWQNYN IWMLLGSKRH RSDRLVDLAA RKLHEDIRSG EAAAILIWLR CVGETALIRG CIEKFSELPY QNARYLLIAS SVLHKDDLKP LYGLVPICLK GTGPRAERYT SEEGLPFAKR EAPDLLNLVD EVSEYD
|
| |