Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1413 |
Symbol | |
ID | 5712589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 1466558 |
End bp | 1467793 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641267325 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_001532756 |
Protein GI | 159043962 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.127019 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.973051 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTCA TCAAAACCGG ATTTTTGCCC GCAGCGATTC TGGCCGTGGG GATCGGCGGG GCGTCGGCCT GCGAGCTGGC CGAAGGCTCC GTGCGCATCC TGTCCAACGA TTTCCCCGCC CTGCAGGCCG TTACCGGCGC GGCGGCGGCC TGTGCCACGG GCGGGGCGAC GGTGACGGCG AACCTGACGA CCGAGCACCG CAACATCCAG GTGGCGGCCC TGACCGCGGA TCCGGCGGAA TATACCTCGG CCGTGGTGTC GAATTCGAGC CTCGTGCCGC TGCTGACCGA CGGGCTGGTG CGGCCGCTGG ACGATCTGAT CGCGGCCCAT GGGGCGGGGG TGTCGCCGAA CCAGAAGATC ATGATCGACG GGCGCACCAT GGCGATCGCC TTCATGGCCA ATGCGCAGCA CCTCTATTAT CGGCGCGACA TCCTGGAGGC GGCGGGGCTG GAGGTGCCGA CGACCTATGA GGAGGTGCTC GCCGCGGCAG AGGCGATCCG CGACCAGGGG CTGATGGAAT TTCCCCTCGG CGGGACGTTC AAGGCGGGCT GGAACCTGGC CCAGGAATTC GTGAACATGT ATCTCGGCCA TGGTGGGGCG TTTTTCGAGC CCGGCTCGGC GGCCCCGGCG ATCAACAATG CCCAAGGGGT TGCGGCGCTG GAGATGATGA AGGCGCTGAC CGCGTACATG AACCCGGATT TTCTCACCTA CGATTCCAAT GCCTTGCAGG CCGAATTTGA AGCAGGAAAC GTGGCGCTGG CCAATTTCTG GGGCTCGCGG GCCGGCGGCG TGACCGATGC GGAGGGCGCG ACCCCCGAGA TCGCGGCGGC CATCGGGTTC GCGGCGGCCC CCACGGTGGC GGGGGGCACG ACGCCGGCGT CGACCCTGTG GTGGGACGGG TTCACCATCG CCACCAACAT CCCGGACGTG GATGCGGAGG CGAGCTTCGT CGCCATGGTC CACGGCGCCT CGACCGAGGT GGCCAATGCC AATCCCAACG CGGCGGTCTG GCTGATCGAC GGCTACACCC CGGGCGACGC GGCGGTGGGC GTTCTGGCTA CGGCGCAGAT GGGCACGTCG CCCTATCCGA TGCTGCCCTA TATGAGCCTG ATGCACACCG CGCTCGGCAC CGAGATCGTC GAGTTCCTGC AAGGCAGCGA GACGGCGGAA CAGGCCCTGT CGGATGTGGA GGCGTCCTAC AGGGCGGCCG CGCGCGAAGC GGGCTTTCTG AACTGA
|
Protein sequence | MNFIKTGFLP AAILAVGIGG ASACELAEGS VRILSNDFPA LQAVTGAAAA CATGGATVTA NLTTEHRNIQ VAALTADPAE YTSAVVSNSS LVPLLTDGLV RPLDDLIAAH GAGVSPNQKI MIDGRTMAIA FMANAQHLYY RRDILEAAGL EVPTTYEEVL AAAEAIRDQG LMEFPLGGTF KAGWNLAQEF VNMYLGHGGA FFEPGSAAPA INNAQGVAAL EMMKALTAYM NPDFLTYDSN ALQAEFEAGN VALANFWGSR AGGVTDAEGA TPEIAAAIGF AAAPTVAGGT TPASTLWWDG FTIATNIPDV DAEASFVAMV HGASTEVANA NPNAAVWLID GYTPGDAAVG VLATAQMGTS PYPMLPYMSL MHTALGTEIV EFLQGSETAE QALSDVEASY RAAAREAGFL N
|
| |