Gene Information |
Locus tag | Dshi_0974 |
Symbol | |
ID | 5710487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 995276 |
End bp | 996583 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641266882 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_001532317 |
Protein GI | 159043523 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
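The coordinate and length fields above agree with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates (the convention under which the listed start, end, and gene length match) and a CDS that ends with a single stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(995276, 996583)
print(length)                     # 1308
print(protein_length_aa(length))  # 435
```

Both results match the Gene Length and Protein Length fields of this record.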
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.606433 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.163223 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTTGA AGAACGCACT TTACGCGGCC ACCGCCCTGA CCCTTGTAAG CTCGGGCGCT ATGGCAAGCG AAAACCTGGT AATCGCAACC GTCAACAATG GCGACATGAT CCGGATGCAG GGTCTGACCC AGGATTTTAC CGACAAGACC GGCCACACGG TCGAGTGGGT GACCCTCGAA GAGAACGTCC TGCGCCAGCG CGTCACGACG GATATTGCTG CCAAGGGCGG CTCCTTCGAC ATCATGACCA TCGGCATGTA CGAAACTCCG ATCTGGGGCG CCAATGGCTG GCTCGTGCCG CTGGACGATC TGTCCGCGGA CTACAACGCC GACGATATTC TGCCCGCGAT GCGCGCCGGT CTGAGCCATG ACGGCACGCT CTATGCTGCG CCGTTCTACG GCGAAAGCTC CATGATCATG TACCGCAAGG ACCTGATGGA GAAGGCCGGG CTGGAAATGC CCGATGCGCC GACCTGGCAG TTCATCCGCG AAGCGGCCGC CGCCATGACC GACCGCGAGA ACGACATCAA CGGCATCTGC CTGCGCGGCA AGGCCGGCTG GGGCGAAGGC GGCGCGTTCA TCACCGTCAC CGCGAACTCC TTTGGCGCGC GCTGGTTCGA CGAGGACTGG AACGCCCAAT TCGACCAACC CGAGTGGAAA GAGGCGCTGG AATTCTTCGT CGGCATGATG AACGAGTCCG GGCCGAACGG CTACGCCACC AACGGCTTCA ACGAGAACCT GAACCTGTTC CAGCAGGGCA AGTGCGGCAT GTGGATCGAC GCCACGGTGG CGGCGTCCTT CGTGACCAAC CCCAACGACT CGACCGTGGC CGACCAGGTG GGCTTCGCCC TCGCCCCGAA CAGCGAGGGC ATCGAGAAGC GCGCGAACTG GCTCTGGGCC TGGGCCCTGG CGATCCCCGC CGGCACGCAG AAGGCCGATG CCGCCAAGGA ATTCATCGAG TGGGCCACCT CGACCGATTA TATCGAGTTG GTGGCCGCGA ACGAAGGTTG GGCCAACGTG CCTCCGGGTG CGCGGACCTC GCTCTATGAG AACGAGAACT ACAAGGACAT TCCGTTCGCC AAGATGACCC TGGATTCGAT CCTGGCTGCC GATCCGACCG ACCCGACCGT GGACCCGGTG CCCTATGTTG GCATCCAGTT CGTCGCTATC CCCGAATTTG CGGGCATCGC CACCGAAGTC AGCCAGGAAT TCTCCGCCGT CTATGCCGGT CAGCAGACCG TCGAAGAGGC GCTTGAGAAA GCCCAGGCCC TGACCAACGA CGCCATGGAA GCCGCCGGCT ACCGCTAA
|
Protein sequence | MSLKNALYAA TALTLVSSGA MASENLVIAT VNNGDMIRMQ GLTQDFTDKT GHTVEWVTLE ENVLRQRVTT DIAAKGGSFD IMTIGMYETP IWGANGWLVP LDDLSADYNA DDILPAMRAG LSHDGTLYAA PFYGESSMIM YRKDLMEKAG LEMPDAPTWQ FIREAAAAMT DRENDINGIC LRGKAGWGEG GAFITVTANS FGARWFDEDW NAQFDQPEWK EALEFFVGMM NESGPNGYAT NGFNENLNLF QQGKCGMWID ATVAASFVTN PNDSTVADQV GFALAPNSEG IEKRANWLWA WALAIPAGTQ KADAAKEFIE WATSTDYIEL VAANEGWANV PPGARTSLYE NENYKDIPFA KMTLDSILAA DPTDPTVDPV PYVGIQFVAI PEFAGIATEV SQEFSAVYAG QQTVEEALEK AQALTNDAME AAGYR
|
| |
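The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names are hypothetical, and the demo uses only the first three blocks of the gene sequence, so its result is not the gene-wide GC content listed in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace used for display grouping and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already-cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three display blocks of the gene sequence, used only as a short demo.
fragment = clean_sequence("ATGTCCTTGA AGAACGCACT TTACGCGGCC")
print(len(fragment))
print(f"{gc_fraction(fragment):.1%}")
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` would give the value to compare against the GC content field.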