Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1353 |
Symbol | |
ID | 5711904 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 1404655 |
End bp | 1405959 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641267265 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_001532696 |
Protein GI | 159043902 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.621342 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCTGA CCGCAGGGGC CGTCAGCGCC GACACGATCC GCTTCTGGAC CACCGAGGAG CAGCCCGAGC GCCTGGCCAA GCAGCAGGAA ATGGCCGCGC AATTCGAGGC GGAGACCGGC ACCGCCGTGG AGGTGATCCC GGTCACCGAG AGCGACCTGG GCACCCGGGC CACGGCGGCC TTCGCGGCGG GCGATCTGCC GGACGTGATC TATCACACCC TGCAATACGC GCTGCCCTGG GCGGAGGCTG GCATTCTGGA CACCGATGCC GCCACCGAGG TGGTCGAGGA TCTAGGCGAG GATACCTTTG CGCCCGGGGC CTTGCAGATG GCCTCCACCG GCGATGGCGT GGCCTCGGTG CCGGTGGATG GCTGGACCCA GATGATCGTC TATCGCAAGG ACAAGTTCGA GGAGATGGGG CTGGAGCCGC CGACCTCCTT TGCCAATGTG ACTGCCGCGC TGGAGGCGCT GCACAATCCG CCGGAGATGT ACGGCTTCGT GGCCGCCACC AAGGTGGACG AGAACTTCAT GTCCCAGGTT CTGGAGCATG TGTTCCTGGC CAACGGCGTC AGCCCGGTGG ACGACGATGG CTTCGCGCCG CTCGACGAGG CCGCCACCAC CGAAGTGCTG GAGTTCTACA GGGCGATCGC CGAGGCCTCG CCCCCGGGCG AGCTTTACTG GAAGCAGTCG CGCGAGCTCT ATTTCGCGGG ACAGGCCGCG ATGATCATCT GGTCGCCCTT CATTCTCGAC GAGTTGGCCG GTCTGCGCGA CAGCGCGCCG CCCACCATCA ACGACGACCC GACCAGCGCG GAATTGGCCA GCCTGACCGG CATCGTGACC AACTTCTCCG GCCCGTCGAA CCCCGAAGGT GCTGCCTGGG GCGATATCCG GTATTTCGGC ATCACCACGG ACGCGGACAC AGACGCGGCG ATGGAGTTCG TGAAGTTCTC GATGGACGAG GGCTATACCC AAACCCTCAG CATCGCGCCG GAGGGCAAGT TCCCGGTCCG CAAGGGCACG GCCGAGGATC CGCAGAAGTT CACCGAGGCC TGGTCGCAGC TGCCCGTGGG CGTGGATCGC AAGGCGCCGT TGGGCGATCT TTATGACGCG GCGATGATCG ACGAGATCGT CGGCGGGCTC GATGTGGCGC AGCGCTGGGG CGTGGCGGAG GGGCAGTTGT CGCTGGCCTC CAAGATGATC AACAGCCAGG CGATCAACCG TATCGTGCGT CAGTATATCG ATGGCGAGGT CGATGCCGCT GCCGCCGTGG CCGCGATGAA CGACGCGCTG TCGCAGATCG ACTGA
|
Protein sequence | MALTAGAVSA DTIRFWTTEE QPERLAKQQE MAAQFEAETG TAVEVIPVTE SDLGTRATAA FAAGDLPDVI YHTLQYALPW AEAGILDTDA ATEVVEDLGE DTFAPGALQM ASTGDGVASV PVDGWTQMIV YRKDKFEEMG LEPPTSFANV TAALEALHNP PEMYGFVAAT KVDENFMSQV LEHVFLANGV SPVDDDGFAP LDEAATTEVL EFYRAIAEAS PPGELYWKQS RELYFAGQAA MIIWSPFILD ELAGLRDSAP PTINDDPTSA ELASLTGIVT NFSGPSNPEG AAWGDIRYFG ITTDADTDAA MEFVKFSMDE GYTQTLSIAP EGKFPVRKGT AEDPQKFTEA WSQLPVGVDR KAPLGDLYDA AMIDEIVGGL DVAQRWGVAE GQLSLASKMI NSQAINRIVR QYIDGEVDAA AAVAAMNDAL SQID
|
| |