Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_2595 |
Symbol | |
ID | 5713493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 2759653 |
End bp | 2761176 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641268519 |
Product | putative extracellular solute-binding protein |
Protein accession | YP_001533929 |
Protein GI | 159045135 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.00975919 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCACTTT TGACCCGCAT GACCGCCGCT GCCTTAATCG CGCTTGGCAC CGCCTTGCCA GCCGCAGCAC AGGACGACAC GCTCACCATT TCAGTTACAT TCGGCCCCAC GGCCGAGGTG CCTGACCCGC GTGCAGGCTA CAACGGCTGG ATGTCAAACC AGACTGGCGT CACAGAAACG CTGATGGGCA TCGACTATGA CCTGAACCTC TATCCGCGCC TGGCAGAGAG CATTGAGCAA ACCGCACCGA CGACCTGGCG TGTGACCTTG CGCGATGGTT TGACATTCCA CGATGGCTCT CCCGTGACGG CACAGGCGGT CATCGATGCA ATTGCGCCGA TCTCGGATGA AGCGGACCCT GCGTTCAACA AACGCATCGC CAGTGTGCTT GATCTGGCGG GCATGTCTGC AGATGGCGAC CGCTTTGTGG TGTTCGAGAC CAACAGCCCG AACGCCGCTT TCCAATGGAG CCTGTCTGAT CCGGGTGTCG CCATTCTCGG CGCGCCTTCA GACGCATTCC CAATCAACGC AACCGGCCCC TACATCTTCC GTGAGGCTGT GCCGGAGCAG CTTTACCGCG TTGAAGCAAA CTCAGACTAT CGCATGGGCG AGCCCGGCTT CGACGAAGTG CGGGTTGTCG CCTCACCTGA TCCAGCCACA GCCTCACTGG CCTTTGAAGC CGGTGATGTG GATATGGTGA TCAACTACCC CGAAACCGAT TTTGCACGTA TCCAGGAAAC GGGTGCGCAG GGCTTTACCG CTCCAACCGC CCGGCTTTAC TTCTATTCGA TGAACACAGC GTCCGGCCCG CTGGCAAACC CGTTGATCCG GCAAGCCGTA TCTTTGGCGA TTGACCGGGA CGGCATCGTT GAGGCGGTTC TTTCCGGCGT TGGCGGTGTG CCAGCAGGCA CCATCTATCC GGAGGGCAAG AGCTGGGCCG CGGACATCGC GCCCACATAT GATCCGGCAC GGGCAGAGGA ATTGCTTGCC GAAGCCGGAG CGGTCAAGCA GGGCGGCACC TGGATGCTTG ATGGTGAACC GCTTGAGATC GAGATCCTGA CCTATTCGTC GCGTGCGGCG CTGCCGCCCA CGGCGGAGAT TACGCAGGCC TTTCTGCAGC AGATCGGGAT CACAGCATCT GTGCGTGTGG GCGAGTACGG CGCCAGCAAC GATGCGCTCA AAGCGGGTGA CGGCGACATG TTCCTGCAAG CATGGGTGAT GTTGCCTCAG GGCGACCCCG GTTCGATCCT CGGCTTTCTA TTGGCCAGCG ACGGCGGCTC GAACGCGGGC AACTACGCCA ATCCCGCGAT GGACGCCCTA CTTGTCGAAG GTCAGACCAC GTTTGAACAA GCCGAGCGTG AGCGGATTTA CGATGAGGTT CAGCAGATCA TTGCCACCGA TGTCCCGCTT ATCCCTGTCT TCCACGTGAG CCAGGCCGTT GTGGCCCGTG CCGGTCTGAC CGGTTACCAG GTCCACCCGA CCGAGACCTA CTGGATCACC CACGAGACAC GCTTCGCTGA ATGA
|
Protein sequence | MSLLTRMTAA ALIALGTALP AAAQDDTLTI SVTFGPTAEV PDPRAGYNGW MSNQTGVTET LMGIDYDLNL YPRLAESIEQ TAPTTWRVTL RDGLTFHDGS PVTAQAVIDA IAPISDEADP AFNKRIASVL DLAGMSADGD RFVVFETNSP NAAFQWSLSD PGVAILGAPS DAFPINATGP YIFREAVPEQ LYRVEANSDY RMGEPGFDEV RVVASPDPAT ASLAFEAGDV DMVINYPETD FARIQETGAQ GFTAPTARLY FYSMNTASGP LANPLIRQAV SLAIDRDGIV EAVLSGVGGV PAGTIYPEGK SWAADIAPTY DPARAEELLA EAGAVKQGGT WMLDGEPLEI EILTYSSRAA LPPTAEITQA FLQQIGITAS VRVGEYGASN DALKAGDGDM FLQAWVMLPQ GDPGSILGFL LASDGGSNAG NYANPAMDAL LVEGQTTFEQ AERERIYDEV QQIIATDVPL IPVFHVSQAV VARAGLTGYQ VHPTETYWIT HETRFAE
|
| |