Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_0872 |
Symbol | |
ID | 5710562 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 888387 |
End bp | 889970 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641266782 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_001532218 |
Protein GI | 159043424 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.625735 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAAAA CCACCCTCCT GCTGGCCACA TCGGCGCTCG TTCTGACGCC GATCCTCGCC AGCGCCGAAA CCCTGCGCTG GGCGCGGGCG GGCGACAGCC TGACCCTCGA TCCCCATGCC CAGAACGAAG GGCCCACCCA TACGCTCGCC CACCAGATCT ACGAGCCGCT GATCATCCGC GATCATTTCG GCCAGTTCCA GGGTGCACTG GCCACCGAAT GGGCCCCCAA GCCCGATGAT CCCAGTGTCT GGGTCTTCAA GCTGCGCGAG GGCGTGACCT TCCATGACGG CTCGGCCTTC ACCGCCGAGG ACGTGGTGTT CTCCTTCGAG CGGGCGAAAT CCGAAACCTC CGCCATGAAA GAGCTGCTGA CCTCGATCTC CGAGGTGCGC GCGGTCGACG ACATGACGGT GGAGTTCGTG ACCGAAGGAC CGAACCCGAT CCTGCCCAAC AACTTCACCA ACCTGTTCAT CATGGACAAG GGCTGGACCG AGGCCAATGG CGTCACCCGG GTGCAGGACA TCGCCAATGG CGAGACCACC TTCGCCGCGA CCAACGCCAT GGGCACCGGC CCCTACGTGC TCACCAGCCG CGAGCCGGAT GTGCGCACCG TGCTGACGAT CAACCCAGAC TACTGGGGCG CGGATGACTT CCCGCTGCAG GTCACCGAGA TCATCTATAC CCCGATCCAG AACGCCGCGA CACGGGTGGC GGCCCTTCTG TCGGGCGAGG TGGACTTCAT CCAGGACGTG CCCGTGCAGG ATCTCGAACG CGTGGCCGGG ACCGACGGGC TGGAGGTGCA GACCGCCGCC CAGAACCGGG TGATCTTCCT CGGCATGAAC TCCGGGGCCG ACGATCTCGA CAGCGACAAT GTCGAAGGCG CCAACCCGCT GGCCGATGCG CGCGTGCGCA AGGCAATGAA CATGGCGATC AACCGCGACG CCATCAAACA GGTGGTGATG CGTGGCCAGT CCGCGCCTGC CGGCATGATC GCGCCGCCCT TCGTCAACGG CTGGACCGCC GAGATGGACG GGGCCGCCAT GACCGATATC GACGCCGCCA AGGCGCTGAT GGAGGAAGCA GGCTATGGCG ACGGCTTCTC GATCCAGCTC GACTGCCCCA ATGACCGCTA CATCAACGAC GAGGCGATCT GCCAGGCGGT CGTGGGGATG CTCGGCCAGA TCGGCGTGAC CGTGAACCTC GACGCCAAGC CCAAGGCACA GCACTTCCCG CTGATCCAGA ACGGCGAGAC CAACTTCTAC ATGCTGGGCT GGGGTGTTCC GACCTATGAC AGCGAGTATG TCTTCAACTT CCTGGTGCAC ACCCGCGAGG GTGACCGGGG CTCCTGGAAC AACACCGGCT TCTCGAACGC CGAGGTGGAC GCCAAGATCG TCAGCCTGGC CTCCGAGACC GATCTGGCGG TCCGCAACCA GACCATCGCC GAGATCTGGC AGGTGGTGCA GGACGAACAA CTCTACCTGC CGATCCACCA CCAAGTGCTG AACTGGGGCA ACACCTCCGC CGTCGGCTAC CCCAACGTCT CGCCGGAGGA CGACCCGAAG TTCAAGTTCT TCGAGATGAA CTGA
|
Protein sequence | MQKTTLLLAT SALVLTPILA SAETLRWARA GDSLTLDPHA QNEGPTHTLA HQIYEPLIIR DHFGQFQGAL ATEWAPKPDD PSVWVFKLRE GVTFHDGSAF TAEDVVFSFE RAKSETSAMK ELLTSISEVR AVDDMTVEFV TEGPNPILPN NFTNLFIMDK GWTEANGVTR VQDIANGETT FAATNAMGTG PYVLTSREPD VRTVLTINPD YWGADDFPLQ VTEIIYTPIQ NAATRVAALL SGEVDFIQDV PVQDLERVAG TDGLEVQTAA QNRVIFLGMN SGADDLDSDN VEGANPLADA RVRKAMNMAI NRDAIKQVVM RGQSAPAGMI APPFVNGWTA EMDGAAMTDI DAAKALMEEA GYGDGFSIQL DCPNDRYIND EAICQAVVGM LGQIGVTVNL DAKPKAQHFP LIQNGETNFY MLGWGVPTYD SEYVFNFLVH TREGDRGSWN NTGFSNAEVD AKIVSLASET DLAVRNQTIA EIWQVVQDEQ LYLPIHHQVL NWGNTSAVGY PNVSPEDDPK FKFFEMN
|
| |