Gene Dshi_0872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_0872 
Symbol 
ID5710562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp888387 
End bp889970 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content64% 
IMG OID641266782 
Productextracellular solute-binding protein family 5 
Protein accessionYP_001532218 
Protein GI159043424 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.625735 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAAAA CCACCCTCCT GCTGGCCACA TCGGCGCTCG TTCTGACGCC GATCCTCGCC 
AGCGCCGAAA CCCTGCGCTG GGCGCGGGCG GGCGACAGCC TGACCCTCGA TCCCCATGCC
CAGAACGAAG GGCCCACCCA TACGCTCGCC CACCAGATCT ACGAGCCGCT GATCATCCGC
GATCATTTCG GCCAGTTCCA GGGTGCACTG GCCACCGAAT GGGCCCCCAA GCCCGATGAT
CCCAGTGTCT GGGTCTTCAA GCTGCGCGAG GGCGTGACCT TCCATGACGG CTCGGCCTTC
ACCGCCGAGG ACGTGGTGTT CTCCTTCGAG CGGGCGAAAT CCGAAACCTC CGCCATGAAA
GAGCTGCTGA CCTCGATCTC CGAGGTGCGC GCGGTCGACG ACATGACGGT GGAGTTCGTG
ACCGAAGGAC CGAACCCGAT CCTGCCCAAC AACTTCACCA ACCTGTTCAT CATGGACAAG
GGCTGGACCG AGGCCAATGG CGTCACCCGG GTGCAGGACA TCGCCAATGG CGAGACCACC
TTCGCCGCGA CCAACGCCAT GGGCACCGGC CCCTACGTGC TCACCAGCCG CGAGCCGGAT
GTGCGCACCG TGCTGACGAT CAACCCAGAC TACTGGGGCG CGGATGACTT CCCGCTGCAG
GTCACCGAGA TCATCTATAC CCCGATCCAG AACGCCGCGA CACGGGTGGC GGCCCTTCTG
TCGGGCGAGG TGGACTTCAT CCAGGACGTG CCCGTGCAGG ATCTCGAACG CGTGGCCGGG
ACCGACGGGC TGGAGGTGCA GACCGCCGCC CAGAACCGGG TGATCTTCCT CGGCATGAAC
TCCGGGGCCG ACGATCTCGA CAGCGACAAT GTCGAAGGCG CCAACCCGCT GGCCGATGCG
CGCGTGCGCA AGGCAATGAA CATGGCGATC AACCGCGACG CCATCAAACA GGTGGTGATG
CGTGGCCAGT CCGCGCCTGC CGGCATGATC GCGCCGCCCT TCGTCAACGG CTGGACCGCC
GAGATGGACG GGGCCGCCAT GACCGATATC GACGCCGCCA AGGCGCTGAT GGAGGAAGCA
GGCTATGGCG ACGGCTTCTC GATCCAGCTC GACTGCCCCA ATGACCGCTA CATCAACGAC
GAGGCGATCT GCCAGGCGGT CGTGGGGATG CTCGGCCAGA TCGGCGTGAC CGTGAACCTC
GACGCCAAGC CCAAGGCACA GCACTTCCCG CTGATCCAGA ACGGCGAGAC CAACTTCTAC
ATGCTGGGCT GGGGTGTTCC GACCTATGAC AGCGAGTATG TCTTCAACTT CCTGGTGCAC
ACCCGCGAGG GTGACCGGGG CTCCTGGAAC AACACCGGCT TCTCGAACGC CGAGGTGGAC
GCCAAGATCG TCAGCCTGGC CTCCGAGACC GATCTGGCGG TCCGCAACCA GACCATCGCC
GAGATCTGGC AGGTGGTGCA GGACGAACAA CTCTACCTGC CGATCCACCA CCAAGTGCTG
AACTGGGGCA ACACCTCCGC CGTCGGCTAC CCCAACGTCT CGCCGGAGGA CGACCCGAAG
TTCAAGTTCT TCGAGATGAA CTGA
 
Protein sequence
MQKTTLLLAT SALVLTPILA SAETLRWARA GDSLTLDPHA QNEGPTHTLA HQIYEPLIIR 
DHFGQFQGAL ATEWAPKPDD PSVWVFKLRE GVTFHDGSAF TAEDVVFSFE RAKSETSAMK
ELLTSISEVR AVDDMTVEFV TEGPNPILPN NFTNLFIMDK GWTEANGVTR VQDIANGETT
FAATNAMGTG PYVLTSREPD VRTVLTINPD YWGADDFPLQ VTEIIYTPIQ NAATRVAALL
SGEVDFIQDV PVQDLERVAG TDGLEVQTAA QNRVIFLGMN SGADDLDSDN VEGANPLADA
RVRKAMNMAI NRDAIKQVVM RGQSAPAGMI APPFVNGWTA EMDGAAMTDI DAAKALMEEA
GYGDGFSIQL DCPNDRYIND EAICQAVVGM LGQIGVTVNL DAKPKAQHFP LIQNGETNFY
MLGWGVPTYD SEYVFNFLVH TREGDRGSWN NTGFSNAEVD AKIVSLASET DLAVRNQTIA
EIWQVVQDEQ LYLPIHHQVL NWGNTSAVGY PNVSPEDDPK FKFFEMN