Gene Dshi_0974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_0974 
Symbol 
ID5710487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp995276 
End bp996583 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content63% 
IMG OID641266882 
Productextracellular solute-binding protein family 1 
Protein accessionYP_001532317 
Protein GI159043523 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.606433 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.163223 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTTGA AGAACGCACT TTACGCGGCC ACCGCCCTGA CCCTTGTAAG CTCGGGCGCT 
ATGGCAAGCG AAAACCTGGT AATCGCAACC GTCAACAATG GCGACATGAT CCGGATGCAG
GGTCTGACCC AGGATTTTAC CGACAAGACC GGCCACACGG TCGAGTGGGT GACCCTCGAA
GAGAACGTCC TGCGCCAGCG CGTCACGACG GATATTGCTG CCAAGGGCGG CTCCTTCGAC
ATCATGACCA TCGGCATGTA CGAAACTCCG ATCTGGGGCG CCAATGGCTG GCTCGTGCCG
CTGGACGATC TGTCCGCGGA CTACAACGCC GACGATATTC TGCCCGCGAT GCGCGCCGGT
CTGAGCCATG ACGGCACGCT CTATGCTGCG CCGTTCTACG GCGAAAGCTC CATGATCATG
TACCGCAAGG ACCTGATGGA GAAGGCCGGG CTGGAAATGC CCGATGCGCC GACCTGGCAG
TTCATCCGCG AAGCGGCCGC CGCCATGACC GACCGCGAGA ACGACATCAA CGGCATCTGC
CTGCGCGGCA AGGCCGGCTG GGGCGAAGGC GGCGCGTTCA TCACCGTCAC CGCGAACTCC
TTTGGCGCGC GCTGGTTCGA CGAGGACTGG AACGCCCAAT TCGACCAACC CGAGTGGAAA
GAGGCGCTGG AATTCTTCGT CGGCATGATG AACGAGTCCG GGCCGAACGG CTACGCCACC
AACGGCTTCA ACGAGAACCT GAACCTGTTC CAGCAGGGCA AGTGCGGCAT GTGGATCGAC
GCCACGGTGG CGGCGTCCTT CGTGACCAAC CCCAACGACT CGACCGTGGC CGACCAGGTG
GGCTTCGCCC TCGCCCCGAA CAGCGAGGGC ATCGAGAAGC GCGCGAACTG GCTCTGGGCC
TGGGCCCTGG CGATCCCCGC CGGCACGCAG AAGGCCGATG CCGCCAAGGA ATTCATCGAG
TGGGCCACCT CGACCGATTA TATCGAGTTG GTGGCCGCGA ACGAAGGTTG GGCCAACGTG
CCTCCGGGTG CGCGGACCTC GCTCTATGAG AACGAGAACT ACAAGGACAT TCCGTTCGCC
AAGATGACCC TGGATTCGAT CCTGGCTGCC GATCCGACCG ACCCGACCGT GGACCCGGTG
CCCTATGTTG GCATCCAGTT CGTCGCTATC CCCGAATTTG CGGGCATCGC CACCGAAGTC
AGCCAGGAAT TCTCCGCCGT CTATGCCGGT CAGCAGACCG TCGAAGAGGC GCTTGAGAAA
GCCCAGGCCC TGACCAACGA CGCCATGGAA GCCGCCGGCT ACCGCTAA
 
Protein sequence
MSLKNALYAA TALTLVSSGA MASENLVIAT VNNGDMIRMQ GLTQDFTDKT GHTVEWVTLE 
ENVLRQRVTT DIAAKGGSFD IMTIGMYETP IWGANGWLVP LDDLSADYNA DDILPAMRAG
LSHDGTLYAA PFYGESSMIM YRKDLMEKAG LEMPDAPTWQ FIREAAAAMT DRENDINGIC
LRGKAGWGEG GAFITVTANS FGARWFDEDW NAQFDQPEWK EALEFFVGMM NESGPNGYAT
NGFNENLNLF QQGKCGMWID ATVAASFVTN PNDSTVADQV GFALAPNSEG IEKRANWLWA
WALAIPAGTQ KADAAKEFIE WATSTDYIEL VAANEGWANV PPGARTSLYE NENYKDIPFA
KMTLDSILAA DPTDPTVDPV PYVGIQFVAI PEFAGIATEV SQEFSAVYAG QQTVEEALEK
AQALTNDAME AAGYR