Gene Dshi_1248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1248 
Symbol 
ID5711806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1295236 
End bp1296480 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content63% 
IMG OID641267160 
Productextracellular solute-binding protein family 1 
Protein accessionYP_001532591 
Protein GI159043797 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.494424 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.639356 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCCAT CCACGAAATC GCTTCTGGCC GCGCTTGCCA CGTCTGTCGC CTTCACGGTG 
CCCGCGGCGG CCGAGCTGAC CGGCGAGCTG AAGATTTTCT CGGATATGTC GAACCCGGCT
CCCCGCGCCA CCATGGAAGG TCTCGTCGCC GGGTTTCAGG AGCAGCACCC CGATCTCGAC
ATCGAGCTGA CGATCATCGA CCGCGAGGCC TACAAGACGC AGATCCGCAA CTTCCTGACG
GCCAACACGC CGGATGTGGC CAACTGGTAC GCGGGCAACC GGATGCTGCC CTTCGTCGAG
GCCGGGCTTT TCGAGGATAT CTCGGACCTC TGGGATGACG AGAAATCCGC AAATCTGGCG
TCGACCAAGC CGTCCATGAC CATCGACGGC AAGCAATGGG GCGTGCCCTA TACCTACTAC
CAATGGGGCG TGTATTACCG CGAGGACATC TATAACGACC TCGGCCTGAC CGAGCCGACC
ACCTGGGCCG AGGAAAAGGC GAACTGCCAG ACTCTGCTGG AAAACGGGAT CAAGTGTTAC
ACCATCGGCA CCAAGTTCTT GTGGACCGCG GGCGGCTGGT TCGACTACCT CAACCTGCGC
ACGAATGGCT ACGAGTTCCA CATGGCGCTC ACCAACGGCG AGGTCGCCTG GACCGACGAC
CGGGTGCGCC AGACCATGGC GAACTGGCGC GAGCTGATCG ATATGGGCGC CTTCATCGAC
AACCATCAGA CCTACAGCTG GCAAGAAGCC CTGCCTTTCA TGGTCCGGGG CGAAGCCGCC
GCCTACCTGA TGGGCAATTT CGCGGTCTCG CATCTGCGCG AGGCCGGGCT CGGCGACGAC
CAGCTGGGCT TTTATCAATT CGTCGAAATC ACCCCAGGCA TCCCCAAGGC CGAGGATGCG
CCGACCGACA CGTTCCACAT CCCCGCCCAG GCCGCGAACA AGGAAGCCGC CCGCGCCTTC
CTGCGGTATG TCACCTCTCC CGAGGTCCAG ACACAGATCA ACGCGGGCGA CCAGCTCGGC
CAGTTGCCCG TGCACAAGGC CGCCAGCGTG GATGACGACA AGTTCCTGAA AGAGGGCTTC
GACATGCTCT CGAACGCCTA CGCACTGGCA CAGTTCTTCG ACCGTGACGC GCCCGCCGAG
ATGGCCAAGG CGGGGATGGA GGGCTTCCAG GAATTCATGG TGAAACCCGA CAATCTCGAC
CGCATCCTGG AGCGGATGGA GCGGGTCCGC CAGCGCGTCT ACTGA
 
Protein sequence
MRPSTKSLLA ALATSVAFTV PAAAELTGEL KIFSDMSNPA PRATMEGLVA GFQEQHPDLD 
IELTIIDREA YKTQIRNFLT ANTPDVANWY AGNRMLPFVE AGLFEDISDL WDDEKSANLA
STKPSMTIDG KQWGVPYTYY QWGVYYREDI YNDLGLTEPT TWAEEKANCQ TLLENGIKCY
TIGTKFLWTA GGWFDYLNLR TNGYEFHMAL TNGEVAWTDD RVRQTMANWR ELIDMGAFID
NHQTYSWQEA LPFMVRGEAA AYLMGNFAVS HLREAGLGDD QLGFYQFVEI TPGIPKAEDA
PTDTFHIPAQ AANKEAARAF LRYVTSPEVQ TQINAGDQLG QLPVHKAASV DDDKFLKEGF
DMLSNAYALA QFFDRDAPAE MAKAGMEGFQ EFMVKPDNLD RILERMERVR QRVY