Gene Dshi_1413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1413 
Symbol 
ID5712589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1466558 
End bp1467793 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content67% 
IMG OID641267325 
Productextracellular solute-binding protein family 1 
Protein accessionYP_001532756 
Protein GI159043962 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.127019 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.973051 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTCA TCAAAACCGG ATTTTTGCCC GCAGCGATTC TGGCCGTGGG GATCGGCGGG 
GCGTCGGCCT GCGAGCTGGC CGAAGGCTCC GTGCGCATCC TGTCCAACGA TTTCCCCGCC
CTGCAGGCCG TTACCGGCGC GGCGGCGGCC TGTGCCACGG GCGGGGCGAC GGTGACGGCG
AACCTGACGA CCGAGCACCG CAACATCCAG GTGGCGGCCC TGACCGCGGA TCCGGCGGAA
TATACCTCGG CCGTGGTGTC GAATTCGAGC CTCGTGCCGC TGCTGACCGA CGGGCTGGTG
CGGCCGCTGG ACGATCTGAT CGCGGCCCAT GGGGCGGGGG TGTCGCCGAA CCAGAAGATC
ATGATCGACG GGCGCACCAT GGCGATCGCC TTCATGGCCA ATGCGCAGCA CCTCTATTAT
CGGCGCGACA TCCTGGAGGC GGCGGGGCTG GAGGTGCCGA CGACCTATGA GGAGGTGCTC
GCCGCGGCAG AGGCGATCCG CGACCAGGGG CTGATGGAAT TTCCCCTCGG CGGGACGTTC
AAGGCGGGCT GGAACCTGGC CCAGGAATTC GTGAACATGT ATCTCGGCCA TGGTGGGGCG
TTTTTCGAGC CCGGCTCGGC GGCCCCGGCG ATCAACAATG CCCAAGGGGT TGCGGCGCTG
GAGATGATGA AGGCGCTGAC CGCGTACATG AACCCGGATT TTCTCACCTA CGATTCCAAT
GCCTTGCAGG CCGAATTTGA AGCAGGAAAC GTGGCGCTGG CCAATTTCTG GGGCTCGCGG
GCCGGCGGCG TGACCGATGC GGAGGGCGCG ACCCCCGAGA TCGCGGCGGC CATCGGGTTC
GCGGCGGCCC CCACGGTGGC GGGGGGCACG ACGCCGGCGT CGACCCTGTG GTGGGACGGG
TTCACCATCG CCACCAACAT CCCGGACGTG GATGCGGAGG CGAGCTTCGT CGCCATGGTC
CACGGCGCCT CGACCGAGGT GGCCAATGCC AATCCCAACG CGGCGGTCTG GCTGATCGAC
GGCTACACCC CGGGCGACGC GGCGGTGGGC GTTCTGGCTA CGGCGCAGAT GGGCACGTCG
CCCTATCCGA TGCTGCCCTA TATGAGCCTG ATGCACACCG CGCTCGGCAC CGAGATCGTC
GAGTTCCTGC AAGGCAGCGA GACGGCGGAA CAGGCCCTGT CGGATGTGGA GGCGTCCTAC
AGGGCGGCCG CGCGCGAAGC GGGCTTTCTG AACTGA
 
Protein sequence
MNFIKTGFLP AAILAVGIGG ASACELAEGS VRILSNDFPA LQAVTGAAAA CATGGATVTA 
NLTTEHRNIQ VAALTADPAE YTSAVVSNSS LVPLLTDGLV RPLDDLIAAH GAGVSPNQKI
MIDGRTMAIA FMANAQHLYY RRDILEAAGL EVPTTYEEVL AAAEAIRDQG LMEFPLGGTF
KAGWNLAQEF VNMYLGHGGA FFEPGSAAPA INNAQGVAAL EMMKALTAYM NPDFLTYDSN
ALQAEFEAGN VALANFWGSR AGGVTDAEGA TPEIAAAIGF AAAPTVAGGT TPASTLWWDG
FTIATNIPDV DAEASFVAMV HGASTEVANA NPNAAVWLID GYTPGDAAVG VLATAQMGTS
PYPMLPYMSL MHTALGTEIV EFLQGSETAE QALSDVEASY RAAAREAGFL N