Gene Dshi_1353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1353 
Symbol 
ID5711904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1404655 
End bp1405959 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content66% 
IMG OID641267265 
Productextracellular solute-binding protein family 1 
Protein accessionYP_001532696 
Protein GI159043902 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.621342 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCTGA CCGCAGGGGC CGTCAGCGCC GACACGATCC GCTTCTGGAC CACCGAGGAG 
CAGCCCGAGC GCCTGGCCAA GCAGCAGGAA ATGGCCGCGC AATTCGAGGC GGAGACCGGC
ACCGCCGTGG AGGTGATCCC GGTCACCGAG AGCGACCTGG GCACCCGGGC CACGGCGGCC
TTCGCGGCGG GCGATCTGCC GGACGTGATC TATCACACCC TGCAATACGC GCTGCCCTGG
GCGGAGGCTG GCATTCTGGA CACCGATGCC GCCACCGAGG TGGTCGAGGA TCTAGGCGAG
GATACCTTTG CGCCCGGGGC CTTGCAGATG GCCTCCACCG GCGATGGCGT GGCCTCGGTG
CCGGTGGATG GCTGGACCCA GATGATCGTC TATCGCAAGG ACAAGTTCGA GGAGATGGGG
CTGGAGCCGC CGACCTCCTT TGCCAATGTG ACTGCCGCGC TGGAGGCGCT GCACAATCCG
CCGGAGATGT ACGGCTTCGT GGCCGCCACC AAGGTGGACG AGAACTTCAT GTCCCAGGTT
CTGGAGCATG TGTTCCTGGC CAACGGCGTC AGCCCGGTGG ACGACGATGG CTTCGCGCCG
CTCGACGAGG CCGCCACCAC CGAAGTGCTG GAGTTCTACA GGGCGATCGC CGAGGCCTCG
CCCCCGGGCG AGCTTTACTG GAAGCAGTCG CGCGAGCTCT ATTTCGCGGG ACAGGCCGCG
ATGATCATCT GGTCGCCCTT CATTCTCGAC GAGTTGGCCG GTCTGCGCGA CAGCGCGCCG
CCCACCATCA ACGACGACCC GACCAGCGCG GAATTGGCCA GCCTGACCGG CATCGTGACC
AACTTCTCCG GCCCGTCGAA CCCCGAAGGT GCTGCCTGGG GCGATATCCG GTATTTCGGC
ATCACCACGG ACGCGGACAC AGACGCGGCG ATGGAGTTCG TGAAGTTCTC GATGGACGAG
GGCTATACCC AAACCCTCAG CATCGCGCCG GAGGGCAAGT TCCCGGTCCG CAAGGGCACG
GCCGAGGATC CGCAGAAGTT CACCGAGGCC TGGTCGCAGC TGCCCGTGGG CGTGGATCGC
AAGGCGCCGT TGGGCGATCT TTATGACGCG GCGATGATCG ACGAGATCGT CGGCGGGCTC
GATGTGGCGC AGCGCTGGGG CGTGGCGGAG GGGCAGTTGT CGCTGGCCTC CAAGATGATC
AACAGCCAGG CGATCAACCG TATCGTGCGT CAGTATATCG ATGGCGAGGT CGATGCCGCT
GCCGCCGTGG CCGCGATGAA CGACGCGCTG TCGCAGATCG ACTGA
 
Protein sequence
MALTAGAVSA DTIRFWTTEE QPERLAKQQE MAAQFEAETG TAVEVIPVTE SDLGTRATAA 
FAAGDLPDVI YHTLQYALPW AEAGILDTDA ATEVVEDLGE DTFAPGALQM ASTGDGVASV
PVDGWTQMIV YRKDKFEEMG LEPPTSFANV TAALEALHNP PEMYGFVAAT KVDENFMSQV
LEHVFLANGV SPVDDDGFAP LDEAATTEVL EFYRAIAEAS PPGELYWKQS RELYFAGQAA
MIIWSPFILD ELAGLRDSAP PTINDDPTSA ELASLTGIVT NFSGPSNPEG AAWGDIRYFG
ITTDADTDAA MEFVKFSMDE GYTQTLSIAP EGKFPVRKGT AEDPQKFTEA WSQLPVGVDR
KAPLGDLYDA AMIDEIVGGL DVAQRWGVAE GQLSLASKMI NSQAINRIVR QYIDGEVDAA
AAVAAMNDAL SQID