Gene Dshi_2595 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2595 
Symbol 
ID5713493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2759653 
End bp2761176 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content61% 
IMG OID641268519 
Productputative extracellular solute-binding protein 
Protein accessionYP_001533929 
Protein GI159045135 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.00975919 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCACTTT TGACCCGCAT GACCGCCGCT GCCTTAATCG CGCTTGGCAC CGCCTTGCCA 
GCCGCAGCAC AGGACGACAC GCTCACCATT TCAGTTACAT TCGGCCCCAC GGCCGAGGTG
CCTGACCCGC GTGCAGGCTA CAACGGCTGG ATGTCAAACC AGACTGGCGT CACAGAAACG
CTGATGGGCA TCGACTATGA CCTGAACCTC TATCCGCGCC TGGCAGAGAG CATTGAGCAA
ACCGCACCGA CGACCTGGCG TGTGACCTTG CGCGATGGTT TGACATTCCA CGATGGCTCT
CCCGTGACGG CACAGGCGGT CATCGATGCA ATTGCGCCGA TCTCGGATGA AGCGGACCCT
GCGTTCAACA AACGCATCGC CAGTGTGCTT GATCTGGCGG GCATGTCTGC AGATGGCGAC
CGCTTTGTGG TGTTCGAGAC CAACAGCCCG AACGCCGCTT TCCAATGGAG CCTGTCTGAT
CCGGGTGTCG CCATTCTCGG CGCGCCTTCA GACGCATTCC CAATCAACGC AACCGGCCCC
TACATCTTCC GTGAGGCTGT GCCGGAGCAG CTTTACCGCG TTGAAGCAAA CTCAGACTAT
CGCATGGGCG AGCCCGGCTT CGACGAAGTG CGGGTTGTCG CCTCACCTGA TCCAGCCACA
GCCTCACTGG CCTTTGAAGC CGGTGATGTG GATATGGTGA TCAACTACCC CGAAACCGAT
TTTGCACGTA TCCAGGAAAC GGGTGCGCAG GGCTTTACCG CTCCAACCGC CCGGCTTTAC
TTCTATTCGA TGAACACAGC GTCCGGCCCG CTGGCAAACC CGTTGATCCG GCAAGCCGTA
TCTTTGGCGA TTGACCGGGA CGGCATCGTT GAGGCGGTTC TTTCCGGCGT TGGCGGTGTG
CCAGCAGGCA CCATCTATCC GGAGGGCAAG AGCTGGGCCG CGGACATCGC GCCCACATAT
GATCCGGCAC GGGCAGAGGA ATTGCTTGCC GAAGCCGGAG CGGTCAAGCA GGGCGGCACC
TGGATGCTTG ATGGTGAACC GCTTGAGATC GAGATCCTGA CCTATTCGTC GCGTGCGGCG
CTGCCGCCCA CGGCGGAGAT TACGCAGGCC TTTCTGCAGC AGATCGGGAT CACAGCATCT
GTGCGTGTGG GCGAGTACGG CGCCAGCAAC GATGCGCTCA AAGCGGGTGA CGGCGACATG
TTCCTGCAAG CATGGGTGAT GTTGCCTCAG GGCGACCCCG GTTCGATCCT CGGCTTTCTA
TTGGCCAGCG ACGGCGGCTC GAACGCGGGC AACTACGCCA ATCCCGCGAT GGACGCCCTA
CTTGTCGAAG GTCAGACCAC GTTTGAACAA GCCGAGCGTG AGCGGATTTA CGATGAGGTT
CAGCAGATCA TTGCCACCGA TGTCCCGCTT ATCCCTGTCT TCCACGTGAG CCAGGCCGTT
GTGGCCCGTG CCGGTCTGAC CGGTTACCAG GTCCACCCGA CCGAGACCTA CTGGATCACC
CACGAGACAC GCTTCGCTGA ATGA
 
Protein sequence
MSLLTRMTAA ALIALGTALP AAAQDDTLTI SVTFGPTAEV PDPRAGYNGW MSNQTGVTET 
LMGIDYDLNL YPRLAESIEQ TAPTTWRVTL RDGLTFHDGS PVTAQAVIDA IAPISDEADP
AFNKRIASVL DLAGMSADGD RFVVFETNSP NAAFQWSLSD PGVAILGAPS DAFPINATGP
YIFREAVPEQ LYRVEANSDY RMGEPGFDEV RVVASPDPAT ASLAFEAGDV DMVINYPETD
FARIQETGAQ GFTAPTARLY FYSMNTASGP LANPLIRQAV SLAIDRDGIV EAVLSGVGGV
PAGTIYPEGK SWAADIAPTY DPARAEELLA EAGAVKQGGT WMLDGEPLEI EILTYSSRAA
LPPTAEITQA FLQQIGITAS VRVGEYGASN DALKAGDGDM FLQAWVMLPQ GDPGSILGFL
LASDGGSNAG NYANPAMDAL LVEGQTTFEQ AERERIYDEV QQIIATDVPL IPVFHVSQAV
VARAGLTGYQ VHPTETYWIT HETRFAE