Gene Dshi_3792 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3792 
Symbol 
ID5714460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009956 
Strand
Start bp167 
End bp1822 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content67% 
IMG OID641276707 
Productextracellular solute-binding protein 
Protein accessionYP_001542003 
Protein GI159046332 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.639455 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.694676 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCACAC GATTTTGGGC AACGACTGCG GCCGCGACCA TGGCCGCCGT TCTGGCCTCC 
GGCTCGGCCT GGGCTGAAAG CGTTCTGACG ATCGGGATGA CGGCGGCGGA TATTCCGCGC
ACCTCCGGTC AGCCGGACCA GGGTTTCGAG GGCAACCGCT TCACCGGGAT TCCGATGTAC
GACGCCCTGA CCCATTGGGA CCTGTCGTCC GAGACCGAGG CGTCGGTGGT GATCCCCGGG
CTGGCAACCG AATGGTCCGT GAACCCGGAT GACACGACCA AGTGGACCTT CAAGCTGCGC
GAGGGCGTGA CCTTCCATGA CGGCTCGCCC TTCAACGCCG AGGCGGTGGT CTGGAACGTG
GAGAAGGTGC TCGACGACGA GGCGCCGCAT TTCGCCCCCG ACCAGGTGGG TGCGACCGCG
TCGCGGATGC CGACCCTGCG CTCGGCCCGG GCGATCGACG AGCTTACGGT GGAGCTGACC
ACGTCACAGC CCGATGCGTT CCTGCCGCTG AACATGACGA TGCTCTTCAT GGCCTCGCCG
ACCCATTGGC AGAGCCTCTA TGACGCGGTG GATGCCGGGG TCACCGACCC GACCGAGCGG
GCGGCCGCGG CCTGGACCGC CTTCGCCGCC AACCCGTCGG GGACGGGGCC GTTCAAGGGC
GAGACCCTGG TGCCGCGGGA GCGGTTCGAG ATGGTGCGCA ACGAGAATTA CTGGGACCCG
GCGCGCACGC CGACCATCGA CCGGGTGGTT CTGGTGCCGC TGCCGGATGC CAACGCGCGC
ACGGCGGCCT TGCTGTCGGG GCAGGTGGAC TGGATCGAGG CGCCCGCGCC GGACGTGATC
CCGCGCATCG AATCCCAGGG CAACGTGATC TACGCCAACC CGCAGCCCCA TGTCTGGCCC
TGGCAGCTGT CCTTCGTGGA GGACTCGCCC TGGCTCGACA AGCGGGTGCG CCATGCGGCC
AACCTGTGTG TGAACCGCGA GGAGCTGCGG GTGCTTCTGG GGGGCTACAT GGGGGTGCCC
CAGGGCACGG TGAACGAGGG CCATCCGTGG TGGGGCAACC CGTCCTTCGA GATCAAGTAT
GACCCGGATG CGGCGCGCGC GCTGATGGAA GAGGCCGGGT TCGGCCCGGA CAACCCGCTG
AGCGTGACGG TGCAGACCTC GGCCTCGGGG TCGGGCCAGA TGCAGCCGGT GACCATGAAC
GAGTATGTCC AGCAGAACTT GGCGGAGTGT CATTTCGACG TGAACCTGGA CGTGATCGAG
TGGAACACGC TGTTCAACAA CTGGCGGGCC GGGGCGAAAT CCGAGGGCGC GCGGGACGCG
GATGCGATCA ACGTCTCCTT TGCGACGATG GACCCGTTCT TTGCCATGGT GCGGTTCGTC
TCGACCGAGA CCTTCCCGCC GGTGTCGAAC AACTGGGGCT TCTTCGGCAA TGACGAGTTC
GACGCGCTGA TCGCGGAGGC GCGGACCTCC TTCGACGATG CGGACCGGGA TGCGGCGCTC
GCACGGCTCC ATGCCCGGGT GGTCGAAGAG GCGCCCTTCG TCTGGATCGC CAACGACGTG
GGACCGCGCG CCATGAGCCC GCGGGTCGGC AACGTGGTTC AGCCCAAGAG CTGGTTCATC
GACATCGCGC CGATGACCCT CGACGACGAC ACCTGA
 
Protein sequence
MRTRFWATTA AATMAAVLAS GSAWAESVLT IGMTAADIPR TSGQPDQGFE GNRFTGIPMY 
DALTHWDLSS ETEASVVIPG LATEWSVNPD DTTKWTFKLR EGVTFHDGSP FNAEAVVWNV
EKVLDDEAPH FAPDQVGATA SRMPTLRSAR AIDELTVELT TSQPDAFLPL NMTMLFMASP
THWQSLYDAV DAGVTDPTER AAAAWTAFAA NPSGTGPFKG ETLVPRERFE MVRNENYWDP
ARTPTIDRVV LVPLPDANAR TAALLSGQVD WIEAPAPDVI PRIESQGNVI YANPQPHVWP
WQLSFVEDSP WLDKRVRHAA NLCVNREELR VLLGGYMGVP QGTVNEGHPW WGNPSFEIKY
DPDAARALME EAGFGPDNPL SVTVQTSASG SGQMQPVTMN EYVQQNLAEC HFDVNLDVIE
WNTLFNNWRA GAKSEGARDA DAINVSFATM DPFFAMVRFV STETFPPVSN NWGFFGNDEF
DALIAEARTS FDDADRDAAL ARLHARVVEE APFVWIANDV GPRAMSPRVG NVVQPKSWFI
DIAPMTLDDD T