Gene Information |
Locus tag | Dshi_3792 |
Symbol | |
ID | 5714460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009956 |
Strand | + |
Start bp | 167 |
End bp | 1822 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641276707 |
Product | extracellular solute-binding protein |
Protein accession | YP_001542003 |
Protein GI | 159046332 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
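The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; both assumptions match the numbers in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 167
END_BP = 1822


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the results agree with the Gene Length (1656 bp) and Protein Length (551 aa) fields.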
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.639455 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.694676 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCACAC GATTTTGGGC AACGACTGCG GCCGCGACCA TGGCCGCCGT TCTGGCCTCC GGCTCGGCCT GGGCTGAAAG CGTTCTGACG ATCGGGATGA CGGCGGCGGA TATTCCGCGC ACCTCCGGTC AGCCGGACCA GGGTTTCGAG GGCAACCGCT TCACCGGGAT TCCGATGTAC GACGCCCTGA CCCATTGGGA CCTGTCGTCC GAGACCGAGG CGTCGGTGGT GATCCCCGGG CTGGCAACCG AATGGTCCGT GAACCCGGAT GACACGACCA AGTGGACCTT CAAGCTGCGC GAGGGCGTGA CCTTCCATGA CGGCTCGCCC TTCAACGCCG AGGCGGTGGT CTGGAACGTG GAGAAGGTGC TCGACGACGA GGCGCCGCAT TTCGCCCCCG ACCAGGTGGG TGCGACCGCG TCGCGGATGC CGACCCTGCG CTCGGCCCGG GCGATCGACG AGCTTACGGT GGAGCTGACC ACGTCACAGC CCGATGCGTT CCTGCCGCTG AACATGACGA TGCTCTTCAT GGCCTCGCCG ACCCATTGGC AGAGCCTCTA TGACGCGGTG GATGCCGGGG TCACCGACCC GACCGAGCGG GCGGCCGCGG CCTGGACCGC CTTCGCCGCC AACCCGTCGG GGACGGGGCC GTTCAAGGGC GAGACCCTGG TGCCGCGGGA GCGGTTCGAG ATGGTGCGCA ACGAGAATTA CTGGGACCCG GCGCGCACGC CGACCATCGA CCGGGTGGTT CTGGTGCCGC TGCCGGATGC CAACGCGCGC ACGGCGGCCT TGCTGTCGGG GCAGGTGGAC TGGATCGAGG CGCCCGCGCC GGACGTGATC CCGCGCATCG AATCCCAGGG CAACGTGATC TACGCCAACC CGCAGCCCCA TGTCTGGCCC TGGCAGCTGT CCTTCGTGGA GGACTCGCCC TGGCTCGACA AGCGGGTGCG CCATGCGGCC AACCTGTGTG TGAACCGCGA GGAGCTGCGG GTGCTTCTGG GGGGCTACAT GGGGGTGCCC CAGGGCACGG TGAACGAGGG CCATCCGTGG TGGGGCAACC CGTCCTTCGA GATCAAGTAT GACCCGGATG CGGCGCGCGC GCTGATGGAA GAGGCCGGGT TCGGCCCGGA CAACCCGCTG AGCGTGACGG TGCAGACCTC GGCCTCGGGG TCGGGCCAGA TGCAGCCGGT GACCATGAAC GAGTATGTCC AGCAGAACTT GGCGGAGTGT CATTTCGACG TGAACCTGGA CGTGATCGAG TGGAACACGC TGTTCAACAA CTGGCGGGCC GGGGCGAAAT CCGAGGGCGC GCGGGACGCG GATGCGATCA ACGTCTCCTT TGCGACGATG GACCCGTTCT TTGCCATGGT GCGGTTCGTC TCGACCGAGA CCTTCCCGCC GGTGTCGAAC AACTGGGGCT TCTTCGGCAA TGACGAGTTC GACGCGCTGA TCGCGGAGGC GCGGACCTCC TTCGACGATG CGGACCGGGA TGCGGCGCTC GCACGGCTCC ATGCCCGGGT GGTCGAAGAG GCGCCCTTCG TCTGGATCGC CAACGACGTG GGACCGCGCG CCATGAGCCC GCGGGTCGGC AACGTGGTTC AGCCCAAGAG CTGGTTCATC GACATCGCGC CGATGACCCT CGACGACGAC ACCTGA
|
Protein sequence | MRTRFWATTA AATMAAVLAS GSAWAESVLT IGMTAADIPR TSGQPDQGFE GNRFTGIPMY DALTHWDLSS ETEASVVIPG LATEWSVNPD DTTKWTFKLR EGVTFHDGSP FNAEAVVWNV EKVLDDEAPH FAPDQVGATA SRMPTLRSAR AIDELTVELT TSQPDAFLPL NMTMLFMASP THWQSLYDAV DAGVTDPTER AAAAWTAFAA NPSGTGPFKG ETLVPRERFE MVRNENYWDP ARTPTIDRVV LVPLPDANAR TAALLSGQVD WIEAPAPDVI PRIESQGNVI YANPQPHVWP WQLSFVEDSP WLDKRVRHAA NLCVNREELR VLLGGYMGVP QGTVNEGHPW WGNPSFEIKY DPDAARALME EAGFGPDNPL SVTVQTSASG SGQMQPVTMN EYVQQNLAEC HFDVNLDVIE WNTLFNNWRA GAKSEGARDA DAINVSFATM DPFFAMVRFV STETFPPVSN NWGFFGNDEF DALIAEARTS FDDADRDAAL ARLHARVVEE APFVWIANDV GPRAMSPRVG NVVQPKSWFI DIAPMTLDDD T
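The sequence fields are printed in space-separated blocks of ten characters, so any check on them has to strip whitespace first. The following is a minimal sketch of how the GC content field and the start and stop codons could be verified from the gene sequence. The helper names are hypothetical, and the start-codon set lists only three common bacterial starts rather than every start codon allowed by translation table 11.

```python
# Stop codons in translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Common start codons only; table 11 permits a few more (assumption: enough here).
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def has_start_and_stop(seq: str) -> bool:
    """True if the CDS begins with a common start codon and ends with a stop."""
    s = clean(seq)
    return len(s) >= 6 and s[:3] in COMMON_START_CODONS and s[-3:] in STOP_CODONS


# First two blocks of the gene sequence above, used as a small demonstration.
first_blocks = "ATGCGCACAC GATTTTGGGC"
print(round(gc_fraction(first_blocks), 2))
```

Passing the full Gene sequence value to `gc_fraction` and rounding to a whole percent should reproduce the GC content field if the record is internally consistent. `has_start_and_stop` reflects that the sequence begins with ATG and ends with TGA.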
|
| |