Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1490 |
Symbol | |
ID | 5712669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1548941 |
End bp | 1550608 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641267405 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_001532833 |
Protein GI | 159044039 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.758744 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATGT CCACCATCAA CGGCAAACCC ATGCATCCGG CGGTCGCAAC CTACGCCCAG GACTGCCGCA CCGGCACGCT CAGCCGCCGC GAATTCCTGT CCTACGCCAC TGCCCTCGGC GCCACCTCCG TCGCCGCCTA CGGCATGATC GGGGCCAAAC CGGCCCGCGC GATGACCGCC ACACCGGCCC AGGGCGGCAC GCTGCGCATC CAGCACCTGG TCAAGGCGAT GAAGGAGCCG CGGACCTATG ACTGGTCCGA GCTTGGCAAC CACTCCCGCG GGTTCCTCGA ATATCTCGTG GAATACAATG CCGACGGAAC CTTCCGCGGC ATGCTGCTGG AAAGCTGGGA AGTGAACGAC GACGCCACCG TCTATACCCT CAACGTCCGG CCCGGCGTGA CCTGGAACAA CGGCGACGCC TTTACCGCCG AGGACGTGGC GCGCAACATT ACCGGCTGGT GCGACAGCTC GCTCGAAGGC AACTCCATGG CCACCCGCGT GCAGGGGCTG ATCGACGAAG CCACCGGCCA AGCCCGGGAG GGCGCGATCG AGATCGTCGA TGACATGACC GTGCGCCTGA CCCTCAGTGC GCCCGACATC ACCCTGATCG CCACCATGTC CGACTACCCC GCCGCGATCA CCCATGCCAG CTACGAGGGC GGCAACCCGT TCGACCACGG CATCGGCACC GGCCCCTATC GCCCGGTCAG TTTCGAGGCG AACGTGCGCG CCGTGCTGGA ACGCGCCACC GACCACACCT GGTGGGGCAC CGAGGTCTAT GGCGGCCCCT ATGTCGACCG GATCGAGTTC GTCGATTTCG GCACCGACCC GGCCACCTGG CTGGCGGGCG CCGAGGCCGA GGAATTCGAC CTGACCTACG AGACCACCGG CGAGTTCGTC GACATCTTCA GCGCCATCGG TTGGTCCGTC ACCGAGGCCG TGACCGGCGC CACCGTCGTG TGCCGTCCCA ACCAGGCCGC CGAGATCGAC GGCGTGACGC CCTATGCGGA CGTGAACGTG CGCCGGGCGC TTGCGATGGC CGTGGATAAC AGCGTGCTGC TGGAGCTGGG CTACAACAAC CAGGGCACCG CGGCGGAAAA TCACCATGTC TGCCCGATCC ATCCGGAATA CGCCGATATC GGCGCGCCGG AAACGGACCC GGCCAAGGCC AAGGAGATGA TCGACGCCGC CGGCCTGGGC GATTTCACCC ACACCTTCAT CACCCCCGAT GAAGAGTGGC TCGCCAATAC TGGCGCCGCC CTGACCGCGC AGCTGCGCGA CGCGGGCATC CAGGTCGATC ACCGCATCCA GCCCGGCGCC ACCTTCTGGG GCGACTGGAC CAAGCATGCG TTCTCGGCCA CCTCCTGGAA CCACCGGCCC CTGGGCGTGC AGATCCTGGC ACTGGCCTAC CGCTCGGGCG AGGCATGGAA CGAATCCGCC TATGCCAACC CGGAGTTCGA CGCGGCCCTC GCCGAGGCCC TGGCCATCGC CGATGCCGAC AAGCGCCGCG AAGTCATGGG CAAGGTGCAG CAAATCCTGC GCGACGACGG CGTGATCATC CAGCCCTACT GGCGGTCGCT CTACAACCAC CACCGGGGCG ACGTGGTCAA TGCCGAGAAG CATCCGAGCC ACGAGATCCA CGTCTACAAG CTCGGCTTCG CGGCCTGA
|
Protein sequence | MTMSTINGKP MHPAVATYAQ DCRTGTLSRR EFLSYATALG ATSVAAYGMI GAKPARAMTA TPAQGGTLRI QHLVKAMKEP RTYDWSELGN HSRGFLEYLV EYNADGTFRG MLLESWEVND DATVYTLNVR PGVTWNNGDA FTAEDVARNI TGWCDSSLEG NSMATRVQGL IDEATGQARE GAIEIVDDMT VRLTLSAPDI TLIATMSDYP AAITHASYEG GNPFDHGIGT GPYRPVSFEA NVRAVLERAT DHTWWGTEVY GGPYVDRIEF VDFGTDPATW LAGAEAEEFD LTYETTGEFV DIFSAIGWSV TEAVTGATVV CRPNQAAEID GVTPYADVNV RRALAMAVDN SVLLELGYNN QGTAAENHHV CPIHPEYADI GAPETDPAKA KEMIDAAGLG DFTHTFITPD EEWLANTGAA LTAQLRDAGI QVDHRIQPGA TFWGDWTKHA FSATSWNHRP LGVQILALAY RSGEAWNESA YANPEFDAAL AEALAIADAD KRREVMGKVQ QILRDDGVII QPYWRSLYNH HRGDVVNAEK HPSHEIHVYK LGFAA
|
| |