Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3863 |
Symbol | dppA |
ID | 6144283 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3932413 |
End bp | 3934020 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641618690 |
Product | dipeptide ABC transporter, periplasmic dipeptide-binding protein |
Protein accession | YP_001745830 |
Protein GI | 170679624 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTATTT CCTTGAAAAA GTCAGGGATG CTGAAGCTTG GTCTCAGCCT GGTGGCTATG ACCGTCGCAG CAAGTGTTCA GGCTAAAACT CTGGTTTATT GCTCAGAAGG ATCTCCGGAA GGGTTTAACC CGCAGCTGTT TACCTCCGGT ACCACCTATG ACGCCTCTTC CGTACCGCTT TATAACCGCC TGGTTGAATT TAAAATCGGC ACCACCGAAG TGATCCCGGG CCTCGCTGAA AAGTGGGAAG TCAGCGAAGA CGGTAAAACC TATACCTTCC ATCTGCGTAA AGGTGTGAAG TGGCACGACA ATAAAGAATT CAAACCGACG CGCGAACTGA ACGCCGATGA CGTGGTGTTC TCGTTCGATC GTCAGAAAAA CGCGCAAAAC CCGTACCATA AAGTTTCTGG CGGCAGCTAC GAATACTTTG AAGGCATGGG CTTGCCGGAG CTGATCAGCG AAGTGAAAAA GGTGGACGAT AACACCGTTC AGTTTGTGCT GACTCGCCCG GAAGCGCCGT TCCTCGCTGA CCTGGCAATG GACTTCGCCT CTATTCTGTC AAAAGAATAT GCTGACGCGA TGATGAAAGC CGGTACACCG GAAAAACTGG ATCTCAACCC AATCGGTACC GGCCCGTTCC AGTTACAGCA GTACCAGAAA GATTCCCGTA TTCGCTATAA AGCGTTTGAT GGCTACTGGG GCACCAAACC GAAGATCGAT ACGCTGGTCT TCTCTATTAC TCCTGACGCT TCCGTGCGTT ACGCGAAATT GCAGAAGAAC GAATGCCAGG TGATGCCGTA CCCGAACCCG GCAGATATCG CCCGCATGAA GCAGGATAAA TCCATCAACC TGATGGAAAT GCCGGGGCTG AATGTCGGTT ATCTCTCGTA TAACGTGCAG AAAAAACCAC TGGATGACGT GAAAGTTCGC CAGGCTCTGA CCTACGCGGT GAACAAAGAC GCCATCATCA AAGCAGTTTA TCAGGGCGCG GGCGTATCAG CGAAAAACCT GATCCCGCCA ACCATGTGGG GCTATAACGA CGACGTTCAG GACTACACTT ACGATCCTGA AAAAGCGAAA GCCTTGCTGA AAGAAGCGGG TCTGGAAAAA GGTTTCTCCA TCGACTTGTG GGCAATGCCG GTACAACGTC CGTATAACCC GAACGCTCGC CGCATGGCGG AGATGATTCA GGCAGACTGG GCGAAAGTCG GCGTGCAGGC CAAAATCGTC ACCTACGAAT GGGGTGAGTA CCTCAAGCGT GCGAAAGATG GCGAGCATCA GACGGTAATG ATGGGCTGGA CTGGCGATAA CGGGGATCCG GATAACTTCT TCGCCACCCT GTTCAGCTGC GCCGCCTCTG AACAAGGCTC CAACTACTCA AAATGGTGCT ACAAACCGTT TGAAGATCTG ATTCAACCGG CGCGTGCTAC CGACGACCAC AATAAACGTG TTGAACTGTA CAAACAAGCA CAGGTCGTGA TGCACGATCA GGCTCCGGCA CTGATCATCG CTCACTCCAC CGTGTTTGAA CCGGTACGCA AAGAAGTTAA AGGCTATGTG GTTGATCCAT TAGGCAAACA TCACTTCGAA AATGTCTCTA TCGAATAA
|
Protein sequence | MRISLKKSGM LKLGLSLVAM TVAASVQAKT LVYCSEGSPE GFNPQLFTSG TTYDASSVPL YNRLVEFKIG TTEVIPGLAE KWEVSEDGKT YTFHLRKGVK WHDNKEFKPT RELNADDVVF SFDRQKNAQN PYHKVSGGSY EYFEGMGLPE LISEVKKVDD NTVQFVLTRP EAPFLADLAM DFASILSKEY ADAMMKAGTP EKLDLNPIGT GPFQLQQYQK DSRIRYKAFD GYWGTKPKID TLVFSITPDA SVRYAKLQKN ECQVMPYPNP ADIARMKQDK SINLMEMPGL NVGYLSYNVQ KKPLDDVKVR QALTYAVNKD AIIKAVYQGA GVSAKNLIPP TMWGYNDDVQ DYTYDPEKAK ALLKEAGLEK GFSIDLWAMP VQRPYNPNAR RMAEMIQADW AKVGVQAKIV TYEWGEYLKR AKDGEHQTVM MGWTGDNGDP DNFFATLFSC AASEQGSNYS KWCYKPFEDL IQPARATDDH NKRVELYKQA QVVMHDQAPA LIIAHSTVFE PVRKEVKGYV VDPLGKHHFE NVSIE
|
| |