Gene Information |
Locus tag | SeD_A4010 |
Symbol | dppA |
ID | 6873882 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 3854197 |
End bp | 3855777 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642786964 |
Product | dipeptide ABC transporter periplasmic dipeptide-binding protein |
Protein accession | YP_002217592 |
Protein GI | 198246114 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
| |
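The coordinate and length fields above are mutually consistent, which a short check can confirm. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon. Both assumptions are inferred from the values in this record, not stated by it. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3854197
END_BP = 3855777


def gene_length(start: int, end: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1581 526"
```

Under these assumptions the result matches the listed gene length of 1581 bp and protein length of 526 aa.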
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAGC TTGGTTTGAG CCTGGTGGCC ATGACCGTTG CAGCAAGCGT GCAGGCCAAA ACCCTGGTTT ATTGTTCAGA AGGCTCGCCG GAAGGCTTTA ACCCACAGCT CTTTACGTCT GGCACCACCT ATGATGCCAG CTCCGTACCT ATCTATAACC GTCTGGTTGA ATTCAAAACC GGCACCACGG AAGTGATCCC GGGTCTTGCT GAGAAGTGGG ATATCAGCGA AGACGGTAAA ACCTATACGT TCCACCTACG TAAAGGGGTG AAATGGCAAT CCAGCAAGGA TTTCAAACCC ACGCGCGAGC TGAACGCCGA TGATGTCGTG TTCTCTTTTG ACCGGCAGAA AAACGAGCAG AACCCGTACC ATAAAGTGTC TGGCGGCAGC TATGAATACT TTGAAGGCAT GGGGCTGCCG GATCTGATTA GCGAAGTGAA GAAGGTCGAC GATCACACGG TGCAGTTTGT GCTGACGCGT CCGGAAGCGC CGTTCCTTGC CGATTTAGCC ATGGACTTCG CCTCTATTCT TTCCAAAGAA TATGCTGACA ACATGCTGAA AGCCGGTACG CCGGAAAAAG TGGATCTGAA CCCGGTCGGC ACTGGCCCGT TCCAACTGGT GCAATATCAG AAAGACTCCC GCATTCTCTA CAAAGCCTTT GACGGCTACT GGGGCACGAA GCCGCAGATT GACCGTCTGG TCTTCTCCAT CACGCCTGAC GCCTCTGTGC GTTACGCCAA ACTGCAGAAG AACGAATGTC AGGTGATGCC GTATCCGAAC CCGGCGGATA TTGCGCGCAT GAAAGAAGAT AAAAACATCA ACCTGATGGA GCAGGCCGGT CTGAACGTGG GTTATCTCTC CTATAACGTG CAGAAAAAAC CGCTGGATGA TGTCAAAGTT CGCCAGGCGT TGACCTATGC CGTGAATAAA GAGGCCATCA TCAAAGCCGT TTATCAGGGC GCGGGCGTTG CGGCGAAAAA CCTGATCCCG CCGACGATGT GGGGCTACAA CGACGATATT AAAGACTACG GCTACGATCC GGAAAAAGCG AAGGTGCTGC TGAAAGAAGC CGGTCTGGAA AAAGGCTTCA CCATCGATCT ATGGGCGATG CCGGTACAGC GTCCCTATAA CCCGAATGCG CGTCGTATGG CGGAAATGAT CCAGGCGGAC TGGGCGAAGA TTGGCGTTCA GGCCAAAATT GTCACCTATG AATGGGGCGA ATACCTCAAG CGCGCTAAAG ATGGCGAGCA CCAGACGGTG ATGATGGGCT GGACCGGCGA TAATGGCGAT CCGGATAACT TCTTCGCCAC TCTGTTCAGC TGCGATGCCG CCCAGCAAGG CTCCAACTAT TCAAAATGGT GCTACAAGCC GTTTGAAGAC CTGATTCAGC CTGCGCGTGC GACCGATGAC CACAACAAGC GTATTGAGCT CTATAAACAG GCCCAGGTTG TGATGCATGA CCAGGCGCCA GCGCTGATCA TCGCTCACTC CACGGTTTAT GAGCCAGTGC GTAAAGAAGT TAAAGGCTAT GTGGTTGATC CATTAGGCAA ACATCACTTC GAAAACGTCT CTGTCGAATA A
|
Protein sequence | MLKLGLSLVA MTVAASVQAK TLVYCSEGSP EGFNPQLFTS GTTYDASSVP IYNRLVEFKT GTTEVIPGLA EKWDISEDGK TYTFHLRKGV KWQSSKDFKP TRELNADDVV FSFDRQKNEQ NPYHKVSGGS YEYFEGMGLP DLISEVKKVD DHTVQFVLTR PEAPFLADLA MDFASILSKE YADNMLKAGT PEKVDLNPVG TGPFQLVQYQ KDSRILYKAF DGYWGTKPQI DRLVFSITPD ASVRYAKLQK NECQVMPYPN PADIARMKED KNINLMEQAG LNVGYLSYNV QKKPLDDVKV RQALTYAVNK EAIIKAVYQG AGVAAKNLIP PTMWGYNDDI KDYGYDPEKA KVLLKEAGLE KGFTIDLWAM PVQRPYNPNA RRMAEMIQAD WAKIGVQAKI VTYEWGEYLK RAKDGEHQTV MMGWTGDNGD PDNFFATLFS CDAAQQGSNY SKWCYKPFED LIQPARATDD HNKRIELYKQ AQVVMHDQAP ALIIAHSTVY EPVRKEVKGY VVDPLGKHHF ENVSVE
|
| |
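The gene sequence above is printed in blocks of ten bases and, as listed, begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the record places the gene on the minus strand. The sketch below shows one way to strip the block spacing, translate the sequence, and compute GC content. It assumes the sense-codon assignments of translation table 11, which are the same as those of the standard code (the tables differ only in permitted start codons). The helper names are hypothetical, and no reverse complement is applied.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 21 bases of the gene sequence as printed above.
print(translate("ATGCTGAAGC TTGGTTTGAG C"))  # prints "MLKLGLS"
```

Pasting the full gene sequence into `translate` should reproduce the listed protein sequence once its block spacing is removed with `clean`, and `gc_percent` can be compared against the GC content field.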