Gene Information |
Locus tag | SeD_A4008 |
Symbol | dppC |
ID | 6875493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 3852109 |
End bp | 3853011 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642786962 |
Product | dipeptide transporter |
Protein accession | YP_002217590 |
Protein GI | 198245574 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
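The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates, and that the stop codon counts toward Gene Length but not Protein Length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3852109, 3853011)
print(length)                  # 903
print(protein_length(length))  # 300
```

Both values match the Gene Length (903 bp) and Protein Length (300 aa) fields of this record.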
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.912863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCACAGG TTACTGAAAA TAATGTTAAT GCCGCACCGG CGCCCATGAC GCCATTGCGG GAGTTCTGGC ACTATTTCAA ACGCAACAAA GGCGCGGTCG TCGGGCTGGC GTATGTTCTC ATCGTGATCC TGATTGCGGT GTTTGCCAAC TTTATTGCGC CGTACAACCC GGCAGAGCAG TTCCGTGATG CGCTGCTGGC ACCGCCGGTC TGGCAGGAAG GCGGCAGTTG GGCGCATATT CTGGGAACGG ATGATGTTGG TCGCGATGTC CTCTCGCGCC TGATGTATGG CGCGCGTTTG TCACTGCTGG TCGGCTGTCT GGTGGTCGTC CTGTCGCTGG TAATGGGGAT CATTCTCGGC CTGGTCGCGG GCTACTTCGG CGGTCTGGTC GATAACATCA TCATGCGCGT GGTCGATATT ATGCTGGCCC TGCCGAGCCT GCTGCTGGCG CTGGTGCTGG TGGCGATCTT CGGCCCCTCC ATCGGCAACG CTGCGCTGGC GTTGACGTTT GTGGCGCTGC CGCACTATGT CCGCTTAACC CGCGCGGCGG TTCTGGTAGA GGTGAACCGC GATTATGTGA CTGCCTCCCG CGTGGCGGGC GCAGGCGCGA TGCGTCAGAT GTTCGTCAAT ATTTTCCCGA ACTGCCTTGC GCCGCTGATC GTTCAGGCGT CACTGGGCTT CTCTAACGCC ATTCTCGATA TGGCTGCCCT CGGCTTCCTG GGGATGGGCG CACAGCCGCC TACGCCGGAA TGGGGCACCA TGCTCTCTGA CGTTCTGCAG TTCGCGCAAA GCGCCTGGTG GGTCGTGACC TTCCCGGGGC TGGCGATTCT GCTGACGGTA CTGGCATTTA ACCTGATGGG TGACGGGCTG CGTGACGCGC TCGATCCCAA ACTGAAGCAG TAA
Protein sequence | MSQVTENNVN AAPAPMTPLR EFWHYFKRNK GAVVGLAYVL IVILIAVFAN FIAPYNPAEQ FRDALLAPPV WQEGGSWAHI LGTDDVGRDV LSRLMYGARL SLLVGCLVVV LSLVMGIILG LVAGYFGGLV DNIIMRVVDI MLALPSLLLA LVLVAIFGPS IGNAALALTF VALPHYVRLT RAAVLVEVNR DYVTASRVAG AGAMRQMFVN IFPNCLAPLI VQASLGFSNA ILDMAALGFL GMGAQPPTPE WGTMLSDVLQ FAQSAWWVVT FPGLAILLTV LAFNLMGDGL RDALDPKLKQ
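The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch, the gene sequence can be cleaned, translated, and checked for GC content in plain Python. The codon-to-amino-acid assignments used here are those of the standard genetic code, which translation table 11 shares; table 11 differs in the start codons it permits, and this sketch does not model alternative starts (this gene begins with ATG). All function names are illustrative assumptions.

```python
import itertools

# Standard codon assignments in NCBI order (bases T, C, A, G; third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGTCACAGG TTACTGAAAA TAATGTTAAT"
print(translate(prefix))   # MSQVTENNVN
print(gc_content("ATGC"))  # 50.0
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.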