Gene Information |
Locus tag | EcSMS35_3859 |
Symbol | dppF |
ID | 6146528 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3928336 |
End bp | 3929340 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641618685 |
Product | dipeptide transporter ATP-binding subunit |
Protein accession | YP_001745825 |
Protein GI | 170680787 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4608] ABC-type oligopeptide transport system, ATPase component |
TIGRFAM ID | [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain |
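The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 3928336, 3929340

length_bp = gene_length_bp(start_bp, end_bp)
print(length_bp)                     # 1005
print(protein_length_aa(length_bp))  # 334
```

Both results match the "Gene Length" and "Protein Length" fields. The strand field does not affect this calculation, since the length is the same on either strand.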
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.712136 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACGC AAGAGGCCAC CTCGCAACAA CCGCTGTTGC AGGCTATCGA CCTGAAAAAA CATTATCCGG TGAAGAAAGG TATGTTCGCG CCGGAACGTC TGGTGAAGGC GCTGGACGGC GTTTCGTTTA ACCTTGAACG TGGCAAAACG CTGGCAGTAG TAGGTGAATC TGGCTGCGGT AAATCGACCC TCGGTCGGTT ACTGACGATG ATCGAAACGC CCACCGGTGG TGAGCTGTAT TACCAGGGGC AGGATCTGCT CAAGCACGAT CCGCAGGCGC AGAAGTTGCG TCGGCAGAAA ATCCAGATCG TCTTTCAGAA TCCCTATGGT TCGCTAAATC CGCGTAAAAA AGTTGGGCAA ATTCTTGAAG AGCCGCTGCT TATTAATACC AGCTTAAGCA AAGATCAGCG TCGGGAAAAA GCCCTGTCGA TGATGGCGAA AGTCGGCCTG AAAACCGAGC ATTACGACCG CTATCCGCAT ATGTTCTCCG GCGGTCAGCG TCAGCGTATC GCCATTGCCC GTGGTCTGAT GCTCGACCCG GATGTGGTGA TTGCCGATGA GCCGGTTTCC GCGCTGGACG TCTCAGTGCG TGCGCAGGTG CTGAATCTGA TGATGGATTT GCAGCAGGAG TTGGGGCTGT CTTATGTCTT TATCTCCCAC GACCTGTCAG TGGTTGAGCA CATTGCCGAT GAAGTGATGG TGATGTACCT GGGCCGCTGC GTGGAGAAGG GAACGAAGGA CCAAATCTTC AATAACCCGC GTCATCCGTA CACTCAGGCG CTACTCTCCG CGACGCCGCG CCTGAACCCG GACGATCGCC GCGAGCGCAT CAAGCTCACC GGTGAACTGC CAAGCCCGCT CAATCCACCG CCGGGTTGCG CCTTCAACGC CCGCTGTCGT CGGCGCTTCG GCCCCTGCAC CCAGTTGCAG CCGCAGCTAA AAGACTACGG CGGTCAACTG GTAGCTTGTT TTGCTGTTGA TCAGGATGAA AATCCGCAGC GTTAA
|
Protein sequence | MSTQEATSQQ PLLQAIDLKK HYPVKKGMFA PERLVKALDG VSFNLERGKT LAVVGESGCG KSTLGRLLTM IETPTGGELY YQGQDLLKHD PQAQKLRRQK IQIVFQNPYG SLNPRKKVGQ ILEEPLLINT SLSKDQRREK ALSMMAKVGL KTEHYDRYPH MFSGGQRQRI AIARGLMLDP DVVIADEPVS ALDVSVRAQV LNLMMDLQQE LGLSYVFISH DLSVVEHIAD EVMVMYLGRC VEKGTKDQIF NNPRHPYTQA LLSATPRLNP DDRRERIKLT GELPSPLNPP PGCAFNARCR RRFGPCTQLQ PQLKDYGGQL VACFAVDQDE NPQR
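The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the method used to produce this record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard one, which shares its amino acid assignments with translation table 11 (table 11 differs only in the set of permitted start codons, which this sketch does not handle). The sequence is assumed to be given 5' to 3' in the coding orientation, as it appears above.

```python
from itertools import product

# Standard codon assignments, enumerated in TCAG order.
# Translation table 11 uses the same assignments; alternative start codons are not modeled here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-base blocks and normalize case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First and last blocks of the gene sequence, copied from the record.
print(translate("ATGAGTACGC AAGAGGCCAC CTCGCAACAA"))  # MSTQEATSQQ
print(translate("AATCCGCAGC GTTAA"))                  # NPQR*
```

The two outputs agree with the first ten and last four residues of the protein sequence, followed by the stop codon. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the "GC content" field.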