Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3739 |
Symbol | dppF |
ID | 5593766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3731335 |
End bp | 3732339 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640922853 |
Product | dipeptide transporter ATP-binding subunit |
Protein accession | YP_001460332 |
Protein GI | 157163014 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4608] ABC-type oligopeptide transport system, ATPase component |
TIGRFAM ID | [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 75 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTACGC AAGAGGCCAC CCTGCAACAA CCGCTGTTGC AGGCTATCGA CCTGAAAAAA CATTATCCGG TGAAGAAAGG CATGTTCGCG CCGGAACGTC TGGTTAAAGC GCTGGATGGC GTTTCGTTTA ACCTTGAACG TGGCAAAACG CTGGCAGTAG TGGGCGAATC TGGCTGCGGT AAATCGACCC TCGGTCGGTT GCTGACGATG ATTGAAATGC CCACCGGTGG CGAGCTGTAT TACCAGGGGC AGGATCTGCT TAAGCACGAT CCGCAGGCGC AGAAGCTGCG TCGGCAGAAA ATCCAGATCG TCTTCCAGAA CCCTTACGGT TCGCTGAATC CGCGTAAAAA AGTCGGGCAA ATTCTTGAAG AGCCGCTGCT GATCAACACC AGCTTAAGCA AAGAACAGCG TCGGGAAAAA GCCCTGTCGA TGATGGCGAA AGTCGGCCTG AAAACCGAGC ACTATGACCG CTATCCGCAT ATGTTCTCCG GCGGTCAGCG TCAGCGTATC GCCATCGCCC GTGGTCTGAT GCTCGACCCG GATGTGGTGA TTGCCGATGA ACCGGTTTCC GCGCTGGATG TTTCAGTGCG CGCGCAGGTG CTGAATCTGA TGATGGATTT GCAGCAGGAG TTGGGGCTGT CTTATGTCTT TATCTCCCAC GACCTGTCGG TGGTGGAGCA CATTGCTGAT GAAGTGATGG TGATGTACCT GGGCCGCTGC GTGGAGAAGG GAACGAAAGA CCAAATCTTC AATAACCCGC GCCATCCGTA CACTCAGGCG CTACTTTCCG CGACGCCGCG CCTGAACCCG GACGATCGCC GCGAGCGCAT CAAGCTCAGC GGTGAACTAC CAAGCCCACT GAATCCACCG CCGGGTTGCG CCTTCAACGC CCGCTGTCGT CGGCGCTTCG GCCCCTGCAC CCAGTTGCAG CCGCAGCTAA AAGACTACGG CGGTCAACTG GTAGCTTGTT TTGCTGTTGA TCAGGATGAA AATCCGCAGC GTTAA
|
Protein sequence | MSTQEATLQQ PLLQAIDLKK HYPVKKGMFA PERLVKALDG VSFNLERGKT LAVVGESGCG KSTLGRLLTM IEMPTGGELY YQGQDLLKHD PQAQKLRRQK IQIVFQNPYG SLNPRKKVGQ ILEEPLLINT SLSKEQRREK ALSMMAKVGL KTEHYDRYPH MFSGGQRQRI AIARGLMLDP DVVIADEPVS ALDVSVRAQV LNLMMDLQQE LGLSYVFISH DLSVVEHIAD EVMVMYLGRC VEKGTKDQIF NNPRHPYTQA LLSATPRLNP DDRRERIKLS GELPSPLNPP PGCAFNARCR RRFGPCTQLQ PQLKDYGGQL VACFAVDQDE NPQR
|
| |