Gene EcSMS35_3859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3859 
SymboldppF 
ID6146528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3928336 
End bp3929340 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content56% 
IMG OID641618685 
Productdipeptide transporter ATP-binding subunit 
Protein accessionYP_001745825 
Protein GI170680787 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4608] ABC-type oligopeptide transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.712136 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACGC AAGAGGCCAC CTCGCAACAA CCGCTGTTGC AGGCTATCGA CCTGAAAAAA 
CATTATCCGG TGAAGAAAGG TATGTTCGCG CCGGAACGTC TGGTGAAGGC GCTGGACGGC
GTTTCGTTTA ACCTTGAACG TGGCAAAACG CTGGCAGTAG TAGGTGAATC TGGCTGCGGT
AAATCGACCC TCGGTCGGTT ACTGACGATG ATCGAAACGC CCACCGGTGG TGAGCTGTAT
TACCAGGGGC AGGATCTGCT CAAGCACGAT CCGCAGGCGC AGAAGTTGCG TCGGCAGAAA
ATCCAGATCG TCTTTCAGAA TCCCTATGGT TCGCTAAATC CGCGTAAAAA AGTTGGGCAA
ATTCTTGAAG AGCCGCTGCT TATTAATACC AGCTTAAGCA AAGATCAGCG TCGGGAAAAA
GCCCTGTCGA TGATGGCGAA AGTCGGCCTG AAAACCGAGC ATTACGACCG CTATCCGCAT
ATGTTCTCCG GCGGTCAGCG TCAGCGTATC GCCATTGCCC GTGGTCTGAT GCTCGACCCG
GATGTGGTGA TTGCCGATGA GCCGGTTTCC GCGCTGGACG TCTCAGTGCG TGCGCAGGTG
CTGAATCTGA TGATGGATTT GCAGCAGGAG TTGGGGCTGT CTTATGTCTT TATCTCCCAC
GACCTGTCAG TGGTTGAGCA CATTGCCGAT GAAGTGATGG TGATGTACCT GGGCCGCTGC
GTGGAGAAGG GAACGAAGGA CCAAATCTTC AATAACCCGC GTCATCCGTA CACTCAGGCG
CTACTCTCCG CGACGCCGCG CCTGAACCCG GACGATCGCC GCGAGCGCAT CAAGCTCACC
GGTGAACTGC CAAGCCCGCT CAATCCACCG CCGGGTTGCG CCTTCAACGC CCGCTGTCGT
CGGCGCTTCG GCCCCTGCAC CCAGTTGCAG CCGCAGCTAA AAGACTACGG CGGTCAACTG
GTAGCTTGTT TTGCTGTTGA TCAGGATGAA AATCCGCAGC GTTAA
 
Protein sequence
MSTQEATSQQ PLLQAIDLKK HYPVKKGMFA PERLVKALDG VSFNLERGKT LAVVGESGCG 
KSTLGRLLTM IETPTGGELY YQGQDLLKHD PQAQKLRRQK IQIVFQNPYG SLNPRKKVGQ
ILEEPLLINT SLSKDQRREK ALSMMAKVGL KTEHYDRYPH MFSGGQRQRI AIARGLMLDP
DVVIADEPVS ALDVSVRAQV LNLMMDLQQE LGLSYVFISH DLSVVEHIAD EVMVMYLGRC
VEKGTKDQIF NNPRHPYTQA LLSATPRLNP DDRRERIKLT GELPSPLNPP PGCAFNARCR
RRFGPCTQLQ PQLKDYGGQL VACFAVDQDE NPQR