Gene ECH74115_4909 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4909 
SymboldppD 
ID6970254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4548116 
End bp4549099 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content56% 
IMG OID643388595 
Productdipeptide transporter ATP-binding subunit 
Protein accessionYP_002273023 
Protein GI209398420 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTTAT TAAATGTAGA TAAATTATCG GTGCATTTCG GCGACGAAAG CGCACCGTTC 
CGCGCCGTAG ACCGCATTAG CTACAGCGTA AAACAGGGCG AAGTAGTCGG GATTGTGGGT
GAGTCTGGCT CCGGTAAGTC GGTCAGTTCG CTGGCGATTA TGGGGCTGAT TGATTATCCG
GGCCGCGTGA TGGCGGAAAA GCTGGAGTTT AACGGTCAGG ATTTGCAGCG TATCTCGGAA
AAAGAGCGCC GCAACCTGGT GGGTGCCGAA GTGGCGATGA TCTTCCAGGA CCCGATGACC
AGCCTTAACC CGTGCTACAC CGTGGGTTTC CAGATTATGG AAGCGATTAA GGTGCATCAG
GGCGGTAATA AGAGCACTCG TCGTCAGCGG GCGATTGACT TGTTGAATCA GGTCGGTATT
CCCGATCCGG CTTCGCGTCT GGATGTTTAC CCGCATCAGC TTTCCGGCGG CATGAGCCAG
CGCGTGATGA TTGCGATGGC CATTGCCTGT CGGCCAAAAC TGCTGATTGC CGATGAACCA
ACCACTGCGC TGGACGTGAC CATTCAGGCG CAAATCATTG AACTACTGCT GGAGCTACAG
CAGAAAGAGA ACATGGCGCT GGTGTTAATT ACCCATGACC TGGCGCTGGT GGCGGAAGCG
GCACATAAAA TCATCGTGAT GTATGCCGGT CAGGTGGTGG AAACGGGCGA TGCGCACGCC
ATCTTCCATG CGCCGCGTCA CCCGTATACT CAGGCATTGC TGCGTGCGCT GCCGGAATTT
GCTCAGGACA AAGAACGTCT GGCATCGTTG CCTGGCGTTG TTCCCGGCAA GTACGACCGC
CCGAACGGCT GCTTGCTTAA CCCGCGCTGC CCCTATGCCA CTGACAGATG TCGCGCTGAA
GAACCGGCGC TGAATATGCT CGCTGACGGG CGTCAGTCCA AATGCCATTA CCCACTTGAT
GATGCCGGGA GGCCGACACT ATGA
 
Protein sequence
MALLNVDKLS VHFGDESAPF RAVDRISYSV KQGEVVGIVG ESGSGKSVSS LAIMGLIDYP 
GRVMAEKLEF NGQDLQRISE KERRNLVGAE VAMIFQDPMT SLNPCYTVGF QIMEAIKVHQ
GGNKSTRRQR AIDLLNQVGI PDPASRLDVY PHQLSGGMSQ RVMIAMAIAC RPKLLIADEP
TTALDVTIQA QIIELLLELQ QKENMALVLI THDLALVAEA AHKIIVMYAG QVVETGDAHA
IFHAPRHPYT QALLRALPEF AQDKERLASL PGVVPGKYDR PNGCLLNPRC PYATDRCRAE
EPALNMLADG RQSKCHYPLD DAGRPTL