Gene ECH74115_4911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4911 
SymboldppB 
ID6970111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4550022 
End bp4551167 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content56% 
IMG OID643388597 
Productdipeptide transporter permease DppB 
Protein accessionYP_002273025 
Protein GI209398114 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCTCTGC GTGCAGCGCC ATCCGGCAGC AATACTTCAT TCCCTGCCCG ACCTGGGCGG 
GAATTGATTT GTGAGCAATA CAGACACGCA GTTCCAGGCT GCGGGTCACT ACAGAGAATC
CGGGTTATGT TGCAGTTTAT TCTCCGACGT TTGGGACTCG TCATCCCCAC GTTTATCGGT
ATTACCCTTC TCACATTTGC CTTTGTCCAC ATGATCCCGG GCGATCCTGT GATGATCATG
GCGGGCGAAC GTGGGATCTC CCCAGAGCGT CACGCGCAGT TGCTGGCTGA ACTCGGCTTA
GATAAACCGA TGTGGCAGCA GTATCTCCAT TACATTTGGG GCGTTATGCA TGGCGATCTA
GGCATTTCAA TGAAAAGCCG AATTCCGGTA TGGGAAGAGT TCGTGCCGCG CTTTCAGGCC
ACGCTGGAAC TTGGCGTCTG CGCGATGATT TTTGCGACCG CGGTCGGCAT TCCGGTTGGT
GTGCTGGCCG CGGTTAAACG CGGTTCCATT TTCGATCACA CAGCGGTTGG CCTGGCGCTG
ACAGGTTATT CAATGCCTAT CTTCTGGTGG GGCATGATGC TGATCATGCT GGTTTCGGTG
CACTGGAACC TGACACCCGT CTCCGGTCGC GTGAGCGATA TGGTGTTCCT CGATGATTCC
AATCCGTTAA CCGGTTTTAT GCTGATCGAC ACCGCCATCT GGGGTGAAGA CGGTAACTTT
ATCGATGCCG TCGCCCATAT GATCTTGCCC GCCATTGTGC TGGGTACTAT TCCGCTGGCG
GTCATTGTGC GTATGACACG CTCCTCGATG CTGGAAGTGC TGGGTGAAGA TTACATCCGC
ACCGCGCGCG CCAAAGGGTT GACCCGCATG CGGGTGATTA TCGTCCATGC GCTGCGTAAC
GCGATGCTGC CGGTAGTGAC CGTTATCGGC CTCCAGGTGG GAACATTGCT GGCGGGGGCG
ATTCTGACCG AAACCATCTT CTCGTGGCCC GGTCTGGGGC GCTGGTTGAT TGACGCACTG
CAACGGCGCG ATTATCCGGT GGTGCAGGGC GGCGTATTGC TGGTGGCGAC GATGATTATC
CTCGTCAACC TGCTGGTCGA TCTGCTGTAT GGCGTGGTGA ACCCGCGTAT TCGTCATAAG
AAGTAA
 
Protein sequence
MPLRAAPSGS NTSFPARPGR ELICEQYRHA VPGCGSLQRI RVMLQFILRR LGLVIPTFIG 
ITLLTFAFVH MIPGDPVMIM AGERGISPER HAQLLAELGL DKPMWQQYLH YIWGVMHGDL
GISMKSRIPV WEEFVPRFQA TLELGVCAMI FATAVGIPVG VLAAVKRGSI FDHTAVGLAL
TGYSMPIFWW GMMLIMLVSV HWNLTPVSGR VSDMVFLDDS NPLTGFMLID TAIWGEDGNF
IDAVAHMILP AIVLGTIPLA VIVRMTRSSM LEVLGEDYIR TARAKGLTRM RVIIVHALRN
AMLPVVTVIG LQVGTLLAGA ILTETIFSWP GLGRWLIDAL QRRDYPVVQG GVLLVATMII
LVNLLVDLLY GVVNPRIRHK K