Gene EcSMS35_1565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1565 
SymboltppB 
ID6146773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1548584 
End bp1550086 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content52% 
IMG OID641616442 
Productputative tripeptide transporter permease 
Protein accessionYP_001743620 
Protein GI170682111 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3104] Dipeptide/tripeptide permease 
TIGRFAM ID[TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCACTG CAAACCAAAA ACCAACTGAA AGCGTCAGTT TGAACGCTTT CAAACAACCG 
AAGGCGTTCT ATCTCATCTT CTCGATTGAG TTATGGGAAC GTTTTGGTTA TTACGGCCTA
CAAGGAATTA TGGCTGTTTA CCTGGTTAAA CAACTGGGTA TGTCTGAAGC GGATTCAATC
ACCCTTTTCT CTTCCTTTAG TGCCCTGGTT TATGGTCTGG TCGCTATCGG CGGCTGGTTA
GGTGACAAGG TACTGGGTAC TAAACGCGTA ATTATGCTTG GCGCTATTGT GCTGGCGATT
GGTTATGCTC TGGTTGCCTG GTCTGGTCAC GACGCCGGTA TCGTTTATAT GGGTATGGCG
GCTATTGCGG TCGGTAACGG CCTGTTTAAA GCTAACCCGT CTTCTCTGCT TTCTACATGC
TATGAGAAAA ACGACCCGCG TCTGGACGGT GCATTCACCA TGTACTACAT GTCCGTCAAC
ATCGGCTCTT TCTTCTCTAT GATCGCCACC CCGTGGCTGG CCGCGAAATA CGGCTGGAGT
GTTGCGTTTG CGTTGAGCGT TGTAGGCCTG CTGATCACTA TCGTTAACTT CGCCTTCTGC
CAACGCTGGG TTAAACAGTA CGGTTCAAAA CCAGACTTCG AGCCTATCAA CTACCGTAAC
CTGCTGCTGA CCATTATTGG TGTTGTGGCA CTGATCGCTA TCGCCACCTG GCTGCTGCAC
AATCAGGAAG TTGCGCGTAT GGCGCTGGGC GTTGTTGCCT TCGGTATTGT GGTTATCTTC
GGTAAAGAAG CCTTCGCGAT GAAAGGTGCT GCGCGTCGTA AAATGATCGT TGCCTTCATC
CTGATGCTCG AAGCCATTAT CTTCTTCGTG CTGTACAGCC AGATGCCAAC GTCACTGAAC
TTCTTTGCGA TTCGTAACGT TGAGCACTCC ATTCTGGGTC TGGCCGTAGA ACCTGAGCAG
TATCAGGCAC TGAACCCGTT CTGGATCATC ATCGGTAGTC CGATTCTGGC CGCTATCTAT
AACAAGATGG GCGATACCCT GCCGATGCCA ACCAAGTTTG CAATCGGCAT GGTGATGTGT
TCTGGTGCGT TCCTGATTCT GCCGCTGGGT GCGAAATTCG CGTCTGACGC TGGTATCGTG
TCTGTAAGCT GGCTGGTCGC GAGCTATGGC CTGCAGAGTA TCGGGGAACT GATGATCTCT
GGTCTGGGTC TGGCAATGGT CGCACAACTC GTTCCGCAGC GTCTGATGGG CTTCATTATG
GGTAGCTGGT TCCTGACCAC TGCCGGTGCA AACCTGATTG GTGGTTATGT TGCGGGTATG
ATGGCTGTGC CGGATAACGT TACCGATCCG CTGATGTCAC TGGAAGTCTA TGGTCGCGTA
TTCTTGCAGA TTGGTGTCGC TACTGCCGTT ATTGCAGTAC TGATGCTGCT GACCGCGCCG
AAACTGCACC GCATGACGCA GGATGACGCT GCAGACAAAG CGGCGAAAGC TGCTGTAGCG
TAA
 
Protein sequence
MSTANQKPTE SVSLNAFKQP KAFYLIFSIE LWERFGYYGL QGIMAVYLVK QLGMSEADSI 
TLFSSFSALV YGLVAIGGWL GDKVLGTKRV IMLGAIVLAI GYALVAWSGH DAGIVYMGMA
AIAVGNGLFK ANPSSLLSTC YEKNDPRLDG AFTMYYMSVN IGSFFSMIAT PWLAAKYGWS
VAFALSVVGL LITIVNFAFC QRWVKQYGSK PDFEPINYRN LLLTIIGVVA LIAIATWLLH
NQEVARMALG VVAFGIVVIF GKEAFAMKGA ARRKMIVAFI LMLEAIIFFV LYSQMPTSLN
FFAIRNVEHS ILGLAVEPEQ YQALNPFWII IGSPILAAIY NKMGDTLPMP TKFAIGMVMC
SGAFLILPLG AKFASDAGIV SVSWLVASYG LQSIGELMIS GLGLAMVAQL VPQRLMGFIM
GSWFLTTAGA NLIGGYVAGM MAVPDNVTDP LMSLEVYGRV FLQIGVATAV IAVLMLLTAP
KLHRMTQDDA ADKAAKAAVA