Gene EcHS_A1710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1710 
SymboltppB 
ID5594864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1734722 
End bp1736224 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content52% 
IMG OID640920858 
Productputative tripeptide transporter permease 
Protein accessionYP_001458414 
Protein GI157161096 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3104] Dipeptide/tripeptide permease 
TIGRFAM ID[TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.0064967 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCCACTG CAAACCAAAA ACCAACTGAA AGCGTCAGTT TGAACGCTTT CAAACAACCG 
AAGGCGTTCT ATCTCATCTT CTCGATTGAG TTATGGGAAC GTTTTGGTTA TTACGGCCTA
CAAGGAATTA TGGCTGTTTA CCTGGTTAAA CAACTGGGTA TGTCTGAAGC GGATTCAATC
ACCCTTTTCT CTTCCTTTAG TGCCCTGGTT TATGGTCTGG TCGCTATCGG CGGCTGGTTA
GGTGACAAGG TACTGGGTAC TAAACGCGTA ATTATGCTCG GCGCTATTGT GCTGGCGATT
GGTTATGCTC TGGTTGCCTG GTCTGGTCAC GACGCCGGTA TCGTTTATAT GGGTATGGCG
GCTATTGCGG TCGGTAACGG CCTGTTTAAA GCTAACCCGT CTTCTCTGCT TTCTACATGC
TATGAGAAAA ACGACCCGCG TCTGGACGGT GCATTCACCA TGTACTACAT GTCCGTCAAC
ATCGGCTCTT TCTTCTCTAT GATTGCTACG CCGTGGCTGG CCGCGAAATA CGGCTGGAGT
GTTGCGTTTG CGTTGAGCGT TGTAGGCCTG CTGATCACTA TCGTTAACTT CGCCTTCTGC
CAACGCTGGG TTAAACAGTA CGGTTCAAAA CCAGACTTCG AGCCTATCAA CTACCGTAAC
CTGCTGCTGA CCATTATTGG TGTTGTGGCA CTGATCGCTA TCGCCACCTG GCTGCTGCAC
AATCAGGAAG TTGCGCGTAT GGCGCTGGGC GTTGTTGCCT TCGGTATCGT GGTTATCTTC
GGTAAAGAAG CCTTCACGAT GAAAGGTGCT GCGCGTCGTA AAATGATCGT TGCCTTCATC
CTGATGCTCG AAGCCATTAT CTTCTTCGTG CTGTACAGCC AGATGCCAAC GTCACTGAAC
TTCTTTGCGA TTCGTAACGT TGAGCACTCC ATTCTGGGTC TGGCCGTAGA ACCTGAGCAG
TATCAGGCAC TGAACCCGTT CTGGATCATC ATCGGTAGTC CGATTCTGGC CGCTATCTAT
AACAAGATGG GCGATACCCT GCCGATGCCA ACCAAGTTTG CAATCGGCAT GGTGATGTGT
TCTGGTGCGT TCCTGATTCT GCCGCTGGGT GCGAAATTCG CGTCTGACGC TGGTATCGTG
TCTGTAAGCT GGCTGGTCGC AAGCTATGGC CTGCAGAGCA TCGGGGAACT GATGATCTCT
GGTCTGGGTC TGGCAATGGT TGCTCAACTC GTTCCGCAGC GTCTGATGGG CTTCATTATG
GGTAGCTGGT TCCTGACCAC TGCCGGTGCA AACCTGATTG GTGGTTATGT TGCGGGTATG
ATGGCTGTGC CGGATAACGT TACCGATCCG CTGATGTCAC TGGAAGTCTA TGGTCGCGTA
TTCTTGCAGA TTGGTGTCGC TACTGCCGTT ATTGCAGTAC TGATGCTGCT GACCGCGCCG
AAACTGCACC GCATGACGCA GGATGACGCT GCAGACAAAG CGGCGAAAGC AGCCGTAGCG
TAA
 
Protein sequence
MSTANQKPTE SVSLNAFKQP KAFYLIFSIE LWERFGYYGL QGIMAVYLVK QLGMSEADSI 
TLFSSFSALV YGLVAIGGWL GDKVLGTKRV IMLGAIVLAI GYALVAWSGH DAGIVYMGMA
AIAVGNGLFK ANPSSLLSTC YEKNDPRLDG AFTMYYMSVN IGSFFSMIAT PWLAAKYGWS
VAFALSVVGL LITIVNFAFC QRWVKQYGSK PDFEPINYRN LLLTIIGVVA LIAIATWLLH
NQEVARMALG VVAFGIVVIF GKEAFTMKGA ARRKMIVAFI LMLEAIIFFV LYSQMPTSLN
FFAIRNVEHS ILGLAVEPEQ YQALNPFWII IGSPILAAIY NKMGDTLPMP TKFAIGMVMC
SGAFLILPLG AKFASDAGIV SVSWLVASYG LQSIGELMIS GLGLAMVAQL VPQRLMGFIM
GSWFLTTAGA NLIGGYVAGM MAVPDNVTDP LMSLEVYGRV FLQIGVATAV IAVLMLLTAP
KLHRMTQDDA ADKAAKAAVA