Gene EcolC_1995 details


Gene Information

Locus tag: EcolC_1995
Symbol: tppB
ID: 6068146
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2202404
End bp: 2203906
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 52%
IMG OID: 641601409
Product: putative tripeptide transporter permease
Protein accession: YP_001724968
Protein GI: 170020014
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3104] Dipeptide/tripeptide permease
TIGRFAM ID: [TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial
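
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are inclusive, and that the gene length counts one untranslated stop codon. The variable names are chosen here and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2202404
end_bp = 2203906
gene_length_bp = 1503
protein_length_aa = 500

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: one codon per residue, plus one stop codon that is not translated.
expected_aa = gene_length_bp // 3 - 1

print(span_bp == gene_length_bp)         # coordinates agree with Gene Length
print(expected_aa == protein_length_aa)  # Gene Length agrees with Protein Length
```

Under these assumptions both checks hold for this record: the span is 1503 bp, and 1503 / 3 - 1 gives 500 residues.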


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.46141
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0309863
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCACTG CAAACCAAAA ACCAACTGAA AGCGTCAGTT TGAACGCTTT CAAACAACCG 
AAGGCGTTCT ATCTCATCTT CTCGATTGAG TTATGGGAAC GTTTTGGTTA TTACGGCCTA
CAAGGAATTA TGGCTGTTTA CCTGGTTAAA CAACTGGGTA TGTCTGAAGC GGATTCAATC
ACCCTTTTCT CTTCCTTTAG TGCCCTGGTT TATGGTCTGG TCGCTATCGG CGGCTGGTTA
GGTGACAAGG TACTGGGTAC TAAACGCGTA ATTATGCTCG GCGCTATTGT GCTGGCGATT
GGTTATGCTC TGGTTGCCTG GTCTGGTCAC GACGCCGGTA TCGTTTATAT GGGTATGGCG
GCTATTGCGG TCGGTAACGG CCTGTTTAAA GCTAACCCGT CTTCTCTGCT TTCTACATGC
TATGAGAAAA ACGACCCGCG TCTGGACGGT GCATTCACCA TGTACTACAT GTCCGTCAAC
ATCGGCTCTT TCTTCTCTAT GATTGCTACG CCGTGGCTGG CCGCGAAATA CGGCTGGAGT
GTTGCGTTTG CGTTGAGCGT TGTAGGCCTG CTGATCACTA TCGTTAACTT CGCCTTCTGC
CAACGCTGGG TTAAACAGTA CGGTTCAAAA CCAGACTTCG AGCCTATCAA CTACCGTAAC
CTGCTGCTGA CCATTATTGG TGTTGTGGCA CTGATCGCTA TCGCCACCTG GCTGCTGCAC
AATCAGGAAG TTGCGCGTAT GGCGCTGGGC GTTGTTGCCT TCGGTATCGT GGTTATCTTC
GGTAAAGAAG CCTTCGCGAT GAAAGGTGCT GCGCGTCGTA AAATGATCGT TGCCTTCATC
CTGATGCTCG AAGCCATTAT CTTCTTCGTG CTGTACAGCC AGATGCCAAC GTCACTGAAC
TTCTTTGCGA TTCGTAACGT TGAGCACTCC ATTCTGGGTC TGGCCGTAGA ACCTGAGCAG
TATCAGGCAC TGAACCCGTT CTGGATCATC ATCGGTAGTC CGATTCTGGC CGCTATCTAT
AACAAGATGG GCGATACCCT GCCGATGCCA ACCAAGTTTG CAATCGGCAT GGTGATGTGT
TCTGGTGCGT TCCTGATTCT GCCGCTGGGT GCGAAATTCG CGTCTGACGC TGGTATCGTG
TCTGTAAGCT GGCTGGTCGC AAGCTATGGC CTGCAGAGCA TCGGGGAACT GATGATCTCT
GGTCTGGGTC TGGCAATGGT TGCTCAACTC GTTCCGCAGC GTCTGATGGG CTTCATTATG
GGTAGCTGGT TCCTGACCAC TGCCGGTGCA AACCTGATTG GTGGTTATGT TGCGGGTATG
ATGGCTGTGC CGGATAACGT TACCGATCCG CTGATGTCAC TGGAAGTCTA TGGTCGCGTA
TTCTTGCAGA TTGGTGTCGC TACTGCCGTT ATTGCAGTAC TGATGCTGCT GACCGCGCCG
AAACTGCACC GCATGACGCA GGATGACGCT GCAGACAAAG CGGCGAAAGC AGCCGTAGCG
TAA
 
Protein sequence
MSTANQKPTE SVSLNAFKQP KAFYLIFSIE LWERFGYYGL QGIMAVYLVK QLGMSEADSI 
TLFSSFSALV YGLVAIGGWL GDKVLGTKRV IMLGAIVLAI GYALVAWSGH DAGIVYMGMA
AIAVGNGLFK ANPSSLLSTC YEKNDPRLDG AFTMYYMSVN IGSFFSMIAT PWLAAKYGWS
VAFALSVVGL LITIVNFAFC QRWVKQYGSK PDFEPINYRN LLLTIIGVVA LIAIATWLLH
NQEVARMALG VVAFGIVVIF GKEAFAMKGA ARRKMIVAFI LMLEAIIFFV LYSQMPTSLN
FFAIRNVEHS ILGLAVEPEQ YQALNPFWII IGSPILAAIY NKMGDTLPMP TKFAIGMVMC
SGAFLILPLG AKFASDAGIV SVSWLVASYG LQSIGELMIS GLGLAMVAQL VPQRLMGFIM
GSWFLTTAGA NLIGGYVAGM MAVPDNVTDP LMSLEVYGRV FLQIGVATAV IAVLMLLTAP
KLHRMTQDDA ADKAAKAAVA
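
The gene sequence begins with GTG rather than ATG, while the protein sequence begins with M. This is consistent with translation table 11, where GTG is one of the permitted start codons and is read as methionine at the initiation position. The following is a minimal sketch of that translation, not the database's own pipeline: the names `translate`, `CODON_TABLE`, and `START_CODONS` are hypothetical, and only the first 60 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Amino acid assignments in NCBI codon order (bases T, C, A, G; third base varies fastest).
# Table 11 uses the same assignments as the standard code; it differs in its start codons.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a coding sequence, reading a valid start codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks used in the listing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is translated as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record (60 bases).
first_60 = "GTGTCCACTG CAAACCAAAA ACCAACTGAA AGCGTCAGTT TGAACGCTTT CAAACAACCG"
print(translate(first_60))  # MSTANQKPTESVSLNAFKQP
```

The output matches the first 20 residues of the protein sequence listed above.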