Gene EcSMS35_1688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1688 
SymbolyddQ 
ID6146099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1692239 
End bp1693135 
Gene Length897 bp 
Protein Length298 aa 
Translation table11 
GC content54% 
IMG OID641616564 
ProductATP-dependent peptide transporter membrane subunit 
Protein accessionYP_001743742 
Protein GI170681230 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.175944 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCTAA GCGAGGAAAC GTCAGCCGTG CGCCCACCAA AACAAACGCG ATTTAACGGT 
GCAAAACTGG TATGGATGCT GAAAGGCAGC CCGCTCACCG TGACCGGCGC AGTCATCATT
GTATTAATGC TATTGATGAT GATTTTTTCA CCGTGGCTGG CGACGCATGA TCCCAACGCC
ATTGATTTAA CCGCCCGCCT TTTGCCGCCT TCTGCGGCGC ACTGGTTTGG CACCGATGAA
GTGGGACGCG ATCTGTTCAG CCGCGTACTG GTCGGCAGTC AGCAATCAAT TCTCGCCGGA
TTAGTGGTGG TCGCCATTGC AGGTATGATG GGGTCGCTAC TCGGGTGTCT ATCCGGTGTG
CTTGGCGGAC GCGCAGACGC GATTATCATG CGCATCATGG ACATTATGCT GTCGATTCCT
TCGCTGGTAC TGACAATGGC ACTGGCAGCC GCTCTCGGGC CGAGTTTGTT TAACGCCATG
CTGGCGATTG CTATTGTACG AATTCCCTTT TATGTGCGCC TGGCGCGGGG GCAAACATTA
GTTGTTCGCC AGTATACCTA TGTTCAGGCG GCGAAAACCT TTGGTGCGTC GCGTTGGCAT
CTGATCAACT GGCATATTTT ACGTAACTCC CTGCCGCCGC TGATTGTGCA GGCATCGCTG
GATATCGGTA GCGCGATTTT AATGGCCGCC ACGTTGGGAT TTATTGGCCT TGGTGCTCAA
CAACCGAGTG CTGAATGGGG GGCGATGGTG GCGAATGGTC GCAACTATGT GCTCGATCAA
TGGTGGTATT GCGCATTTCC GGGGGCAGCG ATTTTGCTTA CCGCCGTCGG GTTTAATCTC
TTTGGCGATG GTATTCGCGA TCTGCTTGAC CCGAAAGCAG GAGGAAAGCA GTCATGA
 
Protein sequence
MMLSEETSAV RPPKQTRFNG AKLVWMLKGS PLTVTGAVII VLMLLMMIFS PWLATHDPNA 
IDLTARLLPP SAAHWFGTDE VGRDLFSRVL VGSQQSILAG LVVVAIAGMM GSLLGCLSGV
LGGRADAIIM RIMDIMLSIP SLVLTMALAA ALGPSLFNAM LAIAIVRIPF YVRLARGQTL
VVRQYTYVQA AKTFGASRWH LINWHILRNS LPPLIVQASL DIGSAILMAA TLGFIGLGAQ
QPSAEWGAMV ANGRNYVLDQ WWYCAFPGAA ILLTAVGFNL FGDGIRDLLD PKAGGKQS