Gene EcSMS35_1686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1686 
Symbol 
ID6145053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1689668 
End bp1691218 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content50% 
IMG OID641616562 
Productputative ABC transporter periplasmic-binding protein yddS precursor 
Protein accessionYP_001743740 
Protein GI170681069 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAGAT CGATTTTGTT TCGTCCTACA TTGCTCGCGA TCGTCCTTGC CACAACAATG 
CCGGTTGCGC ACGCCGCCGT ACCGAAAGAT ATGCTGGTGA TCGGTAAAGC TGCCGACCCA
CAAACCCTCG ACCCAGCGGT GACAATTGAT AATAACGACT GGACAGTGAC CTACCCGTCT
TATCAGCGAC TGGTTCAGTA CAAAACGGAC GGTGATAAAG GCTCAACCGA CGTTGAAGGC
GATCTGGCAA GTAGCTGGAA AGCGTCTGAC GATCAAAAAG AGTGGACGTT CACCCTGAAA
AACGACGCTA AATTTGCCGA TGGCACACCT GTCACTGCCG AAGCAGTAAA ACTCTCTTTT
GAGCGGTTAC TAAAAATCGG CCAGGGGCCA GCAGAAGCAT TTCCCAAAGA TTTAAAGATT
GATGCTCCCG ACGAACATAC AGTGAGGTTT ACCCTTAGCC AGCCATTCGC ACCGTTCCTC
TACACGCTGG CGAATGACGG TGCATCCATT ATCAATCCGG CGGTCTTAAA GGAGCATGCG
GCGGATGATG CCCGTGGTTT CCTCGCGCAA AATACCGCTG GCTCCGGACC GTTTATGCTG
AAAAGCTGGC AAAAAGGTCA GCAATTAGTT CTGGTGCCAA ATCCGCATTA CCCCGGCAAT
AAACCGAATT TCAAACGGGT ATCGGTAAAA ATTATCGGTG AAAGTGCTTC CCGTCGCCTG
CAGCTCTCCC GTGGCGACAT TGACATTGCC GATGCGCTGC CGGTGGATCA ACTCAACGCA
CTGAAGCAGG AAAACAAAGT CAATGTGGCA GAGTATCCGT CACTGCGCGT CACCTATCTG
TATCTGAATA ACAGCAAAGC GCCACTTAAT CAGGCGGATC TGCGTCGGGC CATTTCCTGG
TCTACCGATT ATCAGGGTAT GGTTAACGGC ATTCTGAGTG GTAACGGAAA ACAAATGCGC
GGCCCGATTC CGGAAGGCAT GTGGGGATAC GATGCGACGG CAATGCAATA CAACCATGAC
GAAACGAAAG CCAAAGCCGA ATGGGATAAA GTGACGAGCA AACCCACCAG CCTGACGTTT
CTCTATTCTG ATAATGATCC GAACTGGGAG CCTATTGCTC TGGCGACACA ATCCAGTCTC
AACAAGCTGG GCATCAATGT GAAGCTGGAA AAGCTGGCGA ACGCCACCAT GCGCGACAGA
GTGGGTAAAG GTGATTACGA CATCGCGATT GGCAACTGGA GTCCGGATTT TGCCGACCCG
TATATGTTTA TGAATTACTG GTTTGAGTCC GACAAAAAAG GTCTGCCGGG TAACCGCTCG
TTCTATGAAA ACAGTGAGGT CGATAAGTTA CTGCGCAATG CGCTAGCGAC CACCGACCAG
ACGCAGCGTA CCCGGGACTA CCAGCAGGCA CAGAAAATCG TCATTGATGA CGCTGCTTAT
GTGTATCTGT TCCAGAAAAA CTACCAACTG GCGATGAACA AAGAGGTGAA AGGCTTTGTG
TTCAATCCCA TGCTGGAACA GGTCTTCAAT ATCAATACCA TGAGTAAATA A
 
Protein sequence
MKRSILFRPT LLAIVLATTM PVAHAAVPKD MLVIGKAADP QTLDPAVTID NNDWTVTYPS 
YQRLVQYKTD GDKGSTDVEG DLASSWKASD DQKEWTFTLK NDAKFADGTP VTAEAVKLSF
ERLLKIGQGP AEAFPKDLKI DAPDEHTVRF TLSQPFAPFL YTLANDGASI INPAVLKEHA
ADDARGFLAQ NTAGSGPFML KSWQKGQQLV LVPNPHYPGN KPNFKRVSVK IIGESASRRL
QLSRGDIDIA DALPVDQLNA LKQENKVNVA EYPSLRVTYL YLNNSKAPLN QADLRRAISW
STDYQGMVNG ILSGNGKQMR GPIPEGMWGY DATAMQYNHD ETKAKAEWDK VTSKPTSLTF
LYSDNDPNWE PIALATQSSL NKLGINVKLE KLANATMRDR VGKGDYDIAI GNWSPDFADP
YMFMNYWFES DKKGLPGNRS FYENSEVDKL LRNALATTDQ TQRTRDYQQA QKIVIDDAAY
VYLFQKNYQL AMNKEVKGFV FNPMLEQVFN INTMSK