Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1686 |
Symbol | |
ID | 6145053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1689668 |
End bp | 1691218 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641616562 |
Product | putative ABC transporter periplasmic-binding protein yddS precursor |
Protein accession | YP_001743740 |
Protein GI | 170681069 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAGAT CGATTTTGTT TCGTCCTACA TTGCTCGCGA TCGTCCTTGC CACAACAATG CCGGTTGCGC ACGCCGCCGT ACCGAAAGAT ATGCTGGTGA TCGGTAAAGC TGCCGACCCA CAAACCCTCG ACCCAGCGGT GACAATTGAT AATAACGACT GGACAGTGAC CTACCCGTCT TATCAGCGAC TGGTTCAGTA CAAAACGGAC GGTGATAAAG GCTCAACCGA CGTTGAAGGC GATCTGGCAA GTAGCTGGAA AGCGTCTGAC GATCAAAAAG AGTGGACGTT CACCCTGAAA AACGACGCTA AATTTGCCGA TGGCACACCT GTCACTGCCG AAGCAGTAAA ACTCTCTTTT GAGCGGTTAC TAAAAATCGG CCAGGGGCCA GCAGAAGCAT TTCCCAAAGA TTTAAAGATT GATGCTCCCG ACGAACATAC AGTGAGGTTT ACCCTTAGCC AGCCATTCGC ACCGTTCCTC TACACGCTGG CGAATGACGG TGCATCCATT ATCAATCCGG CGGTCTTAAA GGAGCATGCG GCGGATGATG CCCGTGGTTT CCTCGCGCAA AATACCGCTG GCTCCGGACC GTTTATGCTG AAAAGCTGGC AAAAAGGTCA GCAATTAGTT CTGGTGCCAA ATCCGCATTA CCCCGGCAAT AAACCGAATT TCAAACGGGT ATCGGTAAAA ATTATCGGTG AAAGTGCTTC CCGTCGCCTG CAGCTCTCCC GTGGCGACAT TGACATTGCC GATGCGCTGC CGGTGGATCA ACTCAACGCA CTGAAGCAGG AAAACAAAGT CAATGTGGCA GAGTATCCGT CACTGCGCGT CACCTATCTG TATCTGAATA ACAGCAAAGC GCCACTTAAT CAGGCGGATC TGCGTCGGGC CATTTCCTGG TCTACCGATT ATCAGGGTAT GGTTAACGGC ATTCTGAGTG GTAACGGAAA ACAAATGCGC GGCCCGATTC CGGAAGGCAT GTGGGGATAC GATGCGACGG CAATGCAATA CAACCATGAC GAAACGAAAG CCAAAGCCGA ATGGGATAAA GTGACGAGCA AACCCACCAG CCTGACGTTT CTCTATTCTG ATAATGATCC GAACTGGGAG CCTATTGCTC TGGCGACACA ATCCAGTCTC AACAAGCTGG GCATCAATGT GAAGCTGGAA AAGCTGGCGA ACGCCACCAT GCGCGACAGA GTGGGTAAAG GTGATTACGA CATCGCGATT GGCAACTGGA GTCCGGATTT TGCCGACCCG TATATGTTTA TGAATTACTG GTTTGAGTCC GACAAAAAAG GTCTGCCGGG TAACCGCTCG TTCTATGAAA ACAGTGAGGT CGATAAGTTA CTGCGCAATG CGCTAGCGAC CACCGACCAG ACGCAGCGTA CCCGGGACTA CCAGCAGGCA CAGAAAATCG TCATTGATGA CGCTGCTTAT GTGTATCTGT TCCAGAAAAA CTACCAACTG GCGATGAACA AAGAGGTGAA AGGCTTTGTG TTCAATCCCA TGCTGGAACA GGTCTTCAAT ATCAATACCA TGAGTAAATA A
|
Protein sequence | MKRSILFRPT LLAIVLATTM PVAHAAVPKD MLVIGKAADP QTLDPAVTID NNDWTVTYPS YQRLVQYKTD GDKGSTDVEG DLASSWKASD DQKEWTFTLK NDAKFADGTP VTAEAVKLSF ERLLKIGQGP AEAFPKDLKI DAPDEHTVRF TLSQPFAPFL YTLANDGASI INPAVLKEHA ADDARGFLAQ NTAGSGPFML KSWQKGQQLV LVPNPHYPGN KPNFKRVSVK IIGESASRRL QLSRGDIDIA DALPVDQLNA LKQENKVNVA EYPSLRVTYL YLNNSKAPLN QADLRRAISW STDYQGMVNG ILSGNGKQMR GPIPEGMWGY DATAMQYNHD ETKAKAEWDK VTSKPTSLTF LYSDNDPNWE PIALATQSSL NKLGINVKLE KLANATMRDR VGKGDYDIAI GNWSPDFADP YMFMNYWFES DKKGLPGNRS FYENSEVDKL LRNALATTDQ TQRTRDYQQA QKIVIDDAAY VYLFQKNYQL AMNKEVKGFV FNPMLEQVFN INTMSK
|
| |