Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4044 |
Symbol | |
ID | 6145768 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4134658 |
End bp | 4136373 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618869 |
Product | putative symporter YidK |
Protein accession | YP_001746007 |
Protein GI | 170680795 |
COG category | [R] General function prediction only |
COG ID | [COG4146] Predicted symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.978949 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTCGT TACAAATCTT GAGTTTTGTC GGTTTTACGC TGCTGGTGGC GATCATCACC TGGTGGAAGG TTCGCAAAAC AGATACCGGA TCGCAACAAG GCTATTTTCT TGCCGGACGT TCACTAAAAG CGCCGGTTAT TGCCGCTTCG TTAATGCTAA CCAACCTTTC CACGGAACAA CTGGTTGGAC TTTCCGGGCA GGCCTACAAA AGCGGCATGT CGGTGATGGG CTGGGAAGTC ACTTCTGCGG TGACGCTGAT CTTCCTCGCG CTAATCTTTT TACCGCGCTA TCTGAAGCGT GGCATTGCCA CCATCCCCGA TTTCCTGGAG GAACGTTATG ATAAAACGAC GCGTATTATC ATCGACTTCT GCTTCCTCAT TGCCACCGGT GTCTGCTTTC TGCCGATTGT TCTCTACTCC GGCGCGTTGG CGCTCAACAG CCTGTTTCAC GTCGGGGAAT CGTTACAAAT TTCCCACGGT GCGGCTATCT GGCTATTGGT AATTTTGCTT GGTCTGGCGG GAATTTTGTA TGCGGTGATC GGCGGACTGC GCGCAATGGC AGTGGCGGAC TCCATCAACG GTATTGGGCT GGTTATTGGC GGGTTGATGG TGCCGATATT TGGCCTGATC GCGATGGGCA AGGGCAGCTT TATGCAGGGC ATTGAGCAAC TCACCACCGT TCACGCCGAG AAATTAAACT CAGTCGGTGG CCCGACCGAT CCCTTGCCGA TTGGCGCGGC ATTTACCGGT TTGATTCTGG TGAACACCTT TTACTGGTGT ACAAATCAGG GCATCGTGCA ACGCACGCTG GCGTCAAAAA GCCTGGCGGA AGGGCAAAAG GGGGCGCTGT TAACGGCGGT GCTGAAAATG CTCGACCCGC TGGTACTGGT GCTGCCAGGG TTGATTGCGT TTCATCTGTA TCAGGATTTA CCTAAAGCCG ACATGGCCTA CCCGACGCTG GTCAATAACG TTCTGCCAGT GCCACTGGTG GGTTTCTTCG GCGCGGTGTT ATTTGGTGCG GTGATCAGTA CCTTCAACGG CTTTCTGAAT AGCGCCAGTA CGTTATTCAG TATGGGTATT TACCGGCGCA TCATTAACCA GAATGCCGAG CCGCAGCAGC TGGTCACCGT TGGGCGCAAA TTTGGTTTCT TTATCGCCAT CGTTTCGGTG CTGGTCGCGC CGTGGATCGC CAACGCGCCG CAGGGGCTGT ATAGCTGGAT GAAACAGCTC AACGGTATTT ACAACGTGCC GCTGGTTACC ATCATCATTA TGGGCTTTTT CTTCCCGCGC ATCCCGGCGC TGGCGGCAAA AGTGGCGATG GGGATTGGCA TAATCAGCTA CATCACCATC AACTATCTGG TGAAGTTCGA CTTCCATTTC CTCTATGTGC TGGCCTGTAC GTTCTGCATC AACGTGGTCG TGATGCTGGT GATCGGTTTT ATCAAACCGC GCGCCACGCC GTTCACCTTC AAAGATGCGT TTGCGGTGGA CATGAAACCG TGGAAAAACG TCAAGATCGC GTCAATTGGC ATCCTGTTCG CTATGATTGG CGTCTATGCC GGGCTGGCTG AATTCGGCGG CTACGGTACG CGCTGGTTAG CGATGATCAG TTATTTCATC GCTGCCGTAG TGATTGTCTA CCTGATTTTT GACAGCTGGC GGCATCGTCA CGACCCAGCC GTAACCTTTA CTCCCGACGC GAAGGATAGC CTATGA
|
Protein sequence | MNSLQILSFV GFTLLVAIIT WWKVRKTDTG SQQGYFLAGR SLKAPVIAAS LMLTNLSTEQ LVGLSGQAYK SGMSVMGWEV TSAVTLIFLA LIFLPRYLKR GIATIPDFLE ERYDKTTRII IDFCFLIATG VCFLPIVLYS GALALNSLFH VGESLQISHG AAIWLLVILL GLAGILYAVI GGLRAMAVAD SINGIGLVIG GLMVPIFGLI AMGKGSFMQG IEQLTTVHAE KLNSVGGPTD PLPIGAAFTG LILVNTFYWC TNQGIVQRTL ASKSLAEGQK GALLTAVLKM LDPLVLVLPG LIAFHLYQDL PKADMAYPTL VNNVLPVPLV GFFGAVLFGA VISTFNGFLN SASTLFSMGI YRRIINQNAE PQQLVTVGRK FGFFIAIVSV LVAPWIANAP QGLYSWMKQL NGIYNVPLVT IIIMGFFFPR IPALAAKVAM GIGIISYITI NYLVKFDFHF LYVLACTFCI NVVVMLVIGF IKPRATPFTF KDAFAVDMKP WKNVKIASIG ILFAMIGVYA GLAEFGGYGT RWLAMISYFI AAVVIVYLIF DSWRHRHDPA VTFTPDAKDS L
|
| |