Gene EcSMS35_4044 details


Gene Information

Locus tag: EcSMS35_4044
Symbol:
ID: 6145768
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4134658
End bp: 4136373
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 53%
IMG OID: 641618869
Product: putative symporter YidK
Protein accession: YP_001746007
Protein GI: 170680795
COG category: [R] General function prediction only
COG ID: [COG4146] Predicted symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family
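
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 4134658
END_BP = 4136373


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1716
print(protein_length(length_bp))  # 571
```

Both results agree with the Gene Length (1716 bp) and Protein Length (571 aa) fields listed in the record.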


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.978949
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTCGT TACAAATCTT GAGTTTTGTC GGTTTTACGC TGCTGGTGGC GATCATCACC 
TGGTGGAAGG TTCGCAAAAC AGATACCGGA TCGCAACAAG GCTATTTTCT TGCCGGACGT
TCACTAAAAG CGCCGGTTAT TGCCGCTTCG TTAATGCTAA CCAACCTTTC CACGGAACAA
CTGGTTGGAC TTTCCGGGCA GGCCTACAAA AGCGGCATGT CGGTGATGGG CTGGGAAGTC
ACTTCTGCGG TGACGCTGAT CTTCCTCGCG CTAATCTTTT TACCGCGCTA TCTGAAGCGT
GGCATTGCCA CCATCCCCGA TTTCCTGGAG GAACGTTATG ATAAAACGAC GCGTATTATC
ATCGACTTCT GCTTCCTCAT TGCCACCGGT GTCTGCTTTC TGCCGATTGT TCTCTACTCC
GGCGCGTTGG CGCTCAACAG CCTGTTTCAC GTCGGGGAAT CGTTACAAAT TTCCCACGGT
GCGGCTATCT GGCTATTGGT AATTTTGCTT GGTCTGGCGG GAATTTTGTA TGCGGTGATC
GGCGGACTGC GCGCAATGGC AGTGGCGGAC TCCATCAACG GTATTGGGCT GGTTATTGGC
GGGTTGATGG TGCCGATATT TGGCCTGATC GCGATGGGCA AGGGCAGCTT TATGCAGGGC
ATTGAGCAAC TCACCACCGT TCACGCCGAG AAATTAAACT CAGTCGGTGG CCCGACCGAT
CCCTTGCCGA TTGGCGCGGC ATTTACCGGT TTGATTCTGG TGAACACCTT TTACTGGTGT
ACAAATCAGG GCATCGTGCA ACGCACGCTG GCGTCAAAAA GCCTGGCGGA AGGGCAAAAG
GGGGCGCTGT TAACGGCGGT GCTGAAAATG CTCGACCCGC TGGTACTGGT GCTGCCAGGG
TTGATTGCGT TTCATCTGTA TCAGGATTTA CCTAAAGCCG ACATGGCCTA CCCGACGCTG
GTCAATAACG TTCTGCCAGT GCCACTGGTG GGTTTCTTCG GCGCGGTGTT ATTTGGTGCG
GTGATCAGTA CCTTCAACGG CTTTCTGAAT AGCGCCAGTA CGTTATTCAG TATGGGTATT
TACCGGCGCA TCATTAACCA GAATGCCGAG CCGCAGCAGC TGGTCACCGT TGGGCGCAAA
TTTGGTTTCT TTATCGCCAT CGTTTCGGTG CTGGTCGCGC CGTGGATCGC CAACGCGCCG
CAGGGGCTGT ATAGCTGGAT GAAACAGCTC AACGGTATTT ACAACGTGCC GCTGGTTACC
ATCATCATTA TGGGCTTTTT CTTCCCGCGC ATCCCGGCGC TGGCGGCAAA AGTGGCGATG
GGGATTGGCA TAATCAGCTA CATCACCATC AACTATCTGG TGAAGTTCGA CTTCCATTTC
CTCTATGTGC TGGCCTGTAC GTTCTGCATC AACGTGGTCG TGATGCTGGT GATCGGTTTT
ATCAAACCGC GCGCCACGCC GTTCACCTTC AAAGATGCGT TTGCGGTGGA CATGAAACCG
TGGAAAAACG TCAAGATCGC GTCAATTGGC ATCCTGTTCG CTATGATTGG CGTCTATGCC
GGGCTGGCTG AATTCGGCGG CTACGGTACG CGCTGGTTAG CGATGATCAG TTATTTCATC
GCTGCCGTAG TGATTGTCTA CCTGATTTTT GACAGCTGGC GGCATCGTCA CGACCCAGCC
GTAACCTTTA CTCCCGACGC GAAGGATAGC CTATGA
 
Protein sequence
MNSLQILSFV GFTLLVAIIT WWKVRKTDTG SQQGYFLAGR SLKAPVIAAS LMLTNLSTEQ 
LVGLSGQAYK SGMSVMGWEV TSAVTLIFLA LIFLPRYLKR GIATIPDFLE ERYDKTTRII
IDFCFLIATG VCFLPIVLYS GALALNSLFH VGESLQISHG AAIWLLVILL GLAGILYAVI
GGLRAMAVAD SINGIGLVIG GLMVPIFGLI AMGKGSFMQG IEQLTTVHAE KLNSVGGPTD
PLPIGAAFTG LILVNTFYWC TNQGIVQRTL ASKSLAEGQK GALLTAVLKM LDPLVLVLPG
LIAFHLYQDL PKADMAYPTL VNNVLPVPLV GFFGAVLFGA VISTFNGFLN SASTLFSMGI
YRRIINQNAE PQQLVTVGRK FGFFIAIVSV LVAPWIANAP QGLYSWMKQL NGIYNVPLVT
IIIMGFFFPR IPALAAKVAM GIGIISYITI NYLVKFDFHF LYVLACTFCI NVVVMLVIGF
IKPRATPFTF KDAFAVDMKP WKNVKIASIG ILFAMIGVYA GLAEFGGYGT RWLAMISYFI
AAVVIVYLIF DSWRHRHDPA VTFTPDAKDS L
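
The gene and protein sequences above can be cross-checked against each other by translating the nucleotide blocks. The following is a rough sketch, not the database's own method: it builds the codon table in the standard NCBI ordering (the amino-acid assignments of translation table 11 are the same as the standard code; only the permitted start codons differ, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 36 bases of the gene sequence shown above.
print(translate("ATGAATTCGT TACAAATCTT GAGTTTTGTC"))         # MNSLQILSFV
print(translate("GTAACCTTTA CTCCCGACGC GAAGGATAGC CTATGA"))  # VTFTPDAKDSL
```

The two fragments match the first and last residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.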