Gene EcSMS35_3940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3940 
SymbollldP 
ID6143557 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4018766 
End bp4020421 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content55% 
IMG OID641618766 
ProductL-lactate permease 
Protein accessionYP_001745905 
Protein GI170681119 
COG category[C] Energy production and conversion 
COG ID[COG1620] L-lactate permease 
TIGRFAM ID[TIGR00795] L-lactate transport 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTCT GGCAACAAAA CTACGATCCC GCCGGGAATA TCTGGCTTTC CAGCCTGATA 
GCATCGCTTC CCATCCTGTT TTTCTTCTTT GCGCTGATTA AGCTCAAACT GAAAGGATAC
GTCGCCGCCT CATGGACGGT GGCAATCGCC CTTGCCGTGG CTTTGCTGTT CTATAAAATG
CCGGTCGCTA ACGCGCTGGC CTCGGTGGTT TATGGCTTCT TCTACGGGTT GTGGCCCATC
GCGTGGATCA TTATTGCAGC GGTGTTCGTC TATAAGATCT CGGTGAAAAC CGGGCAGTTT
GACATCATCC GCTCGTCTAT TCTTTCGATA ACCCCTGACC AGCGCCTGCA AATGCTGATT
GTCGGTTTCT GTTTCGGCGC TTTCCTTGAA GGAGCCGCAG GCTTTGGCGC ACCGGTAGCA
ATTACCGCCG CATTGCTGGT CGGTCTGGGC TTTAAACCAC TCTACGCCGC CGGACTGTGC
CTGATTGTTA ACACCGCACC AGTGGCATTT GGCGCGATGG GTATTCCGAT TCTGGTCGCC
GGGCAGGTAA CAGGCATCGA CAGCTTTGAG ATTGGTCAGA TGGTGGGGCG TCAGCTGCCG
TTTATGACCA TTATCGTGCT GTTCTGGATC ATGGCGATTA TGGACGGTTG GCGCGGTATC
AAAGAGACGT GGCCTGCGGT CGTGGTTGCA GGTGGTTCGT TTGCCATTGC TCAATACCTC
AGCTCTAACT TCATTGGGCC GGAACTGCCA GACATTATCT CTTCGCTGGT ATCACTGCTC
TGCCTGACGC TGTTCCTCAA ACGCTGGCAA CCAGTGCGCG TATTCCGCTT CGGTGACTTA
GGGGCGTCAC AGGTTGATAT GACGCTGGCT CACACAGGTT ACACGGCAGG TCAGGTGCTG
CGCGCCTGGA CACCGTTCCT GTTCCTGACC GCCACCGTAA CGCTGTGGAG TATCCCGCCG
TTTAAAGCCC TGTTCGCTTC AGGCGGTGCG CTGTATGAGT GGGTGATCAA CATTCCGGTG
CCGTACCTCG ATAAACTGGT TGCCCGTATG CCGCCAGTGG TCAGCGAAGC GACAGCGTAT
GCCGCCGTGT TTAAGTTTGA CTGGTTCTCT GCCACCGGTA CCGCCATTCT GTTTGCTGCC
CTGCTCTCGA TTGTCTGGCT GAAGATGAAA CCATCTGACG CTATCAGCAC CTTCGGCAGC
ACGCTGAAAG AACTGGCTCT GCCCATCTAC TCCATCGGTA TGGTGCTGGC ATTCGCCTTT
ATCTCGAACT ATTCCGGTTT GTCATCAACA CTGGCGCTGG CACTGGCGCA CACCGGTCAT
GCATTCACCT TCTTCTCGCC GTTCCTCGGC TGGCTTGGGG TATTCCTGAC CGGGTCGGAT
ACGTCATCTA ATGCTTTGTT TGCCGCTTTG CAGGCTACCG CCGCACAACA AATTGGCGTT
TCTGACCTGT TGTTGGTTGC CGCCAATACC ACCGGTGGCG TCACCGGTAA GATGATCTCC
CCGCAATCTA TCGCTATCGC CTGTGCGGCG GTAGGCCTGG TGGGCAAAGA GTCTGATTTG
TTCCGCTTTA CTGTCAAACA CAGCCTGATC TTCACCTGTA TGGTGGGCGT GATCACCACG
CTTCAGGCCT ATGTCTTAAC GTGGATGATT CCTTAA
 
Protein sequence
MNLWQQNYDP AGNIWLSSLI ASLPILFFFF ALIKLKLKGY VAASWTVAIA LAVALLFYKM 
PVANALASVV YGFFYGLWPI AWIIIAAVFV YKISVKTGQF DIIRSSILSI TPDQRLQMLI
VGFCFGAFLE GAAGFGAPVA ITAALLVGLG FKPLYAAGLC LIVNTAPVAF GAMGIPILVA
GQVTGIDSFE IGQMVGRQLP FMTIIVLFWI MAIMDGWRGI KETWPAVVVA GGSFAIAQYL
SSNFIGPELP DIISSLVSLL CLTLFLKRWQ PVRVFRFGDL GASQVDMTLA HTGYTAGQVL
RAWTPFLFLT ATVTLWSIPP FKALFASGGA LYEWVINIPV PYLDKLVARM PPVVSEATAY
AAVFKFDWFS ATGTAILFAA LLSIVWLKMK PSDAISTFGS TLKELALPIY SIGMVLAFAF
ISNYSGLSST LALALAHTGH AFTFFSPFLG WLGVFLTGSD TSSNALFAAL QATAAQQIGV
SDLLLVAANT TGGVTGKMIS PQSIAIACAA VGLVGKESDL FRFTVKHSLI FTCMVGVITT
LQAYVLTWMI P