Gene EcSMS35_3115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3115 
SymbollacY 
ID6146325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3202040 
End bp3203317 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content46% 
IMG OID641617982 
Productgalactoside permease 
Protein accessionYP_001745132 
Protein GI170683854 
COG category 
COG ID 
TIGRFAM ID[TIGR00882] oligosaccharide:H+ symporter 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.0255021 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCTG CAAGTACGCA TAAAAATCCC GATTTCTGGA TTTTCGGTCT GTTCTTCTTT 
CTCTACTTTT TCATCATGGC AACCTGTTTT CCGTTTTTGC CTGTATGGCT GTCCGATGTG
GTCGGACTGA GTAAAACGGA TACAGGTATA GTCTTTTCAT GTCTTTCTCT GTTTGCCATC
AGTTTCCAGC CATTGCTTGG GGTCATATCA GATCGCCTGG GACTGAAAAA AAACCTGATC
TGGAGTATCA GCCTGTTACT TGTATTTTTC GCCCCCTTTT TTTTATATGT ATTTGCGCCC
CTGCTGCGCT TCAATATCTG GGCAGGCGCA CTGACTGGCG GTGTCTTTAT TGGTTTTGTG
TTTTCTGCAG GTGCCGGAGC TATTGAAGCT TATATAGAGC GGGTCAGTCG CAGTCGTGGA
TTTGAATACG GTAAAGCGAG GATGTTCGGG TGTCTGGGCT GGGCGTTATG TGCGGCTATG
GCTGGAATGC TTTTTAATGT CGATCCTTCT CTGGTTTTCT GGATGGGGTC AGGAAGCGCA
TTATTGTTGC TTCTTCTGTT GTTTCTGGCG CGCCCCAGTA CCAGCCAGAC GGCAATGGTT
ATGAATACAC TGGGTGCCAA TTCTTCCCTG ATTTCGACCA GAATGGTCTT CAGCCTGTTT
CGCATGCGTC AGATGTGGAT GTTTGTTCTC TACACGATTG GTGTGGCCTG TGTCTATGAT
GTATTTGATC AGCAGTTTGC CACATTTTTT CGTTCATTCT TTGACACTCC TCAGGCAGGA
ATAAAAGCAT TCGGATTTGC TACCACTGCG GGGGAGATTT GTAATGCCAT TATCATGTTC
TGTACACCAT GGATAATTCA TCGCATTGGT GCCAAAAATA CCCTGCTTGT TGCGGGGGGA
ATTATGACCA TCCGCATTAC CGGTTCTGCT TTTGCCACCA CCGCGACAGA AGTGGTGATT
CTGAAAATGC TTCACGCTCT TGAAGTTCCA TTTTTGCTGG TTGGGGCGTT CAAATATATT
ACGGCAGTGT TTGACACCCG ACTGTCAGCG ACCGTTTATT TAATAGGTTT TCAGTTTTCC
AAACAACTTG CTGCAATACT TCTCTCTACC TTTGCCGGCC ACCTGTATGA TCGTATGGGA
TTCCAGAATA CGTATTTTGT GCTCGGGATG ATTGCTCTGA CTGTTACCGT GATATCAGTT
TTCACGCTGA GTTCTTCCCG CGGGAGCGTA CACCCTTCTG TAGAAAAAGC CCCTGCAGCG
CATTCGGAGA TTAACTGA
 
Protein sequence
MNSASTHKNP DFWIFGLFFF LYFFIMATCF PFLPVWLSDV VGLSKTDTGI VFSCLSLFAI 
SFQPLLGVIS DRLGLKKNLI WSISLLLVFF APFFLYVFAP LLRFNIWAGA LTGGVFIGFV
FSAGAGAIEA YIERVSRSRG FEYGKARMFG CLGWALCAAM AGMLFNVDPS LVFWMGSGSA
LLLLLLLFLA RPSTSQTAMV MNTLGANSSL ISTRMVFSLF RMRQMWMFVL YTIGVACVYD
VFDQQFATFF RSFFDTPQAG IKAFGFATTA GEICNAIIMF CTPWIIHRIG AKNTLLVAGG
IMTIRITGSA FATTATEVVI LKMLHALEVP FLLVGAFKYI TAVFDTRLSA TVYLIGFQFS
KQLAAILLST FAGHLYDRMG FQNTYFVLGM IALTVTVISV FTLSSSRGSV HPSVEKAPAA
HSEIN