Gene EcSMS35_2906 details


Gene Information

Locus tag: EcSMS35_2906
Symbol: scrY
ID: 6146601
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2978525
End bp: 2980042
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 47%
IMG OID: 641617775
Product: sucrose porin ScrY
Protein accession: YP_001744930
Protein GI: 170682892
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4580] Maltoporin (phage lambda and maltose receptor)
TIGRFAM ID:
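
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(2978525, 2980042)
print(length, protein_length(length))  # 1518 505
```

Both values agree with the Gene Length and Protein Length fields of this record.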


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00076782
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATAAAA AAACAACTTT GGCAATGTTA ATTGCTTTGC TGACCGGTGC TACAACGGTA 
CATGCGCAAA CGGATATTAG CAGTATCGAA TCTCGACTGG CGGCGTTGGA ACAACGTTTA
AAAAATGCGG AATCCCGCGC CCAGGCGGCA GAAGCAAGGG CCAAAACAGC TGAATTACAG
GTTCAGAAAC TGGCTGAAAC ACAACAACAA AATCAGCTAA CAACTCAAGA AGTAGCACAG
AGAACAGTTC AGCTCGAACA GAAATCCGCA GAAAACAGTG GTTTTGAGTT TCATGGCTAT
GCCCGTTCCG GGTTACTGAT GAATGATGCC GCTTCCAGTA GCAAAAGTGG GCCGTATCTG
ACTCCCGCAG GTGAAACTGG TGGAGCTGTT GGCCGTCTGG GAAATGAAGC CGATACCTAT
GTCGAGTTAA ATGTAGAACA TAAACAAACA CTAGATAACG GTGCGACCAC ACGCTTTAAA
GCAATGTTGG CTGACGGACA AAGAGATTAC AACGACTGGA CTGGCGGCTC CAGTAACCTG
AATATCCGAC AGGCTTTTGC CGAACTGGGC GCATTACCAA GTTTTACCGG AGCATTCAAA
GACAGTACTG TCTGGGCTGG TAAACGCTTT GATCGCGACA ATTTTGATAT TCACTGGTTA
GACTCCGATG TCGTATTTTT AGCGGGAACG GGCGGCGGTA TCTATGACGT AAAATGGAAC
GATACATTCC GCAGTAACTT TTCTCTCTAC GGACGTAATT TCGGCGATCT TGATGATATC
GACAATAACG TTCAGAACTA CATCCTCACC ATGAATCATT ATGCAGGCCC CTTCCAGTTG
ATGGTTAGCG GATTACGGGC AAAAGATAAT GATGATCGAA AAGATGCCAA TGGTGATCTC
ATTCAAACTG ATGCTGCAAA TACTGGCGTA CATGCGTTAG TTGGTCTGCA CAATGACACT
TTCTATGGCC TGCGTGAAGG GACGGCAAAA ACAGCACTGC TATATGGCCA TGGCCTGGGT
GCGGAAGTCA AAGGGATTGG CTCCGATGGC GCTCTGCTGT CTGAGGCGAA TACCTGGCGC
TTCGCATCTT ACGGCACAAC ACCTCTGGGA AGCGGTTGGT ATGTTGCGCC AGCAATTCTC
GCACAAAGCA GTAAAGATCG TTACGTCAAA GGCGATAGCT ACGAATGGGT GACCTTCAAT
ACACGTCTGA TCAAAGAGGT AACACAGAAT TTTGCTCTGG CCTTTGAGGG TAGCTATCAA
TATATGGATC TGAAGCCAAA GGGGTATCAA AACCACAACG CCGTAAACGG CAGCTTCTAT
AAACTCACCT TTGCTCCAAC TCTAAAAGCT AACGATATCA ATAATTTCTT TAGCCGTCCG
GAGCTTCGCC TGTTTGCCAC CTGGATGGAC TGGAGCAGCA AACTTGATGA TTTTGCCAGC
AATGACGCTT TCGGCAGCAG TGGTTTCAAT ACTGGTGGAG AGTGGAATTT TGGTGTCCAA
ATGGAAACCT GGTTTTAA
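
The gene sequence is displayed in space-separated blocks of ten bases, so it needs cleaning before any computation. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed line of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace from a displayed sequence and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumes an unambiguous DNA string)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence, used here only as sample input.
first_line = "ATGTATAAAA AAACAACTTT GGCAATGTTA ATTGCTTTGC TGACCGGTGC TACAACGGTA "
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to the full sequence, the same two steps would give a value to compare against the GC content field of the record.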
 
Protein sequence
MYKKTTLAML IALLTGATTV HAQTDISSIE SRLAALEQRL KNAESRAQAA EARAKTAELQ 
VQKLAETQQQ NQLTTQEVAQ RTVQLEQKSA ENSGFEFHGY ARSGLLMNDA ASSSKSGPYL
TPAGETGGAV GRLGNEADTY VELNVEHKQT LDNGATTRFK AMLADGQRDY NDWTGGSSNL
NIRQAFAELG ALPSFTGAFK DSTVWAGKRF DRDNFDIHWL DSDVVFLAGT GGGIYDVKWN
DTFRSNFSLY GRNFGDLDDI DNNVQNYILT MNHYAGPFQL MVSGLRAKDN DDRKDANGDL
IQTDAANTGV HALVGLHNDT FYGLREGTAK TALLYGHGLG AEVKGIGSDG ALLSEANTWR
FASYGTTPLG SGWYVAPAIL AQSSKDRYVK GDSYEWVTFN TRLIKEVTQN FALAFEGSYQ
YMDLKPKGYQ NHNAVNGSFY KLTFAPTLKA NDINNFFSRP ELRLFATWMD WSSKLDDFAS
NDAFGSSGFN TGGEWNFGVQ METWF