Gene EcSMS35_1577 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1577 
SymbolmalY 
ID6143442 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1560535 
End bp1561707 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content52% 
IMG OID641616454 
Productmaltose regulon modulator MalY 
Protein accessionYP_001743632 
Protein GI170682436 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1168] Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGATT TTTCAAAGGT CGTGGATCGT CATGGCACAT GGTGTACGCA GTGGGATTAT 
GTCGCCGACC GTTTCGGCAC TGCTGACCTG TTACCGTTCA CGATTTCAGA CATGGATTTT
GCCACTGCCC CCTGCATTAT CGAGGCGCTG AATCAGCGTC TGATGCACGG CGTGTTTGGC
TACAGCCGCT GGAAAAACGA TGAGTTTCTC GCGGCTATTG CCCACTGGTT TTCCACCCAG
CATTACACCG CCATCGATCC CCAGTCGGTG GTGTATGGCC CTTCTGTCAT CTATATGGTT
TCAGAACTGA TTCGTCAGTG GTCTGAAACA GGTGAAGGCG TGGTGATCCA CACACCCGCC
TATGACGCAT TTTACAAGGC CATTGAAGGT AACCAGCGCA CAGTAATACC CGTTGCTTTA
GAGAAGCAGG CTGACGGCTG GTTTTGCGAT ATGGGCAAGC TGGAAGCCGT GCTGGCGAAA
CCAGAATGTA AAATTATGCT CCTGTGTAGC CCACAGAATC CTACCGGGAA AGTGTGGACG
TGCGATGAGC TGGAGATCAT GGCTGACCTG TGCGAGCGTC ATGGTGTGCG GGTTATTTCC
GATGAAATCC ATATGGATAT GGTTTGGGGC GAGCAGCCGC ATATTCCATG GTGTAATGTG
GCGCGCGGAG ACTGGGCGTT GCTAACGTCG GGATCAAAAA GTTTCAATAT TCCCGCCCTG
ACCGGTGCTT ACGGGATTAT AGAAAATAGC AGTAGCCGCG ATGCCTATTT ATCGGCACTG
AAAGGACGTG ATGGGCTTTC TTCCCCTTCG GTACTGGCGT TAACTGCTCA TATCGCCGCC
TATCAGCAAG GCGCGCCGTG GCTGGATGCC TTACGCGTCT ATCTGAAAGA TAACCTGACG
TATATCGCAG ATAAAATGAA CGCTGCGTTT CCTGAACTCA ACTGGCAGAT CCCGCAATCC
ACTTATCTGG CCTGGCTTGA TCTACGTCCA TTGAACATTG ACGACAACGC GTTGCAAAAA
GCGCTTATCG AGCAAGAAAA GGTCGCGATC ATGCCGGGAT ATACCTACGG TGAAGAAGGT
CGTGGTTTTG TCCGACTCAA TGCCGGGTGC CCGCGTTCGA AACTGGAAAA AGGTGTTGCT
GGATTAATTA ACGCCATCCG CGCTGTTCGT TAA
 
Protein sequence
MFDFSKVVDR HGTWCTQWDY VADRFGTADL LPFTISDMDF ATAPCIIEAL NQRLMHGVFG 
YSRWKNDEFL AAIAHWFSTQ HYTAIDPQSV VYGPSVIYMV SELIRQWSET GEGVVIHTPA
YDAFYKAIEG NQRTVIPVAL EKQADGWFCD MGKLEAVLAK PECKIMLLCS PQNPTGKVWT
CDELEIMADL CERHGVRVIS DEIHMDMVWG EQPHIPWCNV ARGDWALLTS GSKSFNIPAL
TGAYGIIENS SSRDAYLSAL KGRDGLSSPS VLALTAHIAA YQQGAPWLDA LRVYLKDNLT
YIADKMNAAF PELNWQIPQS TYLAWLDLRP LNIDDNALQK ALIEQEKVAI MPGYTYGEEG
RGFVRLNAGC PRSKLEKGVA GLINAIRAVR