Gene EcHS_A1697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1697 
SymbolmalY 
ID5591856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1722910 
End bp1724082 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content51% 
IMG OID640920845 
ProductMalY protein 
Protein accessionYP_001458401 
Protein GI157161083 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1168] Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones61 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCGATT TTTCAAAGGT CGTGGATCGT CATGGCACAT GGTGTACACA GTGGGATTAT 
GTCGCTGACC GTTTCGGCAC TGCTGACCTG TTACCGTTCA CGATTTCAGA CATGGATTTT
GCCACTGCCC CCTGCATTAT CGAGGCGCTG AATCAGCGCC TGATGCACGG CGTATTTGGC
TACAGCCGCT GGAAAAACGA TGAGTTTCTC GCGGCTATTG CCCACTGGTT TTCCACCCAG
CATTACACCG CCATCGATTC TCAGACGGTG GTGTATGGCC CTTCTGTCAT CTATATGGTT
TCAGAACTGA TTCGTCAGTG GTCTGAAACA GGTGAAGGCG TGGTGATCCA CACACCCGCC
TATGACGCAT TTTACAAGGC CATTGAAGGT AACCAGCGCA CAGTAATGCC CGTTGCTTTA
GAGAAGCAGG CTGATGGTTG GTTTTGCGAT ATGGGCAAGT TGGAAGCCGT GTTGGCGAAA
CCAGAATGTA AAATTATGCT CCTGTGTAGC CCACAGAATC CTACCGGGAA AGTGTGGACG
TGCGATGAGC TGGAGATCAT GGCTGACCTG TGCGAGCGTC ATGGTGTGCG GGTTATTTCC
GATGAAATCC ATATGGATAT GGTTTGGGGC GAGCAGCCGC ATATTCCCTG GAGTAATGTG
GCTCGCGGAG ACTGGGCGTT GCTAACGTCG GGCTCGAAAA GTTTCAATAT TCCCGCCCTG
ACCGGTGCTT ACGGGATTAT AGAAAATAGC AGTAGCCGCG ATGCCTATTT ATCGGCACTG
AAAGGCCGTG ATGGGCTTTC TTCCCCTTCG GTACTGGCGT TAACTGCCCA TATCGCCGCC
TATCAGCAAG GCGCGCCGTG GCTGGATGCC TTACGCATCT ATCTGAAAGA TAACCTGACG
TATATCGCAG ATAAAATGAA CGCCGCGTTT CCTGAACTCA ACTGGCAGAT CCCACAATCC
ACTTATCTGG CATGGCTTGA TTTACGTCCG TTGAATATTG ACGACAACGC GTTGCAAAAA
GCACTTATCG AACAAGAAAA AGTCGCGATC ATGCCGGGGT ATACCTACGG TGAAGAAGGT
CGTGGTTTTG TCCGTCTCAA TGCCGGCTGC CCACGTTCGA AACTGGAAAA AGGTGTGGCT
GGATTAATTA ACGCCATCCG CGCTGTTCGT TAA
 
Protein sequence
MFDFSKVVDR HGTWCTQWDY VADRFGTADL LPFTISDMDF ATAPCIIEAL NQRLMHGVFG 
YSRWKNDEFL AAIAHWFSTQ HYTAIDSQTV VYGPSVIYMV SELIRQWSET GEGVVIHTPA
YDAFYKAIEG NQRTVMPVAL EKQADGWFCD MGKLEAVLAK PECKIMLLCS PQNPTGKVWT
CDELEIMADL CERHGVRVIS DEIHMDMVWG EQPHIPWSNV ARGDWALLTS GSKSFNIPAL
TGAYGIIENS SSRDAYLSAL KGRDGLSSPS VLALTAHIAA YQQGAPWLDA LRIYLKDNLT
YIADKMNAAF PELNWQIPQS TYLAWLDLRP LNIDDNALQK ALIEQEKVAI MPGYTYGEEG
RGFVRLNAGC PRSKLEKGVA GLINAIRAVR