Gene EcHS_A4273 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4273 
SymbolmalF 
ID5594199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4275275 
End bp4276819 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content53% 
IMG OID640923375 
Productmaltose transporter membrane protein 
Protein accessionYP_001460820 
Protein GI157163502 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1175] ABC-type sugar transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones59 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGTCA TTAAAAAGAA ACATTGGTGG CAAAGCGACG CGCTGAAATG GTCAGTGCTA 
GGTCTGCTCG GCCTGCTGGT GGGTTACCTT GTTGTTTTAA TGTACGCACA AGGGGAATAC
CTGTTCGCCA TTACCACGCT GATATTGAGT TCAGCGGGGC TGTATATTTT CGCCAATCGT
AAAGCCTACG CCTGGCGCTA TGTTTACCCG GGAATGGCTG GAATGGGATT ATTCGTCCTC
TTCCCTCTGG TCTGCACCAT CGCCATTGCC TTCACCAACT ACAGCAGCAC TAACCAGCTG
ACTTTTGAAC GTGCGCAGGA AGTGTTGTTA GATCGCTCCT GGCAAGCAGG CAAAACCTAT
AACTTTGGTC TTTACCCGGC GGGCGATGAG TGGCAACTGG CGCTCAGTGA CGGCGAAACC
GGCAAAAATT ACCTTTCCGA CGCTTTTAAA TTTGGCGGCG AGCAAAAACT GCAACTGAAA
GAAACGACCG CCCAGCCCGA AGGCGAACGC GCGAATCTAC GCGTGATTAC CCAGAATCGT
CAGGCGCTGA GTGACATTAC CGCCATTCTG CCGGATGGCA ACAAAGTGAT GATGAGCTCC
CTGCGCCAGT TTTCTGGCAC GCAGCCGCTC TACACACTCG ACGGTGACGG CACGTTGACG
AATAATCAGA GCGGCGTGAA ATATCGTCCG AATAACCAAA TTGGCTTTTA CCAGTCCATT
ACCGCCGACG GCAACTGGGG TGATGAAAAG CTAAGCCCCG GTTACACCGT GACCACCGGC
TGGAAAAACT TTACCCGCGT CTTTACCGAC GAAGGCATTC AGAAACCGTT CCTCGCCATT
TTCGTCTGGA CCGTGGTGTT CTCGCTGATC ACTGTCTTTT TAACGGTGGC GGTCGGCATG
GTTCTGGCAT GTCTGGTGCA GTGGGAAGCG TTGCGCGGCA AAGCGGTCTA TCGCGTCCTG
CTGATTCTGC CCTACGCGGT GCCATCGTTC ATTTCAATCT TGATTTTCAA AGGATTGTTT
AACCAGAGCT TCGGTGAAAT CAACATGATG TTGAGCGCGC TGTTTGGCGT GAAGCCCGCC
TGGTTCAGCG ATCCGACCAC CGCCCGCACG ATGCTGATTA TCGTCAATAC TTGGCTGGGT
TATCCGTACA TGATGATCCT CTGCATGGGC TTGCTGAAAG CGATTCCGGA CGATTTGTAT
GAAGCCTCAG CAATGGATGG CGCAGGTCCG TTCCAGAACT TCTTTAAGAT TACGCTACCG
CTGCTGATTA AACCGCTGAC GCCGCTGATG ATCGCCAGCT TCGCCTTTAA CTTTAACAAC
TTCGTGCTGA TTCAGCTGTT AACCAACGGC GGCCCGGATC GTCTTGGCAC GACCACGCCA
GCCGGTTATA CCGACCTGCT TGTTAACTAC ACCTACCGTA TCGCTTTTGA AGGCGGCGGG
GGTCAGGACT TCGGTCTGGC TGCAGCAATT GCCACGCTGA TCTTCCTGCT GGTGGGTGCG
CTGGCGATAG TGAACCTGAA AGCCACGCGA ATGAAGTTTG ATTAA
 
Protein sequence
MDVIKKKHWW QSDALKWSVL GLLGLLVGYL VVLMYAQGEY LFAITTLILS SAGLYIFANR 
KAYAWRYVYP GMAGMGLFVL FPLVCTIAIA FTNYSSTNQL TFERAQEVLL DRSWQAGKTY
NFGLYPAGDE WQLALSDGET GKNYLSDAFK FGGEQKLQLK ETTAQPEGER ANLRVITQNR
QALSDITAIL PDGNKVMMSS LRQFSGTQPL YTLDGDGTLT NNQSGVKYRP NNQIGFYQSI
TADGNWGDEK LSPGYTVTTG WKNFTRVFTD EGIQKPFLAI FVWTVVFSLI TVFLTVAVGM
VLACLVQWEA LRGKAVYRVL LILPYAVPSF ISILIFKGLF NQSFGEINMM LSALFGVKPA
WFSDPTTART MLIIVNTWLG YPYMMILCMG LLKAIPDDLY EASAMDGAGP FQNFFKITLP
LLIKPLTPLM IASFAFNFNN FVLIQLLTNG GPDRLGTTTP AGYTDLLVNY TYRIAFEGGG
GQDFGLAAAI ATLIFLLVGA LAIVNLKATR MKFD