Gene SNSL254_A4571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4571 
SymbolmalF 
ID6483258 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4438820 
End bp4440364 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content52% 
IMG OID642739796 
Productmaltose transporter membrane protein 
Protein accessionYP_002043478 
Protein GI194442589 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1175] ABC-type sugar transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.0143691 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGTCA TTAAAAAGAA ACACTGGTGG CAAAGTGACC AGCTTAAATG GTCAGTAATT 
GGTCTGCTGG GCCTGCTGGT GGGTTACCTT GTAGTTTTAA TGTACGTACA AGGGGAGTAT
CTGTTCGCCA TCATGACGCT GATTTTAAGC TCTGCTGGCC TGTATATTTT CGCAAATCGT
AAAACCTATG CCTGGCGCTA CGTTTACCCT GGCCTCGCCG GCATGGGGCT ATTTGTGCTA
TTCCCGCTGG TGTGCACCAT CGCTATCGCC TTTACTAACT ACAGCAGCAC CAACCAGCTC
ACCTTTGAAC GCGCCCAGCA AGTGCTGATG GACCGTTCTT ATCAGGCGGG AAAAACCTAT
AACTTCGGTC TTTACCCGAC CGGCGACGAG TGGCAACTGG CCCTTACCGA CGGAGAAACG
GGCAAACATT ACCTGTCTGG CGCATTTTCC TTCGGCGGCG AACAAAAATT ACAGCTGAAA
GAGACCGATG CCCTGCCGGG CGGCGAACGC GCCAATCTGC GGATAATCAC CCAGAACCGT
CTGGCGTTGA ACCAGATAAC CGCCGTCCTG CCGGATGAAA GTAAAGTGAT TATGAGCTCG
CTGCGTCAGT TTTCCGGTAC CCGTCCGCTG TACACACTGG CTGATGACGG CCTGCTTACC
AATAACCAGA GCGGCGTAAA ATACCGGCCA AATAACGATA GCGGTTATTA TCAGTCGATC
AACGCCGACG GAAGCTGGGG CGATGAAAAA CTCAGTCCTG GTTATACCGT TACTATCGGC
GCGAAAAACT TTACGCGCGT CTTTACCGAC GAAGGGATCC AGAAGCCTTT TTTCGCTATC
TTCGTCTGGA CCGTGGTCTT TTCGGTCCTC ACTGTGGTGT TAACCGTGGC GGTTGGGATG
GTATTGGCCT GCCTCGTACA GTGGGAAGCG CTGAAAGGTA AAGCTATCTA CCGCGTCCTG
CTGATTCTGC CCTACGCCGT ACCGTCGTTT ATTTCAATTT TGATTTTCAA AGGGTTATTC
AACCAAAGCT TTGGCGAAAT CAATATGATG CTGAGCGCGC TGTTTGGCAT TAAACCTGCC
TGGTTCAGCG ACCCCAACAC CGCGCGGGCA ATGGTGATTA TCGTGAATAC CTGGCTGGGC
TATCCCTACA TGATGATCCT GTGCATGGGA CTGCTGAAAG CGATTCCGGA TGACCTGTAC
GAAGCCTCCG CAATGGACGG CGCCGGTCCA TTTCAGAATT TCTTTAAGAT CACCTTACCG
CTGCTGATTA AGCCGCTAAC GCCGCTGATG ATCGCCAGCT TCGCCTTTAA CTTTAATAAC
TTTGTCCTGA TTCAGTTATT GACCAACGGC GGGCCAGACC GTCTCGGCAC CACCACGCCT
GCCGGTTATA CCGATTTGCT CGTCAGCTAC ACCTACCGTA TCGCCTTTGA AGGCGGCGGC
GGTCAGGACT TCGGTCTGGC GGCGGCCATT GCCACGCTGA TCTTCCTGCT GGTAGGCGCG
CTGGCAATAG TGAACCTGAA AGCCACGCGT ATGAAGTTTG ATTAA
 
Protein sequence
MDVIKKKHWW QSDQLKWSVI GLLGLLVGYL VVLMYVQGEY LFAIMTLILS SAGLYIFANR 
KTYAWRYVYP GLAGMGLFVL FPLVCTIAIA FTNYSSTNQL TFERAQQVLM DRSYQAGKTY
NFGLYPTGDE WQLALTDGET GKHYLSGAFS FGGEQKLQLK ETDALPGGER ANLRIITQNR
LALNQITAVL PDESKVIMSS LRQFSGTRPL YTLADDGLLT NNQSGVKYRP NNDSGYYQSI
NADGSWGDEK LSPGYTVTIG AKNFTRVFTD EGIQKPFFAI FVWTVVFSVL TVVLTVAVGM
VLACLVQWEA LKGKAIYRVL LILPYAVPSF ISILIFKGLF NQSFGEINMM LSALFGIKPA
WFSDPNTARA MVIIVNTWLG YPYMMILCMG LLKAIPDDLY EASAMDGAGP FQNFFKITLP
LLIKPLTPLM IASFAFNFNN FVLIQLLTNG GPDRLGTTTP AGYTDLLVSY TYRIAFEGGG
GQDFGLAAAI ATLIFLLVGA LAIVNLKATR MKFD