Gene SNSL254_A3942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3942 
SymbolmalS 
ID6482297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3822881 
End bp3824908 
Gene Length2028 bp 
Protein Length675 aa 
Translation table11 
GC content56% 
IMG OID642739202 
Productperiplasmic alpha-amylase precursor 
Protein accessionYP_002042912 
Protein GI194445720 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value0.973795 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTTG CCGCCTTCGC TCTGACGCTG ATACCCGGTA TCGCGATCGC CTCATCATGG 
ACCTCACCCG GTTTTCCGAC GTTCTCGACG CAGGAAACCG GGCGCTTTAC CAGCCATGCT
GCATTAACAA AAGGTACACG CGCGTTAACG CTCCATATTG ACCAGCAGTG CTGGCAGCCC
TCCGGCGCCA TAAAACTCAA CCAAATGTTG TCGTTAAAAC CCTGCGAAGG CGCGCCACCG
CAATGGCGTT TGTTTAAAGA TGGCGACTAT ACGATAACGG TAGATACCCG CTCCGGTACG
CCGACCCTGT TGTTATCGAT AAAAACTGAA CCTGAGCGCA CCGCGCAGCT CGCCTACCAG
TGTCCCGTCT GGGATGGTTC GCCGCTCACG CTGGATGTTC GCCAGACCTT TCCGGAAGGG
ACGGTAGTGC GCGATTATTA CAGTGGTCAA ACCGATACCG TCCAAAACGG GCAAATCACG
CTGCAACCTG CCGACAGCCA CGGGCTTTTA TTGCTGGAAC GCGCGGAAAC CCATGCGTCA
GCGCCTTTTA ACTGGCGCAA CGCCACCGTT TATTTTGTGC TTACGGATCG CTTTCGCAAT
GGCGATCCAA CCAATGACCA CAGCTATGGT CGCCATAAGG ATGGTATGCA AGAGATTGGC
ACTTTCCACG GCGGCGATTT ACGCGGGTTG ACGAGTCAAC TGGACTATCT ACAGCAATTA
GGCGTGAACG CCTTGTGGAT AAGCTCGCCG TTTGAACAGA TCCACGGCTG GGTCGGCGGC
GGAACAAAAG GCGATTTTCC TCATTACGCC TATCACGGCT ATTACACTCA GGACTGGACG
ACGCTGGATG CCAATATGGG CAGCGAAGCC GATCTCCGCG CGCTGGTCGA CGGCGCGCAC
CAGCGCGGCA TCCGTATTTT ATTTGACGTA GTAATGAATC ATGCCGGTTA CGCCACGCTG
GCGGATATGC AGGAGTATCA GTTCGGCGCG CTCTATTTAT CCGGCGCGGA ACGGCAAAAA
ATTCTCGGCG ATCGCTGGAC AAACTGGCGA CCGGCCACCG GACAAAGCTG GCACAGCTTT
AACGACTACA TCAACTTCAG CGACAGCGCC GCCTGGGAAA AATGGTGGGG GAAAAAGTGG
ATTCGTACCG ATATTGGCGA CTACGACAGT CCGGGATTTG ACGATTTAAC CCTGTCGCTG
GCCTTCCTGC CGGATATAAA AACGGAATCT ACCACGCCTT CCGGTCTACC CGCGTTCTAT
GCCAACAAAC CCGATACTAA AGCAAAGTTC ATTGAAGGCT ATACGCCACG GGATTATCTG
ACTCACTGGT TAAGCCAGTG GGTGCATGAT TACGGCATTG ACGGGTTCCG GGTCGATACT
GCCAAAAACG TTGAGCTTCC TGCCTGGCAA CAGCTAAAAA CCCAGGCCAG CGCGGCGTTA
CGTGAATGGA AGCAGACCAA TCCGGACAAA GCGCTGGATA ATAGCCCGTT CTGGATGACT
GGCGAAGCGT GGGGCCACGG CGTCATGAAA AGTGATTATT ATCGCTATGG TTTCGACGCG
ATGATCAATT TTGATTATCA GGAGCAGGCG GCAAAAGCGG TCGATTGCCT GGCGGAAATG
GGGCCAGTCT GGCAGCAGAT GGCGGATAAA ATGCAGGATT TCAACGTATT AAGTTACCTC
TCCTCGCATG ATACGCGCCT TTTCCGTGAG GGCGGTGATA AGGCGGCGGA ACTGCTGCTG
CTTTCGCCGG GCGCGGTGCA GATCTTTTAC GGCGATGAAT CCGCTCGTCC CTTCGGCCCC
ACCGGCTCCG ACCCGCTGCA AGGCACCCGT TCAGATATGA ACTGGCAGGA CGTGAGCGGA
AAGTCAGCCG CAGCCGTCGC GCACTGGCAG CGTATTAGCC AGTTCCGCGC CAGACATCCC
GCCATCGGCG CAGGCCAACA AACCACGCTG ACGCTAAAAC ACGGGTACGG TTTTGTACGC
CAGTACGGCG ACGATACGGT GATGGTCGTC TGGGCGGGCC GCCGCTAA
 
Protein sequence
MKLAAFALTL IPGIAIASSW TSPGFPTFST QETGRFTSHA ALTKGTRALT LHIDQQCWQP 
SGAIKLNQML SLKPCEGAPP QWRLFKDGDY TITVDTRSGT PTLLLSIKTE PERTAQLAYQ
CPVWDGSPLT LDVRQTFPEG TVVRDYYSGQ TDTVQNGQIT LQPADSHGLL LLERAETHAS
APFNWRNATV YFVLTDRFRN GDPTNDHSYG RHKDGMQEIG TFHGGDLRGL TSQLDYLQQL
GVNALWISSP FEQIHGWVGG GTKGDFPHYA YHGYYTQDWT TLDANMGSEA DLRALVDGAH
QRGIRILFDV VMNHAGYATL ADMQEYQFGA LYLSGAERQK ILGDRWTNWR PATGQSWHSF
NDYINFSDSA AWEKWWGKKW IRTDIGDYDS PGFDDLTLSL AFLPDIKTES TTPSGLPAFY
ANKPDTKAKF IEGYTPRDYL THWLSQWVHD YGIDGFRVDT AKNVELPAWQ QLKTQASAAL
REWKQTNPDK ALDNSPFWMT GEAWGHGVMK SDYYRYGFDA MINFDYQEQA AKAVDCLAEM
GPVWQQMADK MQDFNVLSYL SSHDTRLFRE GGDKAAELLL LSPGAVQIFY GDESARPFGP
TGSDPLQGTR SDMNWQDVSG KSAAAVAHWQ RISQFRARHP AIGAGQQTTL TLKHGYGFVR
QYGDDTVMVV WAGRR