Gene SeD_A4048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4048 
SymbolmalS 
ID6873316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3890779 
End bp3892806 
Gene Length2028 bp 
Protein Length675 aa 
Translation table11 
GC content56% 
IMG OID642786997 
Productperiplasmic alpha-amylase precursor 
Protein accessionYP_002217624 
Protein GI198244327 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value0.555384 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTTG CCGCCTTCGC TCTGACGCTG ATACCCGGTA TCGCGATCGC CTCATCATGG 
ACCTCACCCG GTTTTCCGAC GTTCTCGACG CAGGAAACCG GGCGCTTTAC CAGCCATGCT
GCATTAACAA AAGGTACACG CGCGTTAACG CTCCATATTG ACCAGCAGTG CTGGCAGCCC
TCCGGCGCCA TAAAACTCAA CCAAATGTTG TCGTTAAAAC CCTGCGAAGG CGCGCCACCG
CAATGGCGTT TGTTTAAAGA TGGCGACTAT ACGATAACGG TAGATACCCG CTCCGGTACG
CCGACCCTGT TGTTATCGAT AAAAACTGAA CCTGAGCGCA CCGCGCAGCT CGCCTACCAG
TGTCCCGTCT GGGATGGTTC GCCGCTCACG CTGGATGTTC GCCAGACCTT TCCGGAAGGG
ACGGTAGTGC GCGATTATTA CAGTGGTCAA ACCGATACCG TCCAAAACGG GCAAATCACG
CTGCAACCTG CCGACAGCCA CGGGCTTTTA TTGCTGGAAC GCGCGGAAAC CCATGCGTCA
GCGCCTTTTA ACTGGCGCAA CGCCACCGTT TATTTTGTGC TTACGGATCG CTTTCGCAAT
GGCGATCCAA CCAATGACCA CAGCTATGGT CGCCATAAGG ATGGTATGCA AGAGATTGGC
ACTTTCCACG GCGGCGATTT GCGTGGGTTG ACGAGTAAAC TGGACTATCT ACAGCAATTA
GGCGTGAGCG CCTTGTGGAT AAGCTCGCCG TTTGAACAGA TCCACGGCTG GGTCGGCGGC
GGAGCAAAAG GCGATTTTCC TCATTACGCC TATCACGGCT ATTACACTCA GGACTGGACG
ACGCTGGATG CCAATATGGG CAACGAAGCC GATCTCCGCG CGCTGGTCGA CGGCGCGCAC
CAGCGCGGCA TCCGTATTTT ATTTGACGTA GTAATGAATC ATGCCGGTTA CGCTACGCTG
GAGGATATGC AGGAGTATCA GTTTGGCGCG CTCTATTTAT CCGGCGCGGA ACGGCAAAAA
ATTCTCGGCG ATCGCTGGAC AAACTGGCGA CCCGCCGCCG GACAAAGCTG GCACAGCTTT
AACGACTACA TCAACTTCAG CGACAGCGCC GCCTGGGAAA AATGGTGGGG GAAAAAGTGG
ATTCGTACCG ATATTGGCGA CTACGACAGT CCGGGATTTG ACGATTTAAC CCTGTCGCTG
GCCTTCCTGC CGGATATAAA AACGGAATCT ACCACGCCTT CCGGTCTACC CGCGTTCTAT
GCCAACAAAC CCGATACTAA AGCAAAGTTC ATTGAAGGCT ATACGCCACG GGATTATCTG
ACTCACTGGT TAAGCCAGTG GGTGCATGAT TACGGCATTG ACGGGTTCCG GGTCGATACT
GCCAAAAACG TTGAGCTTCC TGCCTGGCAA CAGCTAAAAA CCCAGGCCAG CGCGGCGTTA
CATGAATGGA AGCAGGCCAA TCCGGACAAA GCGCTGGATG ATAGCCCGTT CTGGATGACT
GGCGAAGCGT GGGGCCACGG CGTCATGAAA AGTGATTATT ATCGCTATGG TTTCGACGCG
ATGATCAATT TTGATTATCA GGAGCAGGCG GCGAAAGCGG TCGATTGCCT GGCGGAAATG
GGGCCAGTCT GGCAGCAGAT GGCGGATAAA ATGCAGGATT TCAACGTATT AAGTTACCTC
TCCTCGCATG ATACGCGCCT TTTCCGTGAG GGCGGTGATA AGGCGGCGGA ACTGCTGCTG
CTTTCGCCAG GCGCGGTGCA GATCTTTTAC GGCGATGAAT CCGCTCGTCC CTTCGGCCCC
ACCGGCTCCG ACCCGCTGCA AGGCACCCGT TCAGATATGA ACTGGCAGGA CGTGAGCGGA
AAGTCAGCCG CAGCCGTCGC GCACTGGCAG CGTATTAGCC AGTTCCGCGC CAGACATCCC
GCCATCGGCG CAGGCCAACA AACCACGCTG ACGCTAAAAC ACGGGTACGG TTTTGTACGC
CAGTACGGCG ACGATACGGT GATGGTCGTC TGGGCGGGCC GCCGCTAA
 
Protein sequence
MKLAAFALTL IPGIAIASSW TSPGFPTFST QETGRFTSHA ALTKGTRALT LHIDQQCWQP 
SGAIKLNQML SLKPCEGAPP QWRLFKDGDY TITVDTRSGT PTLLLSIKTE PERTAQLAYQ
CPVWDGSPLT LDVRQTFPEG TVVRDYYSGQ TDTVQNGQIT LQPADSHGLL LLERAETHAS
APFNWRNATV YFVLTDRFRN GDPTNDHSYG RHKDGMQEIG TFHGGDLRGL TSKLDYLQQL
GVSALWISSP FEQIHGWVGG GAKGDFPHYA YHGYYTQDWT TLDANMGNEA DLRALVDGAH
QRGIRILFDV VMNHAGYATL EDMQEYQFGA LYLSGAERQK ILGDRWTNWR PAAGQSWHSF
NDYINFSDSA AWEKWWGKKW IRTDIGDYDS PGFDDLTLSL AFLPDIKTES TTPSGLPAFY
ANKPDTKAKF IEGYTPRDYL THWLSQWVHD YGIDGFRVDT AKNVELPAWQ QLKTQASAAL
HEWKQANPDK ALDDSPFWMT GEAWGHGVMK SDYYRYGFDA MINFDYQEQA AKAVDCLAEM
GPVWQQMADK MQDFNVLSYL SSHDTRLFRE GGDKAAELLL LSPGAVQIFY GDESARPFGP
TGSDPLQGTR SDMNWQDVSG KSAAAVAHWQ RISQFRARHP AIGAGQQTTL TLKHGYGFVR
QYGDDTVMVV WAGRR