Gene SeHA_C3987 details


Gene Information

Locus tag: SeHA_C3987
Symbol: malS
ID: 6490751
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3863622
End bp: 3865649
Gene Length: 2028 bp
Protein Length: 675 aa
Translation table: 11
GC content: 56%
IMG OID: 642744088
Product: periplasmic alpha-amylase precursor
Protein accession: YP_002047693
Protein GI: 194450400
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
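
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive (the record does not state its convention) and that the final codon of the CDS is an untranslated stop codon. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3863622
END_BP = 3865649


def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length_bp // 3 - 1


gene_len = feature_length(START_BP, END_BP)
print(gene_len, protein_length(gene_len))  # prints: 2028 675
```

Both results match the Gene Length (2028 bp) and Protein Length (675 aa) fields listed above.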


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 83
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTTG CCGCCTTCGC TCTGACGCTG ATACCCGGTA TCGCGATCGC CTCATCATGG 
ACCTCACCCG GTTTTCCGAC GTTCTCGACG CAGGAAACCG GGCGCTTTAC CAGCCATGCT
GCATTAACAA AAGGTACACG CGCGTTAACG CTCCATATTG ACCAGCAGTG CTGGCAGCCC
TCCGGCGCCA TAAAACTCAA CCAAATGTTG TCGTTAAAAC CCTGCGAAGG CGCGCCACCG
CAATGGCGTT TGTTTAAAGA TGGCGACTAT ACGATAACGG TAGATACCCG CTCCGGTACG
CCGACCCTGT TGTTATCGAT AAAAACTGAA CCTAAACGCA CCGCGCAGCT CGCCTACCAG
TGTCCCGTCT GGGATGGTTC GCCGCTCACG CTGGATGTTC GCCAGACCTT TCCGGAAGGG
ACGGTAGTGC GCGATTATTA CAGTGGTCAA ACCGATACCG TCCAAAACGG GCAAATCACG
CTGCAACCTG CCGACAGCCA CGGGCTTTTA TTGCTGGAAC GCGCGGAAAC CCATGCGTCA
GCGCCTTTTA ACTGGCGCAA CGCCACCGTT TATTTTGTGC TTACGGATCG CTTTCGCAAT
GGCGATCCAA CCAATGACCA CAGCTATGGT CGCCATAAGG ATGGTATGCA AGAGATTGGC
ACTTTCCACG GCGGCGATTT GCGTGGGTTG ACGAGTCAAC TGGACTATCT ACAGCAATTA
GGCGTGAACG CCTTGTGGAT AAGCTCGCCG TTTGAACAGA TCCACGGCTG GGTCGGCGGC
GGGACAAAAG GCGATTTTCC TCATTACGCC TATCACGGCT ATTACACTCA GGACTGGACG
ACACTGGATG CCAACATGGG CAGCGAAGCC GATCTCCGCG CACTGGTCGA CGGCGCGCAC
CAGCGCGGCA TCCGTATTTT ATTTGACGTA GTAATGAATC ATGCCGGTTA CGCCACGCTG
GCGGATATGC AGGAGTATCA GTTCGGCGCG CTCTATTTAT CTGGCGCGGA ACGGCAAAAA
ATTCTCGGCG ATCGCTGGAC AAACTGGCGA CCCACCACCG GACAAAGCTG GCACAGCTTT
AACGACTACA TCAACTTCAG CGACAGCGCC GCCTGGGAAA AATGGTGGGG GAAAAAGTGG
ATTCGTACCG ATATTGGCGA CTACGACAGT CCGGGATTTG ACGATTTAAC CCTGTCGCTG
GCCTTCCTGC CGGATATAAA AACGGAATCT ACCACGCCTT CCGGTCTACC CGCGTTCTAT
GCCAACAAAC CCGATACTAA AGCAAAGTTC ATTGAAGGCT ATACGCCACG GGATTATCTG
ACTCACTGGT TAAGCCAGTG GGTGCATGAT TACGGCATTG ACGGGTTCCG GGTCGATACT
GCCAAAAACG TTGAGCTTCC TGCCTGGCAA CAGCTAAAAA CCCAGGCCAG CGCGGCGTTA
CGTGAATGGA AGCAGGCCAA TCCGGACAAA GCGCTGGATA ATAGCCCGTT CTGGATGACT
GGCGAAGCGT GGGGCCACGG CGTCATGAAA AGTGATTATT ATCGCTATGG TTTCGACGCG
ATGATCAATT TTGATTATCA GGAGCAGGCG GCGAAAGCGG TCGATTGCCT GGCGGAAATG
GGACCAGTCT GGCAGCAGAT GGCGGATAAA ATGCAGGATT TCAACGTATT AAGTTACCTC
TCCTCGCATG ATACGCGCCT TTTCCGTGAG GGCGGTGATA AGGCGGCGGA ACTGCTGCTG
CTTTCGCCGG GCGCGGTGCA GATCTTTTAC GGCGATGAAT CCGCTCGTCC CTTCGGCCCC
ACCGGCTCCG ACCCGCTGCA AGGCACCCGT TCAGATATGA ACTGGCAGGA CGTGAGCGGA
AAGTCAGCCG CAGCCGTCGC GCACTGGCAG CGTATTAGCC AGTTCCGCGC CAGACATCCC
GCCATCGGCG CAGGCCAACA AACCACGCTG ACGCTAAAAC ACGGGTACGG TTTTGTACGC
CAGTACGGCG ACGATACGGT GATGGTCGTC TGGGCGGGCC GCCGCTAA
 
Protein sequence
MKLAAFALTL IPGIAIASSW TSPGFPTFST QETGRFTSHA ALTKGTRALT LHIDQQCWQP 
SGAIKLNQML SLKPCEGAPP QWRLFKDGDY TITVDTRSGT PTLLLSIKTE PKRTAQLAYQ
CPVWDGSPLT LDVRQTFPEG TVVRDYYSGQ TDTVQNGQIT LQPADSHGLL LLERAETHAS
APFNWRNATV YFVLTDRFRN GDPTNDHSYG RHKDGMQEIG TFHGGDLRGL TSQLDYLQQL
GVNALWISSP FEQIHGWVGG GTKGDFPHYA YHGYYTQDWT TLDANMGSEA DLRALVDGAH
QRGIRILFDV VMNHAGYATL ADMQEYQFGA LYLSGAERQK ILGDRWTNWR PTTGQSWHSF
NDYINFSDSA AWEKWWGKKW IRTDIGDYDS PGFDDLTLSL AFLPDIKTES TTPSGLPAFY
ANKPDTKAKF IEGYTPRDYL THWLSQWVHD YGIDGFRVDT AKNVELPAWQ QLKTQASAAL
REWKQANPDK ALDNSPFWMT GEAWGHGVMK SDYYRYGFDA MINFDYQEQA AKAVDCLAEM
GPVWQQMADK MQDFNVLSYL SSHDTRLFRE GGDKAAELLL LSPGAVQIFY GDESARPFGP
TGSDPLQGTR SDMNWQDVSG KSAAAVAHWQ RISQFRARHP AIGAGQQTTL TLKHGYGFVR
QYGDDTVMVV WAGRR