Gene YpsIP31758_4138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_4138 
SymbolmalS 
ID5385644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp4670910 
End bp4672973 
Gene Length2064 bp 
Protein Length687 aa 
Translation table11 
GC content53% 
IMG OID640867167 
Productperiplasmic alpha-amylase precursor 
Protein accessionYP_001403081 
Protein GI153950894 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGTC TCACACTCCC CTTGCTACTG GTTCTTTCAC CAGCAGCGAT GGCGAACTGG 
TCCCTACAGC ATTTCCCGGC TTTTACTGAA CAGGACAGCG GTATTTTTCT CAGTAGCAGT
GCACTGACCA AAGGCGAATA CCCTCTAAAA TTTTATCAGG ATAAACAGTG CTGGCAGCCA
ACCGGGCCGG TAAAGCTGAA CCAAATGCTC TCGTTGCAGC CTTGCCAGGA GAATATCGCG
ATACCGTGGC GGTTGTTCCG TGATGGCCAG TATCAAGCGC GAATCGATAC CCGTAGCGGC
ACCCCGACGC TGACCTTAAG TGTCACCGCA CCCGTAGCGG AGGCAGTCCA GGTCGTCACC
CACGTTTGCC AACGCTGGGA TGGCAATCCT GTGACCGTGG ATGTCAGTAA AACCTTTGCC
GAAGGTGAAA TTGTACGGGA TTTCTATTCC GGCCAAACGG CAACCGTTTC TCTCGGCAAA
ATTACGCTAC GACCTGCCCC AGAGAGTGGT GGTTTACTGC TGTTGGAATC GGCCCAAACA
CAGCAAGCCG CCCCTTTTAG CTGGCAAAAC GCTACGGTCT ATTTTGCCCT GACGGACAGA
TTTAATAACG GCAACCCAGC GAACGACCAT AGCTATGGCC GTCATGGCGA TGGGATGCAG
GAGATAGGAA CGTTTCACGG CGGCGATTTG GCGGGGCTTA CCGAGAAGCT GGATTATCTG
CAACAGCTTG GGGTCAACGC ACTGTGGATC AGTTCTCCAC TGGAACAAAT TCACGGCTGG
GTCGGCGGGG GGACCAAAGG CGACTTCCCA CATTATGCCT ATCATGGCTA CTACGGGCTG
GACTGGACCC GTCTGGATGC CAATATGGGC ACCGAACAGG ATTTACGCAC ACTGGTTGAA
CAGGCACATA AACGCGGCAT TCGCATCCTA TTTGATGTGG TGATGAATCA TGTGGGTTAT
GCAACGCTGG CGGATATGCA GAACGACCAA TTCGGGGCGC TCTACCTGCA AGGCGATCAG
CAGGAAAAAA CCTTGGGTAA GCGTTGGAGC GACTGGACCC CTGGCAGCGG GCAGACCTGG
CACAGCTTTA ATGACTACAT CAACTTCAGT GATAAGACCG CTTGGGATAA TTGGTGGGGT
AAAAAATGGA TCCGCACCGA TATTGGTGAT TACGACACCC CCGGTTATGA CGATCTGACG
ATGTCGCTGG CCTTCCTACC CGATATCAAA ACAGAATCGA CGCAGTCCAG CGGTTTGCCG
GTGTTTTACC GTAACAAACC GGACACCGCA GCCCAAGAAA TCGCCGGTGC GACACCCCGT
GATTATATGA CGCATTGGTT AAGCCAATGG GTACGCGATT ACGGCATTGA CGGTTTTCGG
GTTGATACTG CCAAGCATGT AGAGAAACCC GCCTGGCAAC AATTAAAGCA GCAGAGCATC
GCGGCACTGG CCGAATGGAA AGCCGCACAT CCAGAACAGG CGCTGGATAA TCTGCCATTT
TGGATGACCG GAGAGGCTTG GGGCCACGGT GTCATGAAAA GCGATTATTA CCAAAATGGC
TTTGATGCCA TGATTAATTT TGATTTTCAG GATCAGGCAA ATCAGGCGCT GGCCTGCTTC
TCATCTATCG AGAGTACCTA CAACCAAATG GCGGAGAAAC TGCAAAACTT CAATGTGTTG
AGCTACCTCT CGTCTCACGA TACCCGGTTA TTCTTTAAAG ACGATGCACA ACAGTCACTG
GCAAAACAGC AGCGAGCAGG CTCTTTACTG TTGTTGGCTC CGGGGGCAGT ACAAATCTTC
TACGGTGATG AAAGCGGGCG GAAGTTTGGC CCAACCGGTT CCGATCCGTT GCAGGGTACC
CGTTCGGATA TGAACTGGAG TGAGCTATCG GGCGAAAAAG GCGCACTGTT GGCCCATTGG
CAAAAAGTCA GTCAATTCCG CGCCCGTCAT CCCGCGATAG GCGCTGGTGT ACAACAATCG
CAACAAACCG CCAATTACTA TGCCTTTAGC CGCCAACATC AGGGCGATAA GGTTCTGGTC
GTTTGGGTCG GTGATAAGAA CTGA
 
Protein sequence
MKRLTLPLLL VLSPAAMANW SLQHFPAFTE QDSGIFLSSS ALTKGEYPLK FYQDKQCWQP 
TGPVKLNQML SLQPCQENIA IPWRLFRDGQ YQARIDTRSG TPTLTLSVTA PVAEAVQVVT
HVCQRWDGNP VTVDVSKTFA EGEIVRDFYS GQTATVSLGK ITLRPAPESG GLLLLESAQT
QQAAPFSWQN ATVYFALTDR FNNGNPANDH SYGRHGDGMQ EIGTFHGGDL AGLTEKLDYL
QQLGVNALWI SSPLEQIHGW VGGGTKGDFP HYAYHGYYGL DWTRLDANMG TEQDLRTLVE
QAHKRGIRIL FDVVMNHVGY ATLADMQNDQ FGALYLQGDQ QEKTLGKRWS DWTPGSGQTW
HSFNDYINFS DKTAWDNWWG KKWIRTDIGD YDTPGYDDLT MSLAFLPDIK TESTQSSGLP
VFYRNKPDTA AQEIAGATPR DYMTHWLSQW VRDYGIDGFR VDTAKHVEKP AWQQLKQQSI
AALAEWKAAH PEQALDNLPF WMTGEAWGHG VMKSDYYQNG FDAMINFDFQ DQANQALACF
SSIESTYNQM AEKLQNFNVL SYLSSHDTRL FFKDDAQQSL AKQQRAGSLL LLAPGAVQIF
YGDESGRKFG PTGSDPLQGT RSDMNWSELS GEKGALLAHW QKVSQFRARH PAIGAGVQQS
QQTANYYAFS RQHQGDKVLV VWVGDKN