Gene EcSMS35_1813 details


Gene Information

Locus tag: EcSMS35_1813
Symbol:
ID: 6142721
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1833953
End bp: 1835632
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 44%
IMG OID: 641616689
Product: alpha amylase family protein
Protein accession: YP_001743867
Protein GI: 170679871
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
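
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive, which is consistent with the values in this record, and the function names are hypothetical, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1833953, 1835632)
print(length)                  # 1680
print(protein_length(length))  # 559
```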


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.653374
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACAGA AAATTACGGA TTACCTGGAC GAAATCTACG GTGGAACATT TACCGCAACT 
CATTTACAGA AACTTGTAAC GCGTCTTGAG AGTGCGAAAC GATTAATTAC ACTGCGACGT
AAAAAACACT GGGATGAAAG TGATGTCGTG TTAATCACCT ATGCCGATCA ATTTCACAGT
AATGATTTAA AACCACTACC CACATTTAAT CAGTTTTACC CTCAATGGCT GCAAAGCATT
TTTTCACATG TTCATTTATT ACCTTTTTAT CCATGGTCAT CTGATGATGG CTTTTCGGTA
ATTGATTATC AACAGGTCGC TAGTGAAGCG GGGGAGTGGC AGGATATTCA GCAACTCGGT
GAATGCAGTC ATTTAATGTT TGATTTTGTC TGCAACCATA TGTCGGCAAA AAGTGAATGG
TTTAAAAACT ATTTACAACA GCAGCCAGGT TTTGAAGATT TTTTTATTGC CGTTGACCCG
CAAACCGATC TCAGCGTCGT CACTCGCCCG CGTGCGTTAC CGTTATTAAC GCCATTCCAG
ATGCGCGATC ATTCAACGCG CCATTTATGG ACCACCTTTA GTGACGATCA AATTGACCTG
AATTACCGTA GCCCTGAAGT GTTACTGGCG ATGGTGGATG TATTGCTGTG CTACCTGGAA
AAGGGCGCAG AATATGTCCG TCTGGATGCC GTTGGCTTTA TGTGGAAAGA GCCGGGAACA
AGCTGCATCC ATCTGGAAAA AACACATCTG ATTATCAAAC TGTTACGGTC GATTATTGAT
GACGTAGCAC CAGGTACAGT GATCATTACC GAGACCAATG TTCCGCACAG AGACAACATT
GCTTACTTTG GCAACGGTGA TGACGAAGCG CATATGGTGT ACCAGTTCTC GCTGCCGCCG
CTGGTGCTGC ATGCGGTGCA AAAACAGAAC GTTGAGGCGC TTTGTGCGTG GGCGCAAAAC
CTGACACTAC CTTCCAGCAA CACCACCTGG TTTAACTTCC TCGCCTCTCA CGATGGCATC
GGGCTTAACC CACTGCGTGG CTTGCTACCC GAAAGCGAAA TATTAGCGCT GGTCGAGGCA
TTACAGCAGG AAGGGGCATT AGTTAACTGG AAAAATAATC CCGACGGTAC GCGTAGCCCG
TATGAAATGA ATGTCACTTA TATGGATGCG TTAAACCGCC GCGAGAGTAG CGATGAAGAA
CGTTGCGCCA GGTTTATCCT TGCCCATGCG ATTTTGTTAA GTTTCCCCGG TGTGCCAGCG
ATATATATTC AAAGTATTCT GGGCTCGCGT AATGATTACG CAGGTGTCGA AAAACTGGGA
TATAACCGTG CGATTAACCG TAAAAAATAT TACAGTAAAG AGATCACGAC CGAACTGAAC
AATAAAACGA CGTTAAGGCA CGCGGTATAT CATGAATTGT CGCGACTAAT TAAAATTCGT
CGAAGTCATA ACGAGTTTCA TCCAGATAAT GATTTTACCA TCGACACGGT TAATTCATCC
GTAATGTGTA TTCAAAGAAG CAACGCGGAT GGTAATTGTC TGACAGGATT GTTTAATGTC
AGTGAAAATA TTCAGCATAT AAATATTACT GACCTGCACG GTCGGGATCT GATTAGTGAA
GTTGATATAG TGGGTAATGA AATAACGCTG CGCCCCTGGC AGGTTATGTG GATTAAATAA
 
Protein sequence
MKQKITDYLD EIYGGTFTAT HLQKLVTRLE SAKRLITLRR KKHWDESDVV LITYADQFHS 
NDLKPLPTFN QFYPQWLQSI FSHVHLLPFY PWSSDDGFSV IDYQQVASEA GEWQDIQQLG
ECSHLMFDFV CNHMSAKSEW FKNYLQQQPG FEDFFIAVDP QTDLSVVTRP RALPLLTPFQ
MRDHSTRHLW TTFSDDQIDL NYRSPEVLLA MVDVLLCYLE KGAEYVRLDA VGFMWKEPGT
SCIHLEKTHL IIKLLRSIID DVAPGTVIIT ETNVPHRDNI AYFGNGDDEA HMVYQFSLPP
LVLHAVQKQN VEALCAWAQN LTLPSSNTTW FNFLASHDGI GLNPLRGLLP ESEILALVEA
LQQEGALVNW KNNPDGTRSP YEMNVTYMDA LNRRESSDEE RCARFILAHA ILLSFPGVPA
IYIQSILGSR NDYAGVEKLG YNRAINRKKY YSKEITTELN NKTTLRHAVY HELSRLIKIR
RSHNEFHPDN DFTIDTVNSS VMCIQRSNAD GNCLTGLFNV SENIQHINIT DLHGRDLISE
VDIVGNEITL RPWQVMWIK
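
The protein sequence can be checked against the gene sequence by translating it codon by codon, and the GC content can be recomputed from the same string. The following is a minimal sketch, not the method the database itself uses. It assumes the standard codon assignments, which translation table 11 shares with table 1 for elongation (the two differ in permitted start codons; this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGAAACAGA AAATTACGGA TTACCTGGAC"))  # MKQKITDYLD
```

Passing the full gene sequence to `translate` and `gc_percent` lets the listed protein sequence and GC content be compared against recomputed values.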