Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1813 |
Symbol | |
ID | 6142721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1833953 |
End bp | 1835632 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641616689 |
Product | alpha amylase family protein |
Protein accession | YP_001743867 |
Protein GI | 170679871 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.653374 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACAGA AAATTACGGA TTACCTGGAC GAAATCTACG GTGGAACATT TACCGCAACT CATTTACAGA AACTTGTAAC GCGTCTTGAG AGTGCGAAAC GATTAATTAC ACTGCGACGT AAAAAACACT GGGATGAAAG TGATGTCGTG TTAATCACCT ATGCCGATCA ATTTCACAGT AATGATTTAA AACCACTACC CACATTTAAT CAGTTTTACC CTCAATGGCT GCAAAGCATT TTTTCACATG TTCATTTATT ACCTTTTTAT CCATGGTCAT CTGATGATGG CTTTTCGGTA ATTGATTATC AACAGGTCGC TAGTGAAGCG GGGGAGTGGC AGGATATTCA GCAACTCGGT GAATGCAGTC ATTTAATGTT TGATTTTGTC TGCAACCATA TGTCGGCAAA AAGTGAATGG TTTAAAAACT ATTTACAACA GCAGCCAGGT TTTGAAGATT TTTTTATTGC CGTTGACCCG CAAACCGATC TCAGCGTCGT CACTCGCCCG CGTGCGTTAC CGTTATTAAC GCCATTCCAG ATGCGCGATC ATTCAACGCG CCATTTATGG ACCACCTTTA GTGACGATCA AATTGACCTG AATTACCGTA GCCCTGAAGT GTTACTGGCG ATGGTGGATG TATTGCTGTG CTACCTGGAA AAGGGCGCAG AATATGTCCG TCTGGATGCC GTTGGCTTTA TGTGGAAAGA GCCGGGAACA AGCTGCATCC ATCTGGAAAA AACACATCTG ATTATCAAAC TGTTACGGTC GATTATTGAT GACGTAGCAC CAGGTACAGT GATCATTACC GAGACCAATG TTCCGCACAG AGACAACATT GCTTACTTTG GCAACGGTGA TGACGAAGCG CATATGGTGT ACCAGTTCTC GCTGCCGCCG CTGGTGCTGC ATGCGGTGCA AAAACAGAAC GTTGAGGCGC TTTGTGCGTG GGCGCAAAAC CTGACACTAC CTTCCAGCAA CACCACCTGG TTTAACTTCC TCGCCTCTCA CGATGGCATC GGGCTTAACC CACTGCGTGG CTTGCTACCC GAAAGCGAAA TATTAGCGCT GGTCGAGGCA TTACAGCAGG AAGGGGCATT AGTTAACTGG AAAAATAATC CCGACGGTAC GCGTAGCCCG TATGAAATGA ATGTCACTTA TATGGATGCG TTAAACCGCC GCGAGAGTAG CGATGAAGAA CGTTGCGCCA GGTTTATCCT TGCCCATGCG ATTTTGTTAA GTTTCCCCGG TGTGCCAGCG ATATATATTC AAAGTATTCT GGGCTCGCGT AATGATTACG CAGGTGTCGA AAAACTGGGA TATAACCGTG CGATTAACCG TAAAAAATAT TACAGTAAAG AGATCACGAC CGAACTGAAC AATAAAACGA CGTTAAGGCA CGCGGTATAT CATGAATTGT CGCGACTAAT TAAAATTCGT CGAAGTCATA ACGAGTTTCA TCCAGATAAT GATTTTACCA TCGACACGGT TAATTCATCC GTAATGTGTA TTCAAAGAAG CAACGCGGAT GGTAATTGTC TGACAGGATT GTTTAATGTC AGTGAAAATA TTCAGCATAT AAATATTACT GACCTGCACG GTCGGGATCT GATTAGTGAA GTTGATATAG TGGGTAATGA AATAACGCTG CGCCCCTGGC AGGTTATGTG GATTAAATAA
|
Protein sequence | MKQKITDYLD EIYGGTFTAT HLQKLVTRLE SAKRLITLRR KKHWDESDVV LITYADQFHS NDLKPLPTFN QFYPQWLQSI FSHVHLLPFY PWSSDDGFSV IDYQQVASEA GEWQDIQQLG ECSHLMFDFV CNHMSAKSEW FKNYLQQQPG FEDFFIAVDP QTDLSVVTRP RALPLLTPFQ MRDHSTRHLW TTFSDDQIDL NYRSPEVLLA MVDVLLCYLE KGAEYVRLDA VGFMWKEPGT SCIHLEKTHL IIKLLRSIID DVAPGTVIIT ETNVPHRDNI AYFGNGDDEA HMVYQFSLPP LVLHAVQKQN VEALCAWAQN LTLPSSNTTW FNFLASHDGI GLNPLRGLLP ESEILALVEA LQQEGALVNW KNNPDGTRSP YEMNVTYMDA LNRRESSDEE RCARFILAHA ILLSFPGVPA IYIQSILGSR NDYAGVEKLG YNRAINRKKY YSKEITTELN NKTTLRHAVY HELSRLIKIR RSHNEFHPDN DFTIDTVNSS VMCIQRSNAD GNCLTGLFNV SENIQHINIT DLHGRDLISE VDIVGNEITL RPWQVMWIK
|
| |