Gene Information |
Locus tag | EcSMS35_2778 |
Symbol | |
ID | 6143706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2860012 |
End bp | 2862264 |
Gene Length | 2253 bp |
Protein Length | 750 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641617647 |
Product | alpha amylase family protein |
Protein accession | YP_001744807 |
Protein GI | 170681220 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
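The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch, not the database's own method: the function names `gene_length` and `expected_protein_length` are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the gene span includes the terminal stop codon (the listed gene sequence ends in TAA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the one stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(2860012, 2862264)
print(length)                           # 2253
print(expected_protein_length(length))  # 750
```

Both values match the Gene Length (2253 bp) and Protein Length (750 aa) fields of this record.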
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.859165 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.73378 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTCTA TAAAACCAGG ACCAAGAAAT TTACCTATCG ACAACCCCAC ATTGTTATCA TGGAACATTA CTGACGGGGA TCTAAATTCC AAATTAAATA CATTAGAATA TCTAAACTGT ATAACAAATA TTATTAATGC TTGTGGAGTT TATCCTCAGG ATTTAAAAGA CAGAGAAATT ATATCAACTT TTCACGCAGA AAAAGTCATT AATGATCTGT TAAAAAACGA TTATAAAATT TCCCTTTCTC CAGATACAAC TTACCGAGAG CTGAATAAAG CTGCACAGCG TAGCATTACA GCGCCAGACA GGATAGGAGA AGGGAAAACA TGGGTTTATC AACGAGATAC AATGGTTGAA AGAGGTGATA ACAGCAGCGT TCATCAGTAT GGTCCAGCTG AACATTTCAC TCACATTATA TCTGACAAAC CATCCCCAAA AGATAAGTAT GTTGCATATG CTATTAACAT TCCTGACTAT GAGCTGGCAG CCGATGTATA TAATATTAAC GTGACGTCAC CTTCCGGACA GCAAGAAACA TTTAAAATAC TAATCAATCC AGAACATCTA CGGCAAACAC TTGAGCGTAA ATCTCTTACT GCTGTTCAGA AATCACAATG TGAAATCATC ACCCCCAAAA AACCTGGCGA AGCGATTCTT CATGCTTTTA ATGCCACCTA CCAGCAAATC AGGGAAAATA TGTCTGAGTT TGCACGTTCC CATTATGGGT ATATACAAAT TCCTCCAGTG ACAACTTTCC GCGCCGACGG ACCAGAAACT CCCGAAGAAG AAAAAGGTTA CTGGTTTCAC GCTTATCAAC CCGAAGATCT TTGTACCATC CATAATCCAA TGGGAGATTT GCAGGATTTT ATCGCATTGG TTAAAGATGC TAAAAAATTT GGTATCGATA TCATTCCTGA TTATACCTTT AACTTTATGG GAATTGGGGG TAGTGGTAAA AATGACCTGG ATTATCCCTC TGCTGATATA CGAGCGAAGA TCAGTAAAGA TATAGAAAGT GGTATCCCTG GCTATTGGCA AGGTCAGGTT TTGATTCCAT TTACTATAGA TCCAGTAACA AAAGAACGTA AACAAATCCA TCCAGAAGAT ATACATCTCA CTGCAAAAGA CTTTGAAACA AGTAAAGATA ACATCTCTAA GGATGAATGG GAAAACCTCC ATGCATTAAA AGAAAAGCGT TTAAATGGAA TGCCTAAAAC AACCCCCAAA AGTGACCAGG TTATTATGTT GCAAAATCAA TACGTTCGTG AAATGCGAAA ATATGGCGTA CGAGGTTTAC GTTACGATGC AGCAAAACAC TCAAAACATG AACAAATAGA AAGATCAATA ACTCCACCGC TTACAATTTA TAATGAGCGA TTACACAATA CTAACTTATT TAAGCCAATA TATCATGAAA AAGCCGTTAT GAATTACATG GAATACCTGG TAACTTGTCA GTTGGATGAA GAACAAATGT CATCGCTGCT TTATGAAAGA GATGATTTAA GCGCCATTGA TTTTTCATTG CTCATGAAGA CGATAAAAGC CTTTTCATTT GGTGGCGATC TCCAAACCCT TGCATCAAAA CCGGGTTCAA CAATCTCAAG CATCCCGTCA AAAAGACGGA TATTGATTAA CATTAACCAC GATTTTCCTA ACAATGGCAA TCTTTTCAAT GACTTTCTAT TTAACCATCA ACAAGATGAA CAATTAGCAA TGGCATATAT GGCCGCTCTC CCGTTCAGCA GGCCTTTAGT TTACTGGGAT GGCCAAGTAT TAAAATCAAC GACTGAAATT 
AAAAATTATG ATGGGTCGAC GCGTGTCGGC GGTGAGGCGT GGCTTAATAA AGGTTGCTCT ACCTATCAGC AGCTCTACAA TGAATTCCAC GCATTATATA TAGATAAAGC AGGAATATGG AGCGCATTTG AGGGTGTATT TGCAACTAAA AACGTTCTGG CCTTTAGTCG TGGGGATTCT GTGAACATTA ATCACTCTCC TCATGATGGA CTAGTTATAA TAAATAAAGG AAACGAAGAA GTTGAAGGTA CCTGGCCTAA CAAATTGCAA CCTGGAATAT ATAAAAACAT GGGGAGTAAT AGCGTTAACA TTATTATTAA TAATACCCGA AAAATTATCC CCCCTGGTAA AGCATTTATG CTTAGAGGCG GAACTCTAAA TATCAATATT CCTGGGCGTA GCGCTCTTCT TTTAGGGAAA ACTGGAGAAC CGCCGAACTA TCTCTATTTG TAA
|
Protein sequence | MFSIKPGPRN LPIDNPTLLS WNITDGDLNS KLNTLEYLNC ITNIINACGV YPQDLKDREI ISTFHAEKVI NDLLKNDYKI SLSPDTTYRE LNKAAQRSIT APDRIGEGKT WVYQRDTMVE RGDNSSVHQY GPAEHFTHII SDKPSPKDKY VAYAINIPDY ELAADVYNIN VTSPSGQQET FKILINPEHL RQTLERKSLT AVQKSQCEII TPKKPGEAIL HAFNATYQQI RENMSEFARS HYGYIQIPPV TTFRADGPET PEEEKGYWFH AYQPEDLCTI HNPMGDLQDF IALVKDAKKF GIDIIPDYTF NFMGIGGSGK NDLDYPSADI RAKISKDIES GIPGYWQGQV LIPFTIDPVT KERKQIHPED IHLTAKDFET SKDNISKDEW ENLHALKEKR LNGMPKTTPK SDQVIMLQNQ YVREMRKYGV RGLRYDAAKH SKHEQIERSI TPPLTIYNER LHNTNLFKPI YHEKAVMNYM EYLVTCQLDE EQMSSLLYER DDLSAIDFSL LMKTIKAFSF GGDLQTLASK PGSTISSIPS KRRILININH DFPNNGNLFN DFLFNHQQDE QLAMAYMAAL PFSRPLVYWD GQVLKSTTEI KNYDGSTRVG GEAWLNKGCS TYQQLYNEFH ALYIDKAGIW SAFEGVFATK NVLAFSRGDS VNINHSPHDG LVIINKGNEE VEGTWPNKLQ PGIYKNMGSN SVNIIINNTR KIIPPGKAFM LRGGTLNINI PGRSALLLGK TGEPPNYLYL
|
| |
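The sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be removed before any downstream use. The following sketch shows one way to do that and to compute a GC fraction from the cleaned string. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the snippet uses only the first three blocks of the gene sequence as sample input; it is an illustration, not the procedure used to produce the GC content field of this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display spacing and line breaks, and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base blocks of the gene sequence listed in this record.
prefix = clean_sequence("ATGTTTTCTA TAAAACCAGG ACCAAGAAAT")
print(len(prefix))  # 30
print(f"{gc_fraction(prefix):.2f}")
```

Applying `clean_sequence` to the full gene sequence should give a string whose length equals the Gene Length field, which is a quick way to confirm that no blocks were lost when copying.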