Gene EcSMS35_2778 details


Gene Information

Locus tag: EcSMS35_2778
Symbol:
ID: 6143706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2860012
End bp: 2862264
Gene Length: 2253 bp
Protein Length: 750 aa
Translation table: 11
GC content: 38%
IMG OID: 641617647
Product: alpha amylase family protein
Protein accession: YP_001744807
Protein GI: 170681220
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
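
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(2860012, 2862264)
print(length, protein_length_aa(length))  # 2253 750
```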


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.859165
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.73378
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTTCTA TAAAACCAGG ACCAAGAAAT TTACCTATCG ACAACCCCAC ATTGTTATCA 
TGGAACATTA CTGACGGGGA TCTAAATTCC AAATTAAATA CATTAGAATA TCTAAACTGT
ATAACAAATA TTATTAATGC TTGTGGAGTT TATCCTCAGG ATTTAAAAGA CAGAGAAATT
ATATCAACTT TTCACGCAGA AAAAGTCATT AATGATCTGT TAAAAAACGA TTATAAAATT
TCCCTTTCTC CAGATACAAC TTACCGAGAG CTGAATAAAG CTGCACAGCG TAGCATTACA
GCGCCAGACA GGATAGGAGA AGGGAAAACA TGGGTTTATC AACGAGATAC AATGGTTGAA
AGAGGTGATA ACAGCAGCGT TCATCAGTAT GGTCCAGCTG AACATTTCAC TCACATTATA
TCTGACAAAC CATCCCCAAA AGATAAGTAT GTTGCATATG CTATTAACAT TCCTGACTAT
GAGCTGGCAG CCGATGTATA TAATATTAAC GTGACGTCAC CTTCCGGACA GCAAGAAACA
TTTAAAATAC TAATCAATCC AGAACATCTA CGGCAAACAC TTGAGCGTAA ATCTCTTACT
GCTGTTCAGA AATCACAATG TGAAATCATC ACCCCCAAAA AACCTGGCGA AGCGATTCTT
CATGCTTTTA ATGCCACCTA CCAGCAAATC AGGGAAAATA TGTCTGAGTT TGCACGTTCC
CATTATGGGT ATATACAAAT TCCTCCAGTG ACAACTTTCC GCGCCGACGG ACCAGAAACT
CCCGAAGAAG AAAAAGGTTA CTGGTTTCAC GCTTATCAAC CCGAAGATCT TTGTACCATC
CATAATCCAA TGGGAGATTT GCAGGATTTT ATCGCATTGG TTAAAGATGC TAAAAAATTT
GGTATCGATA TCATTCCTGA TTATACCTTT AACTTTATGG GAATTGGGGG TAGTGGTAAA
AATGACCTGG ATTATCCCTC TGCTGATATA CGAGCGAAGA TCAGTAAAGA TATAGAAAGT
GGTATCCCTG GCTATTGGCA AGGTCAGGTT TTGATTCCAT TTACTATAGA TCCAGTAACA
AAAGAACGTA AACAAATCCA TCCAGAAGAT ATACATCTCA CTGCAAAAGA CTTTGAAACA
AGTAAAGATA ACATCTCTAA GGATGAATGG GAAAACCTCC ATGCATTAAA AGAAAAGCGT
TTAAATGGAA TGCCTAAAAC AACCCCCAAA AGTGACCAGG TTATTATGTT GCAAAATCAA
TACGTTCGTG AAATGCGAAA ATATGGCGTA CGAGGTTTAC GTTACGATGC AGCAAAACAC
TCAAAACATG AACAAATAGA AAGATCAATA ACTCCACCGC TTACAATTTA TAATGAGCGA
TTACACAATA CTAACTTATT TAAGCCAATA TATCATGAAA AAGCCGTTAT GAATTACATG
GAATACCTGG TAACTTGTCA GTTGGATGAA GAACAAATGT CATCGCTGCT TTATGAAAGA
GATGATTTAA GCGCCATTGA TTTTTCATTG CTCATGAAGA CGATAAAAGC CTTTTCATTT
GGTGGCGATC TCCAAACCCT TGCATCAAAA CCGGGTTCAA CAATCTCAAG CATCCCGTCA
AAAAGACGGA TATTGATTAA CATTAACCAC GATTTTCCTA ACAATGGCAA TCTTTTCAAT
GACTTTCTAT TTAACCATCA ACAAGATGAA CAATTAGCAA TGGCATATAT GGCCGCTCTC
CCGTTCAGCA GGCCTTTAGT TTACTGGGAT GGCCAAGTAT TAAAATCAAC GACTGAAATT
AAAAATTATG ATGGGTCGAC GCGTGTCGGC GGTGAGGCGT GGCTTAATAA AGGTTGCTCT
ACCTATCAGC AGCTCTACAA TGAATTCCAC GCATTATATA TAGATAAAGC AGGAATATGG
AGCGCATTTG AGGGTGTATT TGCAACTAAA AACGTTCTGG CCTTTAGTCG TGGGGATTCT
GTGAACATTA ATCACTCTCC TCATGATGGA CTAGTTATAA TAAATAAAGG AAACGAAGAA
GTTGAAGGTA CCTGGCCTAA CAAATTGCAA CCTGGAATAT ATAAAAACAT GGGGAGTAAT
AGCGTTAACA TTATTATTAA TAATACCCGA AAAATTATCC CCCCTGGTAA AGCATTTATG
CTTAGAGGCG GAACTCTAAA TATCAATATT CCTGGGCGTA GCGCTCTTCT TTTAGGGAAA
ACTGGAGAAC CGCCGAACTA TCTCTATTTG TAA
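
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any computation. A minimal sketch of doing that and computing GC content follows; the helper names are hypothetical, and only the first displayed row is used here, so the printed percentage describes that row rather than the whole gene.

```python
def normalize(seq_text: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks of 10."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already normalized sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First displayed row of the gene sequence only.
first_row = "ATGTTTTCTA TAAAACCAGG ACCAAGAAAT TTACCTATCG ACAACCCCAC ATTGTTATCA"
seq = normalize(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence text to `normalize` and then `gc_percent` gives a figure that can be compared with the GC content field of the record.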
 
Protein sequence
MFSIKPGPRN LPIDNPTLLS WNITDGDLNS KLNTLEYLNC ITNIINACGV YPQDLKDREI 
ISTFHAEKVI NDLLKNDYKI SLSPDTTYRE LNKAAQRSIT APDRIGEGKT WVYQRDTMVE
RGDNSSVHQY GPAEHFTHII SDKPSPKDKY VAYAINIPDY ELAADVYNIN VTSPSGQQET
FKILINPEHL RQTLERKSLT AVQKSQCEII TPKKPGEAIL HAFNATYQQI RENMSEFARS
HYGYIQIPPV TTFRADGPET PEEEKGYWFH AYQPEDLCTI HNPMGDLQDF IALVKDAKKF
GIDIIPDYTF NFMGIGGSGK NDLDYPSADI RAKISKDIES GIPGYWQGQV LIPFTIDPVT
KERKQIHPED IHLTAKDFET SKDNISKDEW ENLHALKEKR LNGMPKTTPK SDQVIMLQNQ
YVREMRKYGV RGLRYDAAKH SKHEQIERSI TPPLTIYNER LHNTNLFKPI YHEKAVMNYM
EYLVTCQLDE EQMSSLLYER DDLSAIDFSL LMKTIKAFSF GGDLQTLASK PGSTISSIPS
KRRILININH DFPNNGNLFN DFLFNHQQDE QLAMAYMAAL PFSRPLVYWD GQVLKSTTEI
KNYDGSTRVG GEAWLNKGCS TYQQLYNEFH ALYIDKAGIW SAFEGVFATK NVLAFSRGDS
VNINHSPHDG LVIINKGNEE VEGTWPNKLQ PGIYKNMGSN SVNIIINNTR KIIPPGKAFM
LRGGTLNINI PGRSALLLGK TGEPPNYLYL