Gene EcolC_1048 details


Gene Information

Locus tag: EcolC_1048
Symbol:
ID: 6066394
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1135932
End bp: 1137773
Gene Length: 1842 bp
Protein Length: 613 aa
Translation table: 11
GC content: 37%
IMG OID: 641600461
Product: alpha amylase catalytic region
Protein accession: YP_001724044
Protein GI: 170019090
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
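
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of this record or any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 1135932, End bp 1137773
length = gene_length_bp(1135932, 1137773)
print(length)                     # 1842, matching "Gene Length"
print(protein_length_aa(length))  # 613, matching "Protein Length"
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.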


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTTCTA TAAAACCAGG ACCCAGAAAT TTACCTATCG ACAACCCCAC ATTGTTATCA 
TGGAACATTA CTGACGGGGA TCTAAATTCC AAATTAAATA CATTAGAATA TCTAAACTGT
ATAACAAATA TTATTAATTC TTGTGGAGTT TACCCTCAAG GATTAAAAGA CAGAGAAATT
ATATCAACTT TTCACGCAGA AAAAGTTATT AATGATCTGT TAAAAAACGA TTATAAAATT
TCCCTTTCTC CAGATACAAC TTATCGAGAG TTGAATAAAG CAGCACAGCG TAGCATTACA
GCGCCAGACA GGATAGGAGA AGGAAAAACA TGGGTTTATC AACGAGATAC AATGATTGAA
AGAGGTGATA ACAGCGGTGT TTATCAGTAT GGTCCTGCTG AACACTTCAC CCACATTATA
TCTGACAAAC CTTCCCCAAA AGATAAATAT GTTGCATATG CTATTAACAT TCCTGACTAT
GAGCTGGCAG CCGATGTATA TAATATTAAC GTGACGTCAC CTTCCGGACA GCAAGAAACA
TTTAAAATAT TAATCAATCC AGAACATCTA CGGCAAACAC TTGAGCGTAA ATCTCTTACT
GCTGTTCAGA AATCACAATG TGAAATCATC ACCCCCAAAA AACCTGGCGA AGCGATTCTT
CATGCTTTTA ATGCCACCTA CCAGCAGATC AGAGAAAATA TGTCTGAATT TGCACGTTGC
CATTATGGGT ATATACAAAT CCCTCCAGTG ACAACTTTCC GTGCCGACGG ACCAGAAACT
CCCGAAGAAG AAAAAGGTTA CTGGTTTCAC GCTTATCAAC CCGAAGATCT TTGTACCATC
CACAATCCAA TGGGAGATTT GCAGGATTTT ATCGCATTGG TTAAAGATGC TAAAAAATTT
GGTATCGATA TCATTCCTGA TTATACCTTT AACTTTATGG GAATCGGGGG TAGTGGTAAA
AATGACCTGG ATTATCCCTC TGCTGATATA CGAGCGAAGA TCAGTAAAGA TATAGAAAGT
GGTATCCCTG GCTATTGGCA AGGTCAGGTT TTGATTCCAT TTACTATAGA TCCAGTAACA
AAAGAACGTA AACAAATCCA TCCAGAAGAT ATACATCTCA CTGCAAAAGA CTTCGAAGCA
AGTAAAGATA ACATCTCTAA GGATGAATGG GAAAACCTCC ATGCATTAAA AGAAAAGCGT
TTAAATGGAA TGCCTAAAAC AACACCCAAA AGTGACCAGG TTATTATGTT GCAAAATCAA
TACGTTCGTG AAATGCGAAA ATATGGCGTA CGAGGTTTAC GTTATGATGC GGCAAAACAC
TCAAAACATG AACAAATAGA AAGATCAATA ACCCCACCGC TTAAAAATTA TAATGAGCGG
TTACACAATA CTAACTTATT TAACCCAAAA TATCATAAAA AAGCCGTTAT GAATTACATG
GAATATCTGG TAACTTGTCA GTTGGATGAA CAACAAATGT CATCACTGCT TTATGAAAGA
GATGATTTAA GCGCCATTGA TTTTTCATTG CTCATGAAAA CGATAAAAGC CTTTTCATTT
GGTGGAGATC TCCAAACCCT TGCATCAAAA CCGGGTTCCA CAATCTCAAG TATCCCATCA
GAAAGACGGA TATTGATTAA CATTAACCAC GATTTTCCTA ACAATGGTAA TCTTTTCAAT
GACTTTCTAT TTAACCATCA ACAAGATGAA CAATTAGCAA TGGCATATAT AGCCGCTCTG
CCGTTCAGCA GGCCTTTAGT TTACTGGGAT GGCCAAGTAT TAAAATCAAC GACTGAAATT
AAAAATTATG ATGGGTCCAC GCGTGTCGGC GGTGAGGCGT AG
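
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before any downstream use. The following is a minimal sketch of how the blocks might be merged and a GC fraction computed for comparison with the "GC content" field; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sketch assumes the sequence contains only the letters A, C, G, and T.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumes only A/C/G/T are present)."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as it appears in this record
first_line = (
    "ATGTTTTCTA TAAAACCAGG ACCCAGAAAT "
    "TTACCTATCG ACAACCCCAC ATTGTTATCA"
)
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.2%}")
```

Passing the full sequence text through `clean_sequence` in the same way should give a string whose length can be compared with the Gene Length field.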
 
Protein sequence
MFSIKPGPRN LPIDNPTLLS WNITDGDLNS KLNTLEYLNC ITNIINSCGV YPQGLKDREI 
ISTFHAEKVI NDLLKNDYKI SLSPDTTYRE LNKAAQRSIT APDRIGEGKT WVYQRDTMIE
RGDNSGVYQY GPAEHFTHII SDKPSPKDKY VAYAINIPDY ELAADVYNIN VTSPSGQQET
FKILINPEHL RQTLERKSLT AVQKSQCEII TPKKPGEAIL HAFNATYQQI RENMSEFARC
HYGYIQIPPV TTFRADGPET PEEEKGYWFH AYQPEDLCTI HNPMGDLQDF IALVKDAKKF
GIDIIPDYTF NFMGIGGSGK NDLDYPSADI RAKISKDIES GIPGYWQGQV LIPFTIDPVT
KERKQIHPED IHLTAKDFEA SKDNISKDEW ENLHALKEKR LNGMPKTTPK SDQVIMLQNQ
YVREMRKYGV RGLRYDAAKH SKHEQIERSI TPPLKNYNER LHNTNLFNPK YHKKAVMNYM
EYLVTCQLDE QQMSSLLYER DDLSAIDFSL LMKTIKAFSF GGDLQTLASK PGSTISSIPS
ERRILININH DFPNNGNLFN DFLFNHQQDE QLAMAYIAAL PFSRPLVYWD GQVLKSTTEI
KNYDGSTRVG GEA