Gene EcHS_A2793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2793 
Symbol 
ID5595421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2803842 
End bp2806094 
Gene Length2253 bp 
Protein Length750 aa 
Translation table11 
GC content38% 
IMG OID640921909 
Productalpha amylase family protein 
Protein accessionYP_001459426 
Protein GI157162108 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.00275664 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTTCTA TAAAACCAGG ACCCAGAAAT TTACCTATCG ACAACCCCAC ATTGTTATCA 
TGGAACATTA CTGACGGGGA TCTAAATTCC AAATTAAATA CATTAGAATA TCTAAACTGT
ATAACAAACA TTATTAATTC TTGTGGAGTT TACCCTCAAG GATTAAAAGA CAGAGAAATT
ATATCAACTT TTCACGCAGA AAAAGTTATT AATGATCTGT TAAAAAACGA TTATAAAATT
TCCCTTTCTC CAGATACAAC TTATCGAGAG TTGAATAAAG CAGCACAGCG TAGCATTACA
GCGCCAGACA GGATAGGAGA AGGAAAAACA TGGGTTTATC AACGAGATAC AATAATTGAA
AGAGGTGATA ACAGCGGTGT TTATCAGTAT GGTCCTGCTG AACACTTCAC CCACATTATA
TCTGACAAAC CTTCCCCAAA AGATAAATAT GTTGCATATG CTATTAACAT TCCTGACTAT
GAGCTGGCAG CCGATGTATA TAATATTAAC GTGACGTCAC CTTCCGGACA GCAAGAAACA
TTTAAAATAT TAATCAATCC AGAACATCTA CGGCAAACAC TTGAGCGTAA ATCTCTTACT
GCTGTTCAGA AATCACAATG TGAAATCATC ACCCCCAAAA AACCTGGCGA AGCGATTCTT
CATGCTTTTA ATGCCACCTA CCAGCAGATC AGAGAAAATA TGTCTGAATT TGCACGTTGC
CATTATGGGT ATATACAAAT CCCTCCAGTG ACAACTTTCC GTGCCGACGG ACCAGAAACT
CCCGAAGAAG AAAAAGGTTA CTGGTTTCAC GCTTATCAAC CCGAAGATCT TTGTACCATC
CACAATCCAA TGGGAGATTT GCAGGATTTT ATCGCATTGG TTAAAGATGC TAAAAAATTT
GGTATCGATA TCATTCCTGA TTATACCTTT AACTTTATGG GAATCGGGGG TAGTGGTAAA
AATGACCTGG ATTATCCCTC TGCTGATATA CGAGCGAAGA TCAGTAAAGA TATAGAAAGT
GGTATCCCTG GCTATTGGCA AGGTCAGGTT TTGATTCCAT TTACTATAGA TCCAGTAACA
AAAGAACGTA AACAAATCCA TCCAGAAGAT ATACATCTCA CTGCAAAAGA CTTCGAAGCA
AGTAAAGATA ACATCTCTAA GGATGAATGG GAAAACCTCC ATGCATTAAA AGAAAAGCGT
TTAAATGGAA TGCCTAAAAC AACACCCAAA AGTGACCAGG TTATTATGTT GCAAAATCAA
TACGTTCGTG AAATGCGAAA ATATGGCGTA CGAGGTTTAC GTTATGATGC GGCAAAACAC
TCAAAACATG AACAAATAGA AAGATCAATA ACCCCACCGC TTAAAAATTA TAATGAGCGG
TTACACAATA CTAACTTATT TAACCCAAAA TATCATAAAA AAGCCGTTAT GAATTACATG
GAATATCTGG TAACTTGTCA GTTGGATGAA CAACAAATGT CATCACTGCT TTATGAAAGA
GATGATTTAA GCGCCATTGA TTTTTCATTG CTCATGAAAA CGATAAAAGC CTTTTCATTT
GGTGGAGATC TCCAAACCCT TGCATCAAAA CCGGGTTCCA CAATCTCAAG CATCCCATCA
AAAAGACGGA TATTGATTAA CATTAACCAC GATTTTCCTA ACAATGGCAA TCTTTTCAAT
GACTTTCTAT TTAACCATCA ACAAGATGAA CAATTAGCAA TGGCATATAT GGCCGCTCTC
CCGTTCAGCA GGCCTTTAGT TTACTGGGAT GGCCAAGTAT TAAAATCAAC GACTGAAATT
AAAAATTATG ATGGGTCGAC GCGTGTTGGC GATGAGGCGT GGCTTAATAA AGGTTGCTCT
ACCTATCAGC AGCTCTACAA TGAATTCCAC GCATTATATA TAGATAAAGC AGGAATATGG
AGCGCATTTG AGGGTGTATT TGCAACTAAA AACGTTCTGG CCTTTAGTCG TGGGGATTCT
GTGAACATTA ATCACTCTCC TCATGATGGA CTAGTTATAA TAAATAAAGG AAACGAAGAA
GTTGAAGGTA CCTGGCCTAA CAAATTGCAA CCTGGAATAT ACAAAAACAT GGGGAGTAAT
AGCGTTAACA TTATTATTAA TAATACCCGA AAAATTATCC CCCCTGGTAA AGTATTTACG
CTTAGAGGCG GAACTCTAAA TATCAATATT CCTGGGCGTA GCGCTCTTCT TTTAGGGAAA
ACAGGAGAAC CGCCGAACTA TCTCTATTTA TAA
 
Protein sequence
MFSIKPGPRN LPIDNPTLLS WNITDGDLNS KLNTLEYLNC ITNIINSCGV YPQGLKDREI 
ISTFHAEKVI NDLLKNDYKI SLSPDTTYRE LNKAAQRSIT APDRIGEGKT WVYQRDTIIE
RGDNSGVYQY GPAEHFTHII SDKPSPKDKY VAYAINIPDY ELAADVYNIN VTSPSGQQET
FKILINPEHL RQTLERKSLT AVQKSQCEII TPKKPGEAIL HAFNATYQQI RENMSEFARC
HYGYIQIPPV TTFRADGPET PEEEKGYWFH AYQPEDLCTI HNPMGDLQDF IALVKDAKKF
GIDIIPDYTF NFMGIGGSGK NDLDYPSADI RAKISKDIES GIPGYWQGQV LIPFTIDPVT
KERKQIHPED IHLTAKDFEA SKDNISKDEW ENLHALKEKR LNGMPKTTPK SDQVIMLQNQ
YVREMRKYGV RGLRYDAAKH SKHEQIERSI TPPLKNYNER LHNTNLFNPK YHKKAVMNYM
EYLVTCQLDE QQMSSLLYER DDLSAIDFSL LMKTIKAFSF GGDLQTLASK PGSTISSIPS
KRRILININH DFPNNGNLFN DFLFNHQQDE QLAMAYMAAL PFSRPLVYWD GQVLKSTTEI
KNYDGSTRVG DEAWLNKGCS TYQQLYNEFH ALYIDKAGIW SAFEGVFATK NVLAFSRGDS
VNINHSPHDG LVIINKGNEE VEGTWPNKLQ PGIYKNMGSN SVNIIINNTR KIIPPGKVFT
LRGGTLNINI PGRSALLLGK TGEPPNYLYL