Gene EcDH1_0791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0791 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp838662 
End bp840095 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content51% 
IMG OID 
Productglycoside hydrolase family 1 
Protein accessionACX38475 
Protein GI260448053 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAAAC TCACCTTACC GAAAGATTTC TTATGGGGCG GCGCAGTTGC CGCTCATCAG 
GTCGAAGGCG GCTGGAACAA AGGCGGAAAA GGGCCGAGCA TTTGTGACGT TCTGACCGGT
GGCGCACACG GCGTGCCGCG CGAAATCACC AAAGAAGTCT TGCCAGGAAA ATACTATCCA
AACCATGAAG CCGTTGATTT TTATGGTCAC TATAAGGAAG ACATCAAGCT ATTTGCCGAA
ATGGGCTTCA AATGTTTTCG TACATCCATT GCCTGGACGC GCATTTTTCC AAAAGGCGAT
GAAGCTCAGC CAAACGAAGA AGGGCTGAAG TTCTACGATG ATATGTTCGA TGAACTGCTG
AAATACAACA TCGAACCGGT GATCACCCTC TCCCACTTTG AAATGCCGCT GCATCTGGTG
CAGCAATACG GTAGCTGGAC CAACCGTAAA GTGGTTGATT TCTTTGTACG TTTCGCGGAA
GTGGTATTTG AACGCTATAA GCACAAAGTC AAATACTGGA TGACCTTCAA CGAAATTAAC
AACCAGCGTA ACTGGCGTGC ACCGCTGTTC GGTTACTGCT GCTCCGGCGT GGTGTATACC
GAGCATGAAA ACCCGGAAGA GACGATGTAT CAGGTGCTGC ATCACCAGTT TGTCGCCAGC
GCCCTGGCGG TGAAAGCTGC GCGTCGCATT AACCCGGAGA TGAAAGTCGG CTGTATGCTG
GCGATGGTGC CGCTCTATCC TTACTCCTGT AACCCGGACG ATGTGATGTT CGCTCAGGAG
TCGATGCGCG AACGCTACGT CTTTACCGAT GTGCAGCTAC GCGGCTATTA CCCGTCCTAT
GTGTTGAACG AGTGGGAGCG TCGCGGATTT AACATCAAAA TGGAAGACGG CGATCTGGAT
GTGCTGCGTG AAGGCACCTG CGATTATCTT GGTTTCAGCT ATTACATGAC CAATGCAGTG
AAGGCCGAAG GCGGCACCGG CGATGCGATC TCTGGTTTTG AAGGCAGCGT ACCAAACCCG
TATGTTAAAG CATCTGACTG GGGCTGGCAG ATTGATCCAG TAGGTCTGCG CTATGCACTT
TGCGAACTGT ATGAGCGTTA TCAGAGGCCG CTGTTTATTG TCGAAAACGG TTTTGGCGCT
TACGACAAAG TGGAAGAAGA TGGCAGCATC AACGACGACT ACCGCATTGA CTACCTGCGC
GCCCATATCG AAGAGATGAA AAAAGCGGTG ACTTACGATG GCGTGGATCT GATGGGCTAC
ACACCGTGGG GCTGCATCGA CTGCGTGTCG TTCACCACCG GGCAGTACAG CAAACGCTAC
GGCTTTATCT ATGTGAATAA ACATGACGAC GGTACTGGCG ATATGTCGCG TTCACGTAAG
AAGAGCTTTA ACTGGTACAA AGAGGTGATT GCCAGCAACG GCGAGAAGCT TTAA
 
Protein sequence
MKKLTLPKDF LWGGAVAAHQ VEGGWNKGGK GPSICDVLTG GAHGVPREIT KEVLPGKYYP 
NHEAVDFYGH YKEDIKLFAE MGFKCFRTSI AWTRIFPKGD EAQPNEEGLK FYDDMFDELL
KYNIEPVITL SHFEMPLHLV QQYGSWTNRK VVDFFVRFAE VVFERYKHKV KYWMTFNEIN
NQRNWRAPLF GYCCSGVVYT EHENPEETMY QVLHHQFVAS ALAVKAARRI NPEMKVGCML
AMVPLYPYSC NPDDVMFAQE SMRERYVFTD VQLRGYYPSY VLNEWERRGF NIKMEDGDLD
VLREGTCDYL GFSYYMTNAV KAEGGTGDAI SGFEGSVPNP YVKASDWGWQ IDPVGLRYAL
CELYERYQRP LFIVENGFGA YDKVEEDGSI NDDYRIDYLR AHIEEMKKAV TYDGVDLMGY
TPWGCIDCVS FTTGQYSKRY GFIYVNKHDD GTGDMSRSRK KSFNWYKEVI ASNGEKL