Gene EcDH1_1908 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1908 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2059978 
End bp2061330 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content47% 
IMG OID 
Productglycoside hydrolase family 4 
Protein accessionACX39566 
Protein GI260449144 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.12949 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGA AATTAAAAGT CGTCACTATT GGTGGCGGGA GCAGCTATAC CCCGGAGTTA 
CTGGAAGGAT TTATTAAGCG TTATCACGAA TTGCCGGTCA GCGAATTATG GCTGGTGGAT
GTCGAAGGTG GTAAACCGAA ACTGGATATT ATTTTCGATC TCTGCCAACG GATGATTGAT
AACGCTGGCG TCCCGATGAA GCTTTATAAA ACGCTGGATC GCCGCGAAGC ATTGAAAGAT
GCTGATTTCG TTACTACCCA ACTGCGCGTT GGCCAATTAC CGGCGCGTGA ACTGGATGAA
CGTATTCCAT TAAGTCATGG TTATCTTGGT CAGGAAACCA ACGGCGCGGG CGGTTTGTTT
AAAGGTCTGC GTACCATTCC GGTGATTTTT GACATCGTAA AAGATGTCGA AGAACTTTGT
CCGAATGCAT GGGTGATTAA CTTCACTAAC CCGGCGGGAA TGGTCACTGA AGCCGTTTAT
CGTCATACCG GATTTAAACG CTTTATCGGC GTGTGTAATA TTCCGATCGG CATGAAGATG
TTTATTCGCG ATGTTCTGAT GCTGAAAGAC AGCGATGATT TATCTATCGA TTTGTTCGGC
CTCAACCATA TGGTGTTCAT TAAGGATGTG CTGATAAATG GCAAGTCGCG CTTTGCCGAA
TTGCTTGATG GTGTGGCGTC AGGGCAGTTA AAAGCGTCCT CTGTAAAAAA TATTTTCGAT
CTGCCATTTA GTGAGGGCTT AATTCGTTCG TTGAATCTGC TGCCATGTTC TTATCTGCTG
TATTACTTCA AGCAGAAAGA GATGCTGGCT ATTGAAATGG GCGAATACTA CAAAGGCGGC
GCACGAGCGC AGGTAGTACA GAAAGTCGAG AAACAACTTT TTGAGCTGTA TAAAAATCCT
GAGCTGAAAG TTAAGCCGAA AGAACTGGAA CAGCGCGGTG GGGCTTATTA CTCTGATGCA
GCATGCGAAG TGATCAACGC TATCTACAAC GACAAGCAAG CAGAACATTA CGTTAATATC
CCGCATCATG GGCAGATTGA TAATATTCCG GCAGACTGGG CAGTAGAAAT GACCTGTAAG
CTGGGGCGCG ATGGCGCGAC GCCACATCCG CGCATTACGC ATTTCGATGA TAAAGTAATG
GGGCTGATTC ACACCATTAA AGGCTTCGAG ATTGCTGCCA GTAACGCCGC ACTTAGCGGA
GAATTTAACG ATGTGTTACT GGCGCTAAAC CTTAGTCCGT TGGTGCATTC CGATCGCGAT
GCTGAGCTGC TGGCACGCGA GATGATTCTG GCGCACGAGA AATGGCTGCC AAACTTTGCC
GACTGCATCG CAGAGCTTAA AAAAGCACAT TAA
 
Protein sequence
MSQKLKVVTI GGGSSYTPEL LEGFIKRYHE LPVSELWLVD VEGGKPKLDI IFDLCQRMID 
NAGVPMKLYK TLDRREALKD ADFVTTQLRV GQLPARELDE RIPLSHGYLG QETNGAGGLF
KGLRTIPVIF DIVKDVEELC PNAWVINFTN PAGMVTEAVY RHTGFKRFIG VCNIPIGMKM
FIRDVLMLKD SDDLSIDLFG LNHMVFIKDV LINGKSRFAE LLDGVASGQL KASSVKNIFD
LPFSEGLIRS LNLLPCSYLL YYFKQKEMLA IEMGEYYKGG ARAQVVQKVE KQLFELYKNP
ELKVKPKELE QRGGAYYSDA ACEVINAIYN DKQAEHYVNI PHHGQIDNIP ADWAVEMTCK
LGRDGATPHP RITHFDDKVM GLIHTIKGFE IAASNAALSG EFNDVLLALN LSPLVHSDRD
AELLAREMIL AHEKWLPNFA DCIAELKKAH