Gene EcDH1_3755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3755 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4049096 
End bp4050751 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content53% 
IMG OID 
Productalpha,alpha-phosphotrehalase 
Protein accessionACX41360 
Protein GI260450938 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCATC TTCCCCACTG GTGGCAAAAC GGCGTTATCT ACCAGATTTA TCCAAAGAGT 
TTTCAGGACA CCACGGGTAG CGGTACCGGC GATTTACGTG GCGTTATCCA ACACCTGGAC
TATCTGCATA AACTGGGCGT TGATGCCATC TGGCTAACCC CCTTTTATGT CTCTCCCCAG
GTCGATAACG GTTACGACGT AGCGAACTAT ACGGCGATTG ATCCCACCTA CGGCACGCTG
GACGATTTTG ACGAACTGGT GACGCAGGCA AAATCGCGCG GGATTCGTAT CATTCTCGAT
ATGGTGTTTA ACCATACCTC TACCCAACAT GCCTGGTTTC GCGAGGCGCT GAACAAAGAA
AGCCCTTACC GCCAGTTTTA TATCTGGCGC GATGGAGAAC CAGAAACGCC ACCGAACAAC
TGGCGTTCAA AATTTGGCGG TAGTGCGTGG CGCTGGCATG CGGAAAGCGA ACAGTACTAT
TTGCATCTCT TTGCACCAGA ACAGGCGGAT CTCAACTGGG AGAATCCAGC GGTACGCGCA
GAGCTGAAAA AAGTCTGTGA GTTCTGGGCC GATCGTGGGG TCGACGGGTT GCGCCTGGAT
GTGGTGAATC TGATCTCCAA AGACCCGCGT TTCCCTGAAG ACCTGGACGG CGACGGGCGT
CGCTTCTACA CCGACGGGCC ACGAGCACAC GAGTTTTTGC ACGAGATGAA CCGCGATGTG
TTTACGCCAC GCGGGTTAAT GACCGTAGGT GAAATGTCCT CCACCAGCCT TGAGCATTGC
CAGCGATACG CGGCTCTGAC AGGCAGTGAA TTGTCGATGA CCTTTAATTT TCATCACCTG
AAGGTCGATT ATCCCGGTGG TGAAAAATGG ACGCTGGCTA AACCTGACTT TGTGGCGTTG
AAAACATTGT TCCGCCACTG GCAACAAGGA ATGCACAACG TAGCATGGAA TGCCTTGTTC
TGGTGTAACC ACGATCAGCC GCGCATTGTT TCTCGCTTTG GTGATGAAGG TGAATACCGC
GTGCCTGCGG CAAAAATGCT GGCGATGGTG CTGCATGGCA TGCAGGGAAC GCCGTATATC
TACCAGGGCG AAGAGATTGG CATGACCAAC CCGCATTTCA CGCGCATTAC TGACTATCGC
GACGTAGAGA GCCTCAATAT GTTTGCCGAG CTGCGCAACG ATGGGCGTGA TGCCGACGAG
TTATTGGCAA TCCTCGCCAG TAAATCCCGT GACAACAGTC GCACGCCCAT GCAATGGAGC
AACGGCGATA ATGCCGGGTT TACGGCTGGC GAACCGTGGA TTGGCCTGGG CGATAACTAT
CAACAAATCA ACGTAGAAGC CGCGCTGGCC GATGATTCCT CGGTGTTTTA CACCTACCAA
AAGTTAATCG CACTGCGTAA GCAGGAAGCC ATCCTGACAT GGGGCAATTA CCAGGATCTG
CTGCCAAACA GCCCTGTATT GTGGTGCTAT CGCCGTGAAT GGAAGGGGCA AACCTTGCTG
GTCATTGCCA ACCTTAGCCG TGAGATCCAA CCCTGGCAGG CAGGGCAAAT GCGCGGCAAC
TGGCAGCTTG TGATGCATAA CTACGAAGAA GCCTCACCAC AACCCTGTGC CATGAATTTA
CGGCCTTTTG AGGCTGTCTG GTGGTTACAG AAGTAA
 
Protein sequence
MTHLPHWWQN GVIYQIYPKS FQDTTGSGTG DLRGVIQHLD YLHKLGVDAI WLTPFYVSPQ 
VDNGYDVANY TAIDPTYGTL DDFDELVTQA KSRGIRIILD MVFNHTSTQH AWFREALNKE
SPYRQFYIWR DGEPETPPNN WRSKFGGSAW RWHAESEQYY LHLFAPEQAD LNWENPAVRA
ELKKVCEFWA DRGVDGLRLD VVNLISKDPR FPEDLDGDGR RFYTDGPRAH EFLHEMNRDV
FTPRGLMTVG EMSSTSLEHC QRYAALTGSE LSMTFNFHHL KVDYPGGEKW TLAKPDFVAL
KTLFRHWQQG MHNVAWNALF WCNHDQPRIV SRFGDEGEYR VPAAKMLAMV LHGMQGTPYI
YQGEEIGMTN PHFTRITDYR DVESLNMFAE LRNDGRDADE LLAILASKSR DNSRTPMQWS
NGDNAGFTAG EPWIGLGDNY QQINVEAALA DDSSVFYTYQ KLIALRKQEA ILTWGNYQDL
LPNSPVLWCY RREWKGQTLL VIANLSREIQ PWQAGQMRGN WQLVMHNYEE ASPQPCAMNL
RPFEAVWWLQ K