Gene EcDH1_0175 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0175 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp183870 
End bp185549 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content53% 
IMG OID 
Productcellulose synthase operon protein YhjU 
Protein accessionACX37869 
Protein GI260447447 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCAAT TTACGCAAAA TACCGCCATG CCTTCTTCCC TCTGGCAATA CTGGCGCGGC 
CTTTCCGGCT GGAACTTCTA TTTTCTGGTT AAGTTCGGCC TGTTGTGGGC GGGATATCTT
AACTTCCATC CGCTCCTCAA TTTGGTGTTT GCCGCGTTTC TGCTGATGCC CCTTCCGCGC
TACAGCCTGC ATCGCTTGCG CCACTGGATT GCCCTGCCGA TCGGCTTTGC TTTGTTCTGG
CATGACACCT GGTTGCCTGG CCCGGAAAGC ATAATGAGCC AGGGTTCGCA GGTGGCGGGG
TTCAGTACCG ATTATTTAAT CGACCTTGTC ACACGCTTTA TTAACTGGCA GATGATTGGG
GCCATTTTTG TTTTATTAGT GGCCTGGTTA TTCCTGTCAC AATGGATTCG CATTACCGTT
TTTGTGGTTG CCATACTGCT ATGGCTGAAC GTACTTACCC TGGCGGGACC AAGTTTCTCC
TTGTGGCCAG CCGGACAACC GACGACCACT GTAACAACGA CGGGTGGTAA CGCAGCGGCA
ACCGTTGCGG CGACGGGTGG CGCACCGGTA GTGGGTGATA TGCCCGCACA AACTGCACCG
CCAACAACGG CGAACCTTAA CGCCTGGCTG AATAATTTCT ATAACGCGGA GGCGAAACGT
AAATCGACCT TCCCGTCTTC GCTGCCCGCT GATGCTCAGC CATTTGAACT ACTGGTGATT
AACATCTGTT CGCTTTCCTG GTCGGATATA GAAGCCGCCG GGTTGATGTC GCATCCACTG
TGGTCGCATT TCGATATTGA GTTCAAGAAC TTTAACTCCG CCACCTCCTA CAGTGGCCCG
GCGGCGATCC GTTTACTGCG CGCCAGCTGC GGGCAGACTT CGCACACTAA TCTGTATCAA
CCGGCAAATA ACGACTGCTA TCTGTTTGAT AACCTTTCGA AACTGGGCTT TACCCAGCAC
CTGATGATGG GACATAACGG CCAGTTCGGC GGTTTTTTGA AAGAAGTTCG CGAAAATGGC
GGCATGCAGA GCGAATTGAT GGATCAAACA AATCTGCCGG TTATTTTGCT GGGCTTTGAT
GGTTCGCCGG TTTATGACGA TACCGCTGTG CTTAACCGCT GGCTGGACGT TACCGAAAAA
GATAAAAACA GCCGTAGTGC CACGTTCTAC AACACGCTTC CACTGCATGA CGGCAACCAT
TATCCGGGGG TCAGCAAAAC AGCGGATTAC AAAGCGCGGG CGCAGAAATT CTTTGATGAA
CTGGACGCCT TCTTTACTGA ACTTGAGAAA TCGGGTCGTA AAGTGATGGT GGTCGTGGTG
CCGGAACACG GCGGCGCGCT GAAGGGCGAC AGAATGCAGG TATCTGGCCT ACGTGATATC
CCTAGCCCGT CTATCACCGA CGTCCCCGTT GGGGTGAAAT TCTTCGGCAT GAAGGCACCG
CATCAGGGGG CACCGATTGT CATCGAACAA CCGAGCAGCT TCCTGGCTAT CTCCGATCTG
GTGGTTCGCG TTCTCGATGG CAAGATTTTC ACCGAAGACA ATGTTGACTG GAAAAAACTC
ACCAGTGGGT TGCCACAAAC AGCACCGGTC TCCGAGAACT CAAATGCAGT AGTTATTCAA
TACCAGGATA AACCGTACGT TCGCCTGAAC GGCGGCGACT GGGTGCCTTA CCCGCAGTAA
 
Protein sequence
MTQFTQNTAM PSSLWQYWRG LSGWNFYFLV KFGLLWAGYL NFHPLLNLVF AAFLLMPLPR 
YSLHRLRHWI ALPIGFALFW HDTWLPGPES IMSQGSQVAG FSTDYLIDLV TRFINWQMIG
AIFVLLVAWL FLSQWIRITV FVVAILLWLN VLTLAGPSFS LWPAGQPTTT VTTTGGNAAA
TVAATGGAPV VGDMPAQTAP PTTANLNAWL NNFYNAEAKR KSTFPSSLPA DAQPFELLVI
NICSLSWSDI EAAGLMSHPL WSHFDIEFKN FNSATSYSGP AAIRLLRASC GQTSHTNLYQ
PANNDCYLFD NLSKLGFTQH LMMGHNGQFG GFLKEVRENG GMQSELMDQT NLPVILLGFD
GSPVYDDTAV LNRWLDVTEK DKNSRSATFY NTLPLHDGNH YPGVSKTADY KARAQKFFDE
LDAFFTELEK SGRKVMVVVV PEHGGALKGD RMQVSGLRDI PSPSITDVPV GVKFFGMKAP
HQGAPIVIEQ PSSFLAISDL VVRVLDGKIF TEDNVDWKKL TSGLPQTAPV SENSNAVVIQ
YQDKPYVRLN GGDWVPYPQ