Gene EcDH1_3878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3878 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4178961 
End bp4180634 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content49% 
IMG OID 
Productsulfatase 
Protein accessionACX41479 
Protein GI260451057 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCACTT TGTTCGATGG AAACACCGTG ATGTTGAAGC GCCTACTAAA AAGACCCTCT 
TTGAATTTAC TCGCCTGGCT ATTGTTGGCC GCTTTTTATA TCTCTATCTG CCTGAATATT
GCCTTTTTTA AACAGGTGTT GCAGGCGCTG CCGCTGGATT CGCTGCATAA CGTACTGGTT
TTCTTGTCGA TGCCGGTCGT CGCTTTCAGC GTGATTAATA TTGTCCTGAC ACTAAGCTCT
TTCTTATGGC TTAATCGACC ACTGGCCTGC CTGTTTATTC TGGTTGGCGC GGCTGCACAA
TATTTCATAA TGACTTACGG CATCGTCATC GACCGCTCGA TGATTGCCAA TATTATTGAT
ACCACTCCGG CAGAAAGTTA TGCGCTGATG ACACCGCAAA TGTTATTAAC GCTGGGATTC
AGCGGCGTGC TTGCTGCGCT GATTGCCTGC TGGATAAAAA TCAAACCTGC CACCTCGCGT
CTGCGCAGTG TTCTTTTCCG TGGAGCCAAT ATTCTGGTTT CTGTACTACT GATTTTGCTG
GTCGCCGCAC TGTTTTATAA AGACTACGCC TCGTTGTTCC GCAATAACAA AGAGCTGGTG
AAATCCTTAA GCCCCTCTAA CAGCATTGTT GCCAGCTGGT CATGGTACTC CCATCAGCGA
CTGGCAAATC TGCCGCTGGT GCGAATTGGT GAAGACGCGC ACCGCAACCC GTTAATGCAG
AACGAAAAAC GTAAAAATTT GACCATCCTG ATTGTCGGCG AAACCTCGCG GGCGGAGAAC
TTCTCCCTCA ACGGCTACCC GCGTGAAACT AACCCGCGGC TGGCGAAAGA TAACGTGGTC
TATTTCCCTA ATACCGCATC TTGCGGCACG GCAACGGCAG TTTCAGTACC GTGCATGTTC
TCGGATATGC CGCGTGAGCA CTACAAAGAA GAGCTGGCAC AGCACCAGGA AGGCGTGCTG
GATATCATTC AGCGAGCGGG CATCAACGTG CTGTGGAATG ACAACGATGG CGGCTGTAAA
GGTGCCTGCG ACCGCGTGCC TCACCAGAAC GTCACCGCGC TGAATCTACC TGATCAGTGC
ATCAACGGCG AATGCTATGA CGAAGTGCTG TTCCACGGGC TTGAAGAGTA CATCAATAAC
CTGCAAGGTG ATGGCGTGAT TGTCTTACAC ACCATCGGCA GCCACGGTCC GACCTATTAC
AACCGCTATC CGCCTCAGTT CAGGAAATTT ACCCCAACCT GCGACACCAA TGAGATCCAG
ACCTGTACCA AAGAGCAACT GGTGAACACT TACGACAACA CGCTGGTTTA CGTCGACTAT
ATTGTTGATA AAGCGATTAA TCTGCTGAAA GAACATCAGG ATAAATTTAC CACCAGCCTG
GTTTATCTTT CTGACCACGG TGAATCGTTA GGTGAAAATG GCATCTATCT GCACGGTCTG
CCTTATGCCA TCGCCCCGGA TAGCCAAAAA CAGGTGCCGA TGCTGCTGTG GCTGTCGGAG
GATTATCAAA AACGGTATCA GGTTGACCAG AACTGCCTGC AAAAACAGGC GCAAACGCAA
CACTATTCAC AAGACAATTT ATTCTCCACG CTATTGGGAT TAACTGGCGT TGAGACGAAG
TATTACCAGG CTGCGGATGA TATTCTGCAA ACTTGCAGGA GAGTGAGTGA ATGA
 
Protein sequence
MRTLFDGNTV MLKRLLKRPS LNLLAWLLLA AFYISICLNI AFFKQVLQAL PLDSLHNVLV 
FLSMPVVAFS VINIVLTLSS FLWLNRPLAC LFILVGAAAQ YFIMTYGIVI DRSMIANIID
TTPAESYALM TPQMLLTLGF SGVLAALIAC WIKIKPATSR LRSVLFRGAN ILVSVLLILL
VAALFYKDYA SLFRNNKELV KSLSPSNSIV ASWSWYSHQR LANLPLVRIG EDAHRNPLMQ
NEKRKNLTIL IVGETSRAEN FSLNGYPRET NPRLAKDNVV YFPNTASCGT ATAVSVPCMF
SDMPREHYKE ELAQHQEGVL DIIQRAGINV LWNDNDGGCK GACDRVPHQN VTALNLPDQC
INGECYDEVL FHGLEEYINN LQGDGVIVLH TIGSHGPTYY NRYPPQFRKF TPTCDTNEIQ
TCTKEQLVNT YDNTLVYVDY IVDKAINLLK EHQDKFTTSL VYLSDHGESL GENGIYLHGL
PYAIAPDSQK QVPMLLWLSE DYQKRYQVDQ NCLQKQAQTQ HYSQDNLFST LLGLTGVETK
YYQAADDILQ TCRRVSE