Gene EcDH1_2148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2148 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2295954 
End bp2297636 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content44% 
IMG OID 
Productsulfatase 
Protein accessionACX39801 
Protein GI260449379 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.541761 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTCTG CATTAAAGAA AAGTGTCGTA AGTACCTCGA TATCTTTGAT ACTGGCATCT 
GGTATGGCTG CATTTGCTGC TCATGCGGCA GATGATGTAA AGCTGAAAGC AACCAAAACA
AACGTTGCTT TCTCAGACTT TACGCCGACA GAATACAGTA CCAAAGGAAA GCCAAATATT
ATCGTACTGA CCATGGATGA TCTTGGTTAT GGACAACTTC CTTTTGATAA GGGATCTTTT
GACCCAAAAA CAATGGAAAA TCGTGAAGTT GTCGATACCT ACAAAATAGG GATAGATAAA
GCCATTGAAG CTGCACAAAA ATCAACGCCG ACGCTCCTTT CATTAATGGA TGAAGGCGTA
CGTTTTACTA ACGGCTATGT GGCACACGGT GTTTCCGGCC CCTCCCGCGC CGCAATAATG
ACCGGTCGAG CTCCCGCCCG CTTTGGTGTC TATTCCAATA CCGATGCTCA GGATGGTATT
CCGCTAACAG AAACTTTCTT GCCTGAATTA TTCCAGAATC ATGGTTATTA CACTGCAGCA
GTAGGTAAAT GGCACTTGTC AAAAATCAGT AATGTGCCGG TACCGGAAGA TAAACAAACG
CGTGACTATC ATGACAACTT CACCACATTT TCTGCGGAAG AATGGCAACC TCAAAACCGT
GGCTTTGATT ACTTTATGGG ATTCCACGCT GCAGGAACGG CATATTACAA CTCCCCTTCA
CTGTTCAAAA ATCGTGAACG TGTCCCCGCA AAAGGTTATA TCAGCGATCA GTTAACCGAT
GAGGCAATTG GCGTTGTTGA TCGTGCCAAA ACACTTGACC AGCCTTTTAT GCTTTACCTG
GCTTATAATG CTCCGCACCT GCCAAATGAT AATCCTGCAC CGGATCAATA TCAGAAGCAA
TTTAATACCG GTAGTCAAAC AGCAGATAAC TACTACGCTT CCGTTTATTC TGTTGATCAG
GGTGTAAAAC GCATTCTCGA ACAACTGAAG AAAAACGGAC AGTATGACAA TACAATTATT
CTCTTTACCT CCGATAATGG TGCGGTTATC GATGGTCCTC TGCCGCTGAA CGGGGCGCAA
AAAGGCTATA AGAGTCAGAC CTATCCTGGC GGTACTCACA CCCCAATGTT TATGTGGTGG
AAAGGAAAAC TTCAACCCGG TAATTATGAC AAGCTGATTT CCGCAATGGA TTTCTACCCG
ACAGCTCTTG ATGCAGCCGA TATCAGCATT CCAAAAGACC TTAAGCTGGA TGGCGTTTCC
TTGCTGCCCT GGTTGCAAGA TAAGAAACAA GGCGAGCCAC ATAAAAATCT GACCTGGATA
ACCTCTTATT CTCACTGGTT TGACGAGGAA AATATTCCAT TCTGGGATAA TTACCACAAA
TTTGTTCGCC ATCAGTCAGA CGATTACCCG CATAACCCCA ACACTGAGGA CTTAAGCCAA
TTCTCTTATA CGGTGAGAAA TAACGATTAT TCGCTTGTCT ATACAGTAGA AAACAATCAG
TTAGGTCTCT ACAAACTGAC GGATCTACAG CAAAAAGATA ACCTTGCCGC CGCCAATCCG
CAGGTCGTTA AAGAGATGCA AGGCGTGGTA AGAGAGTTTA TCGACAGCAG CCAGCCACCG
CTTAGCGAGG TAAATCAGGA GAAGTTTAAC AATATCAAGA AAGCACTAAG CGAAGCGAAA
TAA
 
Protein sequence
MKSALKKSVV STSISLILAS GMAAFAAHAA DDVKLKATKT NVAFSDFTPT EYSTKGKPNI 
IVLTMDDLGY GQLPFDKGSF DPKTMENREV VDTYKIGIDK AIEAAQKSTP TLLSLMDEGV
RFTNGYVAHG VSGPSRAAIM TGRAPARFGV YSNTDAQDGI PLTETFLPEL FQNHGYYTAA
VGKWHLSKIS NVPVPEDKQT RDYHDNFTTF SAEEWQPQNR GFDYFMGFHA AGTAYYNSPS
LFKNRERVPA KGYISDQLTD EAIGVVDRAK TLDQPFMLYL AYNAPHLPND NPAPDQYQKQ
FNTGSQTADN YYASVYSVDQ GVKRILEQLK KNGQYDNTII LFTSDNGAVI DGPLPLNGAQ
KGYKSQTYPG GTHTPMFMWW KGKLQPGNYD KLISAMDFYP TALDAADISI PKDLKLDGVS
LLPWLQDKKQ GEPHKNLTWI TSYSHWFDEE NIPFWDNYHK FVRHQSDDYP HNPNTEDLSQ
FSYTVRNNDY SLVYTVENNQ LGLYKLTDLQ QKDNLAAANP QVVKEMQGVV REFIDSSQPP
LSEVNQEKFN NIKKALSEAK