Gene EcDH1_0025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0025 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp25358 
End bp26851 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content52% 
IMG OID 
Productsulfatase 
Protein accessionACX37723 
Protein GI260447301 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones64 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGCC CCAATTTTCT GTTCGTCATG ACCGATACCC AGGCCACCAA TATGGTCGGT 
TGCTATAGCG GTAAACCGCT GAATACGCAA AATATTGATA GTCTGGCGGC GGAAGGTATT
CGCTTTAATT CCGCCTACAC CTGTTCACCG GTTTGTACGC CTGCACGGGC CGGACTATTT
ACCGGTATCT ACGCTAACCA GTCCGGCCCG TGGACCAACA ACGTCGCGCC AGGCAAAAAC
ATCTCCACTA TGGGGCGCTA CTTTAAGGAT GCCGGCTATC ACACCTGTTA CATCGGCAAA
TGGCATCTCG ACGGTCATGA CTATTTCGGC ACTGGCGAGT GTCCGCCGGA GTGGGACGCT
GATTACTGGT TCGATGGGGC GAACTATCTT AGCGAACTGA CGGAAAAAGA GATTAGCCTG
TGGCGCAATG GCCTAAACAG CGTCGAAGAT TTACAGGCGA ACCATATCGA CGAAACCTTC
ACCTGGGCGC ATCGTATCAG CAATCGGGCG GTGGATTTTC TGCAACAGCC TGCGCGCGCA
GACGAGCCTT TCCTGATGGT GGTTTCGTAT GATGAGCCGC ATCACCCGTT CACCTGTCCG
GTGGAGTATT TAGAGAAATA CGCTGATTTT TACTACGAGC TGGGCGAGAA AGCACAGGAT
GACCTGGCTA ACAAACCGGA ACATCACCGC TTATGGGCGC AGGCGATGCC ATCGCCAGTC
GGTGATGACG GGCTTTATCA CCATCCGCTC TATTTTGCCT GTAATGACTT TGTTGATGAC
CAAATCGGAC GGGTCATCAA TGCCTTAACG CCAGAGCAAC GTGAAAATAC GTGGGTTATT
TATACCTCCG ATCACGGCGA AATGATGGGC GCACATAAGC TGATCAGTAA AGGGGCGGCG
ATGTATGACG ACATCACCCG CATTCCGCTG ATCATCCGTT CGCCGCAAGG GGAGCGGCGA
CAGGTCGATA CGCCAGTCAG TCATATCGAT TTACTGCCGA CAATGATGGC GCTGGCAGAT
ATTGAAAAAC CAGAGATTCT GCCGGGGGAA AATATCCTTG CCGTGAAAGA GCCACGCGGC
GTGATGGTGG AATTTAACCG CTACGAGATT GAGCATGACA GCTTTGGCGG TTTTATTCCG
GTGCGTTGCT GGGTGACGGA TGACTTTAAA CTGGTACTCA ACCTCTTCAC CAGTGATGAA
CTTTACGATC GCCGTAATGA CCCAAATGAA ATGCATAACC TGATCGATGA TATCCGTTTT
GCAGACGTTC GCAGCAAAAT GCATGACGCC TTATTGGATT ACATGGACAA AATTCGCGAT
CCGTTCCGCA GTTACCAATG GAGTCTGCGT CCGTGGCGTA AAGATGCACG GCCGCGCTGG
ATGGGGGCGT TTCGTCCACG TCCACAAGAT GGCTATTCGC CAGTTGTACG CGACTATGAC
ACCGGCCTAC CGACACAAGG GGTGAAGGTG GAGGAGAAAA AACAGAAGTT CTGA
 
Protein sequence
MKRPNFLFVM TDTQATNMVG CYSGKPLNTQ NIDSLAAEGI RFNSAYTCSP VCTPARAGLF 
TGIYANQSGP WTNNVAPGKN ISTMGRYFKD AGYHTCYIGK WHLDGHDYFG TGECPPEWDA
DYWFDGANYL SELTEKEISL WRNGLNSVED LQANHIDETF TWAHRISNRA VDFLQQPARA
DEPFLMVVSY DEPHHPFTCP VEYLEKYADF YYELGEKAQD DLANKPEHHR LWAQAMPSPV
GDDGLYHHPL YFACNDFVDD QIGRVINALT PEQRENTWVI YTSDHGEMMG AHKLISKGAA
MYDDITRIPL IIRSPQGERR QVDTPVSHID LLPTMMALAD IEKPEILPGE NILAVKEPRG
VMVEFNRYEI EHDSFGGFIP VRCWVTDDFK LVLNLFTSDE LYDRRNDPNE MHNLIDDIRF
ADVRSKMHDA LLDYMDKIRD PFRSYQWSLR PWRKDARPRW MGAFRPRPQD GYSPVVRDYD
TGLPTQGVKV EEKKQKF