Gene EcDH1_4031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_4031 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4364205 
End bp4365938 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content50% 
IMG OID 
Productsulfatase 
Protein accessionACX41631 
Protein GI260451209 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATTCCA CAGAAGTCCA GGCTAAACCT CTTTTTAGCT GGAAAGCCCT GGGTTGGGCA 
CTGCTCTACT TTTGGTTTTT CTCTACTCTG CTACAGGCCA TTATTTACAT CAGTGGTTAT
AGTGGCACTA ACGGCATTCG CGACTCGCTG TTATTCAGTT CGCTGTGGTT GATCCCGGTA
TTCCTCTTTC CGAAGCGGAT TAAAATTATT GCCGCAGTAA TCGGCGTGGT GCTATGGGCG
GCCTCTCTGG CGGCGCTGTG CTACTACGTC ATCTACGGTC AGGAGTTCTC GCAGAGCGTT
CTGTTTGTGA TGTTCGAAAC CAACACCAAC GAAGCCAGCG AGTATTTAAG CCAGTATTTC
AGCCTGAAAA TTGTGCTTAT CGCGCTGGCC TATACGGCGG TGGCAGTTCT GCTGTGGACA
CGCCTGCGCC CGGTCTATAT TCCAAAGCCG TGGCGTTATG TTGTCTCTTT TGCCCTGCTT
TATGGCTTGA TTCTGCATCC GATCGCCATG AATACGTTTA TCAAAAACAA GCCGTTTGAG
AAAACGTTGG ATAACCTGGC CTCGCGTATG GAGCCTGCCG CACCGTGGCA ATTCCTGACC
GGCTATTATC AGTATCGTCA GCAACTAAAC TCGCTAACAA AGTTACTGAA TGAAAATAAT
GCCTTGCCGC CACTGGCTAA TTTCAAAGAT GAATCGGGTA ACGAACCGCG CACTTTAGTG
CTGGTGATTG GCGAGTCGAC CCAGCGCGGA CGCATGAGTC TGTACGGTTA TCCGCGTGAA
ACCACGCCGG AGCTGGATGC GCTGCATAAA ACCGATCCGA ATCTGACCGT GTTTAATAAC
GTAGTTACGT CTCGTCCGTA CACCATTGAA ATCCTGCAAC AGGCGCTGAC CTTTGCCAAT
GAAAAGAACC CGGATCTGTA TCTGACGCAG CCGTCGCTGA TGAACATGAT GAAACAGGCG
GGTTATAAAA CCTTCTGGAT CACCAACCAG CAGACGATGA CCGCCCGCAA TACCATGCTG
ACGGTATTTT CGCGCCAGAC CGACAAGCAG TACTACATGA ACCAGCAACG TACGCAGAGT
GCGCGTGAAT ACGACACCAA CGTGCTGAAG CCGTTCCAGG AAGTGCTGAA TGACCCTGCG
CCGAAGAAAC TGATCATTGT TCATCTGCTG GGTACGCATA TCAAATACAA ATACCGCTAC
CCGGAAAATC AGGGCAAGTT TGATGGCAAT ACCGATCATG TTCCGCCGGG ATTAAACGCG
GAAGAGCTGG AGTCATATAA CGATTATGAC AACGCTAACC TGTATAACGA TCATGTGGTT
GCCAGCCTGA TTAAAGACTT TAAAGCAGCA AACCCGAACG GTTTCCTGGT TTATTTCTCT
GACCACGGTG AAGAGGTTTA CGACACGCCG CCGCACAAAA CTCAGGGGCG TAATGAGGAC
AACCCGACGC GTCATATGTA CACCATTCCG TTCCTGCTGT GGACGTCAGA AAAATGGCAA
GCGACTCATC CCCGTGATTT CTCGCAGGAT GTTGATCGTA AATACAGCCT GGCGGAACTG
ATCCACACCT GGTCAGATTT GGCGGGCTTA TCTTACGACG GTTACGATCC AACCCGTTCA
GTGGTGAATC CGCAGTTCAA AGAAACTACC CGCTGGATTG GTAACCCGTA TAAGAAAAAC
GCACTGATCG ATTACGACAC ACTGCCCTAT GGCGATCAGG TGGGTAATCA GTAA
 
Protein sequence
MHSTEVQAKP LFSWKALGWA LLYFWFFSTL LQAIIYISGY SGTNGIRDSL LFSSLWLIPV 
FLFPKRIKII AAVIGVVLWA ASLAALCYYV IYGQEFSQSV LFVMFETNTN EASEYLSQYF
SLKIVLIALA YTAVAVLLWT RLRPVYIPKP WRYVVSFALL YGLILHPIAM NTFIKNKPFE
KTLDNLASRM EPAAPWQFLT GYYQYRQQLN SLTKLLNENN ALPPLANFKD ESGNEPRTLV
LVIGESTQRG RMSLYGYPRE TTPELDALHK TDPNLTVFNN VVTSRPYTIE ILQQALTFAN
EKNPDLYLTQ PSLMNMMKQA GYKTFWITNQ QTMTARNTML TVFSRQTDKQ YYMNQQRTQS
AREYDTNVLK PFQEVLNDPA PKKLIIVHLL GTHIKYKYRY PENQGKFDGN TDHVPPGLNA
EELESYNDYD NANLYNDHVV ASLIKDFKAA NPNGFLVYFS DHGEEVYDTP PHKTQGRNED
NPTRHMYTIP FLLWTSEKWQ ATHPRDFSQD VDRKYSLAEL IHTWSDLAGL SYDGYDPTRS
VVNPQFKETT RWIGNPYKKN ALIDYDTLPY GDQVGNQ