Gene EcHS_A1583 details

Gene Information

Locus tag: EcHS_A1583
Symbol:
ID: 5593762
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1597665
End bp: 1599347
Gene Length: 1683 bp
Protein Length: 560 aa
Translation table: 11
GC content: 44%
IMG OID: 640920736
Product: sulfatase
Protein accession: YP_001458292
Protein GI: 157160974
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
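
The start and end coordinates, gene length, and protein length listed above are mutually consistent, and the arithmetic can be checked directly. The following is a minimal sketch, assuming 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1597665, 1599347)
print(length)                  # 1683
print(protein_length(length))  # 560
```

Both values agree with the Gene Length (1683 bp) and Protein Length (560 aa) fields of the record.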


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value: 0.096723
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTCTG CATTAAAGAA AAGTGTCGTA AGTACCTCGA TATCTTTGAT ACTGGCATCT 
GGTATGGCTG CATTTGCTGC TCATGCGGCA GATGATGTAA AGCTGAAAGC AACCAAAACA
AACGTTGCTT TCTCAGACTT TACGCCGACT GAATACAGTA CCAAAGGAAA GCCAAATATT
ATCGTACTAA CCATGGATGA TCTTGGTTAT GGACAACTTC CTTTTGATAA GGGATCTTTT
GACCCAAAAA CAATGGAAAA TCGTGAAGTT GTCGATACCT ACAAAATAGG GATAGATAAA
GCCATTGAAG CTGCACAAAA ATCAACGCCG ACGCTCCTTT CATTAATGGA TGAAGGCGTA
CGGTTTACTA ACGGCTATGT GGCACACGGT GTTTCCGGCC CCTCCCGTGC CGCAATAATG
ACCGGTCGAG CTCCCGCCCG CTTTGGTGTC TATTCCAATA CCGATGCTCA GGATGGTATT
CCGCTAACAG AAACTTTCTT GCCTGAATTA TTCCAGAATC ATGGTTATTA CACTGCAGCA
GTAGGTAAAT GGCACTTGTC AAAAATCAGT AATGTGCCGG TACCGGAAGA TAAACAAACA
CGTGACTATC ATGACAACTT CACCACATTT TCTGCGGAAG AATGGCAACC TCAAAACCGT
GGCTTTGATT ACTTTATGGG ATTCCACGCT GCAGGAACGG CATATTACAA CTCCCCTTCA
CTGTTCCAAA ATCGTGAACG TGTCCCCGCA AAAGGTTATA TCAGCGATCA GTTAACCGAT
GAGGCAATTG GCGTTGTTGA TCGTGCCAAA ACACTTGACC AGCCTTTTAT GCTTTACCTG
GCTTATAATG CTCCGCACCT GCCAAATGAT AATCCTGCAC CGGAGCAATA TCAGAAGCAA
TTTAATACCG GTAGTCAAAC GGCAGATAAC TACTACGCTT CCGTTTATTC TGTTGATCAG
GGTGTAAAAC GCATTCTCGA ACAACTGAAG AAAAACGGAC AGTATGACAA TACAATTATT
CTCTTTACCT CCGATAATGG TGCGGTTATC GATGGTCCTC TGCCGCTGAA CGGGGCGCAA
AAAGGCTATA AGAGTCAGAC CTATCCTGGC GGTACTCACA CCCCAATGTT TATGTGGTGG
AAAGGAAAAC TTCAACCCGG TAATTATGAC AAGCTGATTT CCGCAATGGA TTTCTACCCG
ACAGCTCTTG ATGCAGCCGA TATCAGCATT CCAAAAGACC TTAAGCTGGA TGGCGTTTCC
TTGCTGCCCT GGTTGCAAGA TAAGAAACAA GGCGAGCCAC ATAAAAATCT GACCTGGATA
ACCTCTTATT CTCACTGGTT TGATGAGGAA AATATTCCAT TCTGGGATAA TTACCACAAA
TTTGTCCGTC ATCAGTCAGA CGATTACCCG CATAACCCCA ACACTGAGGA CTTAAGCCAA
TTCTCTTATA CGGTGAGAAA TAACGATTAT TCGCTTGTCT ATACAGTAGA AAACAATCAG
TTAGGTCTAT ACAAACTGAC GGATCTACAG CAAAAAGATA ACCTTGCCGC CGCCAATCCG
CAGGTCGTTA AAGAGATGCA AGGCGTGGTA AGAGAGTTTA TCGACAGCAG CCAGCCACCG
CTTAGCGAGG TAAATCAGGA GAAGTTTAAC AATATCAAGA AAGCACTAAG CGAAGCGAAA
TAA
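
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be joined before any computation such as the GC content reported in the record. The following is a minimal sketch of one way to do that; the helper names are hypothetical, and whether the database computes or rounds its GC figure the same way is an assumption.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, as displayed in the record.
block = clean_sequence("ATGAAGTCTG CATTAAAGAA")
print(len(block))
print(round(100 * gc_fraction(block)))
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value to compare against the GC content field.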
 
Protein sequence
MKSALKKSVV STSISLILAS GMAAFAAHAA DDVKLKATKT NVAFSDFTPT EYSTKGKPNI 
IVLTMDDLGY GQLPFDKGSF DPKTMENREV VDTYKIGIDK AIEAAQKSTP TLLSLMDEGV
RFTNGYVAHG VSGPSRAAIM TGRAPARFGV YSNTDAQDGI PLTETFLPEL FQNHGYYTAA
VGKWHLSKIS NVPVPEDKQT RDYHDNFTTF SAEEWQPQNR GFDYFMGFHA AGTAYYNSPS
LFQNRERVPA KGYISDQLTD EAIGVVDRAK TLDQPFMLYL AYNAPHLPND NPAPEQYQKQ
FNTGSQTADN YYASVYSVDQ GVKRILEQLK KNGQYDNTII LFTSDNGAVI DGPLPLNGAQ
KGYKSQTYPG GTHTPMFMWW KGKLQPGNYD KLISAMDFYP TALDAADISI PKDLKLDGVS
LLPWLQDKKQ GEPHKNLTWI TSYSHWFDEE NIPFWDNYHK FVRHQSDDYP HNPNTEDLSQ
FSYTVRNNDY SLVYTVENNQ LGLYKLTDLQ QKDNLAAANP QVVKEMQGVV REFIDSSQPP
LSEVNQEKFN NIKKALSEAK
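
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that step in plain Python. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in their permitted start codons); the `translate` helper is illustrative, not the database's own tool, and it does not handle ambiguous bases.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position), paired with
# the amino acid string of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "ATGAAGTCTG CATTAAAGAA AAGTGTCGTA AGTACCTCGA TATCTTTGAT ACTGGCATCT"
print(translate(first_line))  # MKSALKKSVVSTSISLILAS
```

The output matches the first 20 residues of the protein sequence shown above.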