Gene EcE24377A_4186 details

Gene Information

Locus tag: EcE24377A_4186
Symbol:
ID: 5590158
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 4173672
End bp: 4175165
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 52%
IMG OID: 640927803
Product: sulfatase
Protein accession: YP_001465162
Protein GI: 157157618
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
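
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single trailing stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 4173672
END_BP = 4175165
GENE_LENGTH_BP = 1494
PROTEIN_LENGTH_AA = 497


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons should hold if the assumptions above are right.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the span 4173672 to 4175165 gives 1494 bp, and 1494 / 3 - 1 gives 497 codons, in agreement with the listed protein length.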


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACGCC CCAATTTTCT GTTCATCATG ACCGATACCC AGGCCACCAA TATGGTCGGT 
TGCTATAGCG GTAAACCGCT GAATACGCAA AATATTGATA GTCTGGCGGC GGAAGGTATT
CGCTTTAATT CCGCCTACAC CTGTTCACCG GTTTGTACGC CTGCACGCGC CGGACTATTT
ACCGGTATCT ACGCTAACCA GTCCGGCCCG TGGACCAACA ACGTCGCGCC AGGCAAAAAC
ATCTCCACTA TGGGGCGCTA CTTTAAGGAT GCCGGCTATC ACACCTGTTA CATCGGCAAA
TGGCATCTCG ACGGTCATGA CTATTTCGGC ACTGGCGAGT GTCCGCCGGA GTGGGACGCT
GATTACTGGT TCGATGGGGC GAACTACCTT AGCGAACTGA CGGAAAAAGA GATCAGCCTG
TGGCGCAATG GCCTAAACAG CGTTGAGGAT TTACAGGCGA ACCATATCGA CGAAACCTTC
ACCTGGGCGC ACCGCATCAG CAATCGGGCG GTAGATTTTC TGCAACAGCC CGCGCGCGCC
GAGGAACCCT TCCTGATGGT GGTTTCGTAT GATGAGCCGC ATCACCCGTT CACCTGTCCG
GTGGAGTATT TAGAGAAATA CGCTGATTTT TACTACGATC TGGGCGAGAA AGCTCAGGAT
GACCTGGCGA ACAAACCGGA ACATCACCGC TTATGGGCGC AGGCGATGCC ATCGCCAGTC
GGTGATGACG GGCTTTATCA CCATCCGCTC TATTTTGCCT GTAATGACTT TGTTGATGAC
CAAATCGGAC GGGTCATCAA TGCCTTAACG CCAGAGCAAC GTGAAAATAC GTGGGTCATT
TATACTTCCG ATCACGGCGA AATGATGGGC GCACATAAGC TGATCAGTAA AGGAGCGGCG
ATGTATGACG ACATCACCCG TATTCCGCTG ATCATCCGTT CGCCGCAAGG GGAGCGGCGA
CAGGTCGATA CGCCAGTCAG TCATATCGAT TTACTGCCGA CAATGATGGC GCTGGCAGAT
ATTGAAAAAC CAGAGATTCT GCCGGGGGAA AATATCCTTG CCGTGAAAGA GCCACGCGGT
GTAATGGTGG AATTTAACCG CTACGAGATT GAGCATGACA GCTTTGGCGG TTTTATTCCG
GTGCGTTGCT GGGTGACGGA TGACTTTAAA CTGGTACTCA ACCTCTTCAC CAGTGATGAA
CTTTACGATC GCCGTAATGA CCCAAATGAA ATGCATAACC TGATCGATGA TATCCGTTTT
GCAGACGTTC GCAGCAAAAT GCATGACGCC TTATTGGATT ACATGGACAA AATTCGCGAT
CCGTTCCGCA GTTACCAATG GAGCCTGCGT CCGTGGCGTA AAGATGCACT GCCGCGCTGG
ATGGGGGCAT TTCGTCCACG CCCACAAGAT GGCTATTCGC CGGTGGTACG TGACTATGAC
ACCGGCCTAC CGACGCAAGG AGTGAAAGTG GAAGAGAAAA AACAGAAGTT CTGA
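
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before its length or GC content can be computed. The following is a minimal sketch, assuming the sequence contains only unambiguous A/C/G/T characters; `clean_sequence` and `gc_fraction` are hypothetical helper names. Only the first line of the sequence is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumes no ambiguity codes)."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, as printed in the record.
first_line = "ATGAAACGCC CCAATTTTCT GTTCATCATG ACCGATACCC AGGCCACCAA TATGGTCGGT"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```

Running the same functions over the full sequence text would give values to compare with the listed gene length and GC content.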
 
Protein sequence
MKRPNFLFIM TDTQATNMVG CYSGKPLNTQ NIDSLAAEGI RFNSAYTCSP VCTPARAGLF 
TGIYANQSGP WTNNVAPGKN ISTMGRYFKD AGYHTCYIGK WHLDGHDYFG TGECPPEWDA
DYWFDGANYL SELTEKEISL WRNGLNSVED LQANHIDETF TWAHRISNRA VDFLQQPARA
EEPFLMVVSY DEPHHPFTCP VEYLEKYADF YYDLGEKAQD DLANKPEHHR LWAQAMPSPV
GDDGLYHHPL YFACNDFVDD QIGRVINALT PEQRENTWVI YTSDHGEMMG AHKLISKGAA
MYDDITRIPL IIRSPQGERR QVDTPVSHID LLPTMMALAD IEKPEILPGE NILAVKEPRG
VMVEFNRYEI EHDSFGGFIP VRCWVTDDFK LVLNLFTSDE LYDRRNDPNE MHNLIDDIRF
ADVRSKMHDA LLDYMDKIRD PFRSYQWSLR PWRKDALPRW MGAFRPRPQD GYSPVVRDYD
TGLPTQGVKV EEKKQKF
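
The relation between the gene and protein sequences can be illustrated with a small translation routine. This is a sketch, not the database's own method: it builds the codon table from the compact NCBI-style amino-acid string, relying on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons). Translation stops at the first stop codon, and any codon with unexpected characters is rendered as `X`.

```python
from itertools import product

# Codon table in T, C, A, G order; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence.
print(translate("ATGAAACGCC CCAATTTTCT G"))  # prints MKRPNFL
```

The output matches the first seven residues of the protein sequence above.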