Gene ECH74115_5236 details


Gene Information

Locus tag: ECH74115_5236
Symbol: aslB
ID: 6972341
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4879686
End bp: 4880921
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 52%
IMG OID: 643388901
Product: arylsulfatase-activating protein AslB
Protein accession: YP_002273315
Protein GI: 209399680
COG category: [R] General function prediction only
COG ID: [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase)
TIGRFAM ID:
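
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(4879686, 4880921)
print(length)                     # 1236
print(protein_length_aa(length))  # 411
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields listed in this record.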


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000331519
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCAAC AGGTTCCAAC GCGTGCTTTT CATGTGATGG CGAAACCGAG CGGTTCCGAT 
TGTAATCTGA ACTGTGACTA CTGTTTTTAT CTCGAAAAAC AATCCCTTTA CCGCGAAAAG
CCAGTCACGC ATATGGACGA TGACACGCTG GAAGCGTATG TCCGTCACTA TATCGCTGCC
AGCGAACCGC AAAACGAAGT GGCTTTTACC TGGCAGGGCG GCGAACCAAC GCTACTCGGG
CTGGAGTTTT ACCGCCGTGC CGTGGCGCTA CAGGCGAAAT ATGGTGCTGG CAGGAAGATA
AGTAACAGCT TCCAGACTAA CGGCGTACTG CTCGATGATG AATGGTGTGC ATTTCTGGCA
GAAAATCATT TTCTTGTTGG GTTATCGCTG GATGGTCCGG CTGAGATCCA CAATCAATAT
CGCGTGACAA AAGGCGGCAG ACCCACGCAT AAACTGGTGA TGCGTGCCCT GACGCTCCTG
CAAAAACATC ATGTCGACTA TAACGTGCTG GTCTGCGTCA ACCGCACCAG CGCGCAGCAA
CCGTTACAGG TTTATGATTT TTTGTGCGAT GCGGGAGTCG AATTCATCCA GTTTATTCCG
GTGGTCGAGC GCCTGGCTGA TGAAACGACT GCCCGCGAAG GACTGAAACT ACATGCGCCT
GGTGATATTC AGGGGGAACT GACGGAATGG TCTGTGCGCC CCGATGAATT TGGTGAATTT
CTGGTGGCGA TATTCGACCA CTGGATTAAA CGCGACGTCG GCAAGATTTT CGTGATGAAT
ATCGAATGGG CGTTTGCCAA TTTTGTCGGT GCGCCGGGTG CGGTTTGCCA TCATCAGCCA
ACCTGTGGGC GCTCGGTGAT TGTTGAGCAC AATGGCGACG TTTACGCCTG CGATCACTAT
GTTTATCCGC AATATCGACT GGGGAATATG CACCAGCAAA CAATTGCAGA AATGATCGAT
TCCCCGCAAC AGCAGGTGTT TGGTGAAGAT AAATTTAAGC AATTACCGGC GCAGTGTCGC
AGTTGTAACG TGTTAAAAGC ATGTTGGGGA GGCTGCCCGA AACACCGCTT CATGCTCGAT
GCCAGCGGCA AACCGGGACT GAATTATTTG TGTGCCGGGT ATCAGCGTTA TTTCCGCCAT
CTACCGCCAT ATCTTAAAGC AATGGCTGAT TTGCTGGCGC ACGGTCGCCC GGCCAGCGAC
ATTATGCAGG CGCATTTGCT GGTGGTGAGT AAGTAA
 
Protein sequence
MLQQVPTRAF HVMAKPSGSD CNLNCDYCFY LEKQSLYREK PVTHMDDDTL EAYVRHYIAA 
SEPQNEVAFT WQGGEPTLLG LEFYRRAVAL QAKYGAGRKI SNSFQTNGVL LDDEWCAFLA
ENHFLVGLSL DGPAEIHNQY RVTKGGRPTH KLVMRALTLL QKHHVDYNVL VCVNRTSAQQ
PLQVYDFLCD AGVEFIQFIP VVERLADETT AREGLKLHAP GDIQGELTEW SVRPDEFGEF
LVAIFDHWIK RDVGKIFVMN IEWAFANFVG APGAVCHHQP TCGRSVIVEH NGDVYACDHY
VYPQYRLGNM HQQTIAEMID SPQQQVFGED KFKQLPAQCR SCNVLKACWG GCPKHRFMLD
ASGKPGLNYL CAGYQRYFRH LPPYLKAMAD LLAHGRPASD IMQAHLLVVS K
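
The sequences in this record are laid out in space-separated groups of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch, not the database's own method: `clean_sequence` and `gc_percent` are hypothetical helper names, and the GC calculation simply assumes the percentage is the share of G and C bases over the full sequence length.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence block by removing all spaces and line breaks."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence in this record, as formatted on the page.
first_line = "ATGCTGCAAC AGGTTCCAAC GCGTGCTTTT CATGTGATGG CGAAACCGAG CGGTTCCGAT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the whole gene sequence block through `clean_sequence` should give a string whose length can be compared with the Gene Length field, and `gc_percent` of that string can be compared with the GC content field.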