Gene EcHS_A4019 details


Gene Information

Locus tag: EcHS_A4019
Symbol: aslB
ID: 5591734
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 4011371
End bp: 4012606
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 52%
IMG OID: 640923123
Product: arylsulfatase-activating protein AslB
Protein accession: YP_001460590
Protein GI: 157163272
COG category: [R] General function prediction only
COG ID: [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase)
TIGRFAM ID:
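
The coordinates and lengths in this record agree with each other, which a short sketch can confirm. The function and variable names below are illustrative and not part of the record. The check assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 4011371
end_bp = 4012606


def gene_length(start, end):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS of the given length, excluding one stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the terminal stop codon


length = gene_length(start_bp, end_bp)
print(length)                  # 1236
print(protein_length(length))  # 411
```

Both results match the Gene Length and Protein Length fields.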


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000000197695
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGCAAC AGGTTCCAAC GCGTGCTTTT CATGTGATGG CGAAACCGAG TGGTTCCGAT 
TGTAATCTGA ACTGTGACTA CTGTTTTTAT CTCGAAAAAC AATCCCTTTA CCGCGAAAAG
CCAGTCACGC ATATGGACGA TGACACGCTG GAAGCGTATA TCCGTCACTA TATCGCCGCC
AGCGAACTGC AAAACGAAGT GGCTTTTACC TGGCAGGGCG GCGAACCAAC GCTACTCGGG
CTGGAGTTTT ACCGCCGTGC CGTAGCGCTA CAGGCGAAAT ATGGTGCTGG CAGGAAGATA
AGTAACAGCT TCCAGACTAA CGGCGTGCTG CTGGATGACG AATGGTGCGC GTTTCTCGCG
GAGCATCATT TTCTTGTTGG TTTATCGCTG GATGGTCCGC CTGAGATCCA CAATCAATAT
CGCGTGACTA AAGGTGGCAG ACCCACGCAT AAGCTGGTGA TGCGTGCCCT GACGCTGCTG
CAAAAACATC ATGTCGACTA TAACGTGCTG GTCTGCGTTA ATCGCACCAG CGCGCAGCAA
CCGTTGCAGG TATATGATTT TTTGTGCGAT GCGGGAGTGG AATTCATCCA GTTTATTCCG
GTGGTCGAGC GCCTGGCTGA TGAAACGGCT GCCCGCGAAG GACTGAAATT GCATGCGCCT
GGTGATATTC AGGGTGAGCT AACGGAATGG TCGGTGCGCC CCGAGGAGTT CGGTGAATTT
CTGGTGGCGA TATTCGACCA CTGGATCAAA CGCGACGTCG GCAAGATTTT CGTGATGAAT
ATCGAATGGG CGTTTGCCAA TTTTGTCGGT GCGCCGGGTG CGGTTTGCCA TCATCAGCCA
ACCTGTGGGC GCTCGGTGAT TGTTGAGCAC AACGGCGACG TTTACGCCTG CGATCACTAT
GTTTATCCAC AATATCGGCT GGGGAATATG CACCAGCAAA CAATTGCAGA AATGATCGAT
TCCCCGCAAC AGCAGGTGTT TGGTGAAGAT AAATTTAAGC AATTACCGGC GCAGTGTCGC
AGTTGTAACG TGTTAAAAGC GTGCTGGGGA GGCTGCCCGA AACACCGCTT CATGCTCGAT
GCCAGCGGCA AACCGGGGCT GAATTATTTG TGTGCCGGGT ATCAGCGTTA TTTCCGCCAT
CTACCGCCAT ATCTTAAAGC AATGGCTGAT TTGCTGGCGC ACGGTCGCCC GGCCAGTGAC
ATTATGCATG CGCATTTGCT GGTGGTGAGT AAGTAA
 
Protein sequence
MLQQVPTRAF HVMAKPSGSD CNLNCDYCFY LEKQSLYREK PVTHMDDDTL EAYIRHYIAA 
SELQNEVAFT WQGGEPTLLG LEFYRRAVAL QAKYGAGRKI SNSFQTNGVL LDDEWCAFLA
EHHFLVGLSL DGPPEIHNQY RVTKGGRPTH KLVMRALTLL QKHHVDYNVL VCVNRTSAQQ
PLQVYDFLCD AGVEFIQFIP VVERLADETA AREGLKLHAP GDIQGELTEW SVRPEEFGEF
LVAIFDHWIK RDVGKIFVMN IEWAFANFVG APGAVCHHQP TCGRSVIVEH NGDVYACDHY
VYPQYRLGNM HQQTIAEMID SPQQQVFGED KFKQLPAQCR SCNVLKACWG GCPKHRFMLD
ASGKPGLNYL CAGYQRYFRH LPPYLKAMAD LLAHGRPASD IMHAHLLVVS K
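
The protein sequence can be derived from the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the method used to produce this record. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code and differs only in the permitted start codons. Only the first 30 bases of the gene sequence are used as input.

```python
# Codon table built from the conventional TCAG ordering of the standard code.
# Assumption: table 11 shares these codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in a DNA sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence listed above.
prefix = "ATGCTGCAAC AGGTTCCAAC GCGTGCTTTT"
print(translate(prefix))  # MLQQVPTRAF
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a figure to compare against the GC content field.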