Gene EcE24377A_4314 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_4314 
SymbolaslB 
ID5588766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp4304799 
End bp4306034 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content51% 
IMG OID640927931 
Productarylsulfatase-activating protein AslB 
Protein accessionYP_001465280 
Protein GI157158579 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000133586 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGCAAC AGGTTCCAAC GCGTGCTTTT CATGTGATGG CGAAACCGAG TGGTTCCGAT 
TGTAATCTGA ACTGTGACTA CTGTTTTTAT CTCGAAAAAC AATCCCTTTA CCGCGAAAAG
CCAGTCACGC ATATGGACGA TGACACGCTG GAAGCGTATG TCCGTCACTA TATCGCTGCC
AGCGAACCGC AAAACGAAGT GGCTTTTACC TGGCAGGGCG GCGAACCAAC GTTACTCGGG
CTGGATTTTT ACCGCCGTGC CGTAAAGTTA CAGGCGAAAT ACGGTGCTGG CAGGAAGATA
AGTAACAGCT TCCAGACTAA CGGCGTGCTG CTCGATGATA AATGGTGTGC ATTTCTGGCA
GAAAATCATT TTCTTGTTGG GTTATCGCTG GACGGTCCGG CTGAGATCCA CAATCAATAT
CGCGTGACCA AAGGTGGCAG ACCAACGCAT AAGCTGGTGA TGCGTGCCCT GACGCTGCTG
CAAAAACATC ATGTCGACTA TAACGTGCTG GTCTGCGTCA ACCGCACCAG CGCGCAGCAA
CCGTTGCAGG TTTATGATTT TTTGTGCGAT GCGGGAGTCG AATTCATCCA GTTTATTCCG
GTGGTCGAGC GCCTGGCTGA TGAAACAGCT GCCAGCGATG GACTGAAACT ACATGCGCCT
GGTGATATTC AGGGGGAACT GACGGAATGG TCGGTGCGCC CCGATGAATT TGGTGAATTT
CTGGTGGCGA TATTCGACCA CTGGATCAAA CGCGACGTCG GCAAGATTTT CGTGATGAAT
ATCGAATGGG CGTTTGCCAA TTTTGTCGGT GCGCCGGGTG CGGTTTGCCA TCATCAGCCA
ACCTGTGGGC GCTCGGTGAT TGTTGAGCAC AACGGCGACG TTTACGCCTG CGATCACTAT
GTTTATCCGC AATATCGGCT GGGGAATATG CATCAGCAAA CAATTGCAGA AATGATCGAT
TCCCCGCAAC AGCAGGTGTT TGGTGAAGAT AAATTTAAGC AATTACCGGC GCAGTGTCGC
AGTTGTAACG TGTTAAAAGC GTGCTGGGGA GGCTGCCCGA AACACCGCTT CATGCTCGAT
GCCAGCGGCA AACCGGGGCT GAATTATTTG TGTGCCGGGT ATCAGCGTTA TTTCCGCCAT
CTACCGCCAT ATCTTAAAGC AATGGCTGAT TTGCTGGCGC ACGGTCGCCC GGCCAGTGAC
ATTATGCAGG CACATTTGCT GGTGGTGAAT AAGTAA
 
Protein sequence
MLQQVPTRAF HVMAKPSGSD CNLNCDYCFY LEKQSLYREK PVTHMDDDTL EAYVRHYIAA 
SEPQNEVAFT WQGGEPTLLG LDFYRRAVKL QAKYGAGRKI SNSFQTNGVL LDDKWCAFLA
ENHFLVGLSL DGPAEIHNQY RVTKGGRPTH KLVMRALTLL QKHHVDYNVL VCVNRTSAQQ
PLQVYDFLCD AGVEFIQFIP VVERLADETA ASDGLKLHAP GDIQGELTEW SVRPDEFGEF
LVAIFDHWIK RDVGKIFVMN IEWAFANFVG APGAVCHHQP TCGRSVIVEH NGDVYACDHY
VYPQYRLGNM HQQTIAEMID SPQQQVFGED KFKQLPAQCR SCNVLKACWG GCPKHRFMLD
ASGKPGLNYL CAGYQRYFRH LPPYLKAMAD LLAHGRPASD IMQAHLLVVN K