Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4314 |
Symbol | aslB |
ID | 5588766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 4304799 |
End bp | 4306034 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640927931 |
Product | arylsulfatase-activating protein AslB |
Protein accession | YP_001465280 |
Protein GI | 157158579 |
COG category | [R] General function prediction only |
COG ID | [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000000133586 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCAAC AGGTTCCAAC GCGTGCTTTT CATGTGATGG CGAAACCGAG TGGTTCCGAT TGTAATCTGA ACTGTGACTA CTGTTTTTAT CTCGAAAAAC AATCCCTTTA CCGCGAAAAG CCAGTCACGC ATATGGACGA TGACACGCTG GAAGCGTATG TCCGTCACTA TATCGCTGCC AGCGAACCGC AAAACGAAGT GGCTTTTACC TGGCAGGGCG GCGAACCAAC GTTACTCGGG CTGGATTTTT ACCGCCGTGC CGTAAAGTTA CAGGCGAAAT ACGGTGCTGG CAGGAAGATA AGTAACAGCT TCCAGACTAA CGGCGTGCTG CTCGATGATA AATGGTGTGC ATTTCTGGCA GAAAATCATT TTCTTGTTGG GTTATCGCTG GACGGTCCGG CTGAGATCCA CAATCAATAT CGCGTGACCA AAGGTGGCAG ACCAACGCAT AAGCTGGTGA TGCGTGCCCT GACGCTGCTG CAAAAACATC ATGTCGACTA TAACGTGCTG GTCTGCGTCA ACCGCACCAG CGCGCAGCAA CCGTTGCAGG TTTATGATTT TTTGTGCGAT GCGGGAGTCG AATTCATCCA GTTTATTCCG GTGGTCGAGC GCCTGGCTGA TGAAACAGCT GCCAGCGATG GACTGAAACT ACATGCGCCT GGTGATATTC AGGGGGAACT GACGGAATGG TCGGTGCGCC CCGATGAATT TGGTGAATTT CTGGTGGCGA TATTCGACCA CTGGATCAAA CGCGACGTCG GCAAGATTTT CGTGATGAAT ATCGAATGGG CGTTTGCCAA TTTTGTCGGT GCGCCGGGTG CGGTTTGCCA TCATCAGCCA ACCTGTGGGC GCTCGGTGAT TGTTGAGCAC AACGGCGACG TTTACGCCTG CGATCACTAT GTTTATCCGC AATATCGGCT GGGGAATATG CATCAGCAAA CAATTGCAGA AATGATCGAT TCCCCGCAAC AGCAGGTGTT TGGTGAAGAT AAATTTAAGC AATTACCGGC GCAGTGTCGC AGTTGTAACG TGTTAAAAGC GTGCTGGGGA GGCTGCCCGA AACACCGCTT CATGCTCGAT GCCAGCGGCA AACCGGGGCT GAATTATTTG TGTGCCGGGT ATCAGCGTTA TTTCCGCCAT CTACCGCCAT ATCTTAAAGC AATGGCTGAT TTGCTGGCGC ACGGTCGCCC GGCCAGTGAC ATTATGCAGG CACATTTGCT GGTGGTGAAT AAGTAA
|
Protein sequence | MLQQVPTRAF HVMAKPSGSD CNLNCDYCFY LEKQSLYREK PVTHMDDDTL EAYVRHYIAA SEPQNEVAFT WQGGEPTLLG LDFYRRAVKL QAKYGAGRKI SNSFQTNGVL LDDKWCAFLA ENHFLVGLSL DGPAEIHNQY RVTKGGRPTH KLVMRALTLL QKHHVDYNVL VCVNRTSAQQ PLQVYDFLCD AGVEFIQFIP VVERLADETA ASDGLKLHAP GDIQGELTEW SVRPDEFGEF LVAIFDHWIK RDVGKIFVMN IEWAFANFVG APGAVCHHQP TCGRSVIVEH NGDVYACDHY VYPQYRLGNM HQQTIAEMID SPQQQVFGED KFKQLPAQCR SCNVLKACWG GCPKHRFMLD ASGKPGLNYL CAGYQRYFRH LPPYLKAMAD LLAHGRPASD IMQAHLLVVN K
|
| |