Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4165 |
Symbol | aslB |
ID | 6143689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4262605 |
End bp | 4263840 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641618988 |
Product | arylsulfatase-activating protein AslB |
Protein accession | YP_001746116 |
Protein GI | 170681829 |
COG category | [R] General function prediction only |
COG ID | [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000104917 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.969874 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCAAC AGGTTCCAAC GCGTGCTTTT CATGTGATGG CGAAACCGAG CGGTTCCGAT TGTAATCTGA ACTGTGACTA CTGTTTTTAT CTCGAAAAAC AATCCCTTTA CCGCGAAAAG CCAGTCACGC ATATGGACGA TGACACGCTG GAAGCGTATG TCCGTCACTA TATCGCTGCC AGCGAAACCC AAAACGAAGT GGCTTTTACC TGGCAGGGTG GCGAACCAAC GTTACTCGGG CTGGATTTTT ACCGCCGTGC CGTGGCGCTA CAGGCGAAAT ATGGTGCTGG CAGGAAGATA AGTAACAGCT TCCAGACTAA CGGCGTTCTG CTTGATGACG AATGGTGTGC ATTTCTGGCA GAAAATCATT TTCTTGTTGG GTTATCGCTG GATGGTCCGG CTGAGATCCA CAATCAATAT CGCGTGACCA AAGGCGGCAG ACCAACGCAT AAGCTGGTGA TGCGTGCCCT GACGCTCCTG CAAAAACATC ATGTCAACTA TAACGTGCTG GTCTGCGTAA ATCGCACCAG TGCGTTACAA CCTTTGCAAG TCTATGATTT TCTGTGTGAT GCAGGCGTTG AGTTTATCCA GTTTATTCCT GTGGTCGAAC GCCTGGCTGA TGAAACGGCT GTTCATGCTG GACTTAAGCT ACATGCTCCC GGCGATATTC AGGGCGAACT GACGGAATGG TCGGTACGCC CCGATGAGTT CGGTGAATTT TTGGTGGCGA TATTCGACCA CTGGATCAAA CGCGACGTCG GCAAGATTTT CGTGATGAAT ATCGAATGGG CGTTTGCCAA TTTTGTCGGT GCGCCAGGAG CGGTTTGCCA TCATCAGCCA ACCTGTGGGC GCTCGGTGAT TGTTGAACAC AACGGCGACG TTTACGCCTG CGATCACTAT GTTTATTCGC AATATCGACT GGGGAATATG CTTCAGCAGA CAATTGCAGA AATGGTAGAT TCCCCGCAAC AGCAGGTGTT TGGTGAAGAT AAATTTAAGC AGTTACCAGC GCAGTGTCGC AGTTGTAACG TGTTAAAAGC ATGTTGGGGA GGCTGCCCGA AACACCGCTT TATGCTCGAT GCCAGCGGCA AACCGGGGCT GAATTATTTG TGTGCCGGGT ATCAGCGTTA TTTCCGCCAT CTACCGCCAT ATCTTAAAGC AATGGCTGAT TTGCTGGCGC ACGGTCGTCC GGCCAGCGAC ATTATGCAGG CGCATTTGAT GGTGGTGAAT AAGTAG
|
Protein sequence | MLQQVPTRAF HVMAKPSGSD CNLNCDYCFY LEKQSLYREK PVTHMDDDTL EAYVRHYIAA SETQNEVAFT WQGGEPTLLG LDFYRRAVAL QAKYGAGRKI SNSFQTNGVL LDDEWCAFLA ENHFLVGLSL DGPAEIHNQY RVTKGGRPTH KLVMRALTLL QKHHVNYNVL VCVNRTSALQ PLQVYDFLCD AGVEFIQFIP VVERLADETA VHAGLKLHAP GDIQGELTEW SVRPDEFGEF LVAIFDHWIK RDVGKIFVMN IEWAFANFVG APGAVCHHQP TCGRSVIVEH NGDVYACDHY VYSQYRLGNM LQQTIAEMVD SPQQQVFGED KFKQLPAQCR SCNVLKACWG GCPKHRFMLD ASGKPGLNYL CAGYQRYFRH LPPYLKAMAD LLAHGRPASD IMQAHLMVVN K
|
| |