Gene Information |
Locus tag | SeAg_B0923 |
Symbol | |
ID | 6795344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 912974 |
End bp | 914461 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642775196 |
Product | sulfatase |
Protein accession | YP_002145839 |
Protein GI | 197249992 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
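The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 912974
end_bp = 914461
protein_length_aa = 495

# Assumption: coordinates are 1-based and inclusive, so the span
# includes both the start and the end base.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon (3 bp) per residue plus one terminal stop codon.
expected_cds_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)   # 1488
print(expected_cds_bp)  # 1488
```

Both values agree with the 1488 bp gene length listed in the table.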
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.362875 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAAGCGA TTATTCTGCT GTTTGACAGT CTGAATAAAA ACTATTTGCC GCCCTATGGT GATTTGCTAA CGAAAGCGCC TAACTTTCAA CGCCTGGCGG CACATGCCGC CACCTTTGAC AATAGTTATG TCGGCAGTAT GCCCTGTATG CCAGCCCGTC GGGAACTGCA CACCGGGCGC TATAATTTCC TGCATCGTGA GTGGGGGCCG CTGGAACCCT TTGATGATTC CATGCCGGAA TTATTGAAAA AAGCGGGGAT CTACACCCAT CTTATCAGCG ATCATCTGCA TTACTGGGAA GACGGCGGCG GTAACTACCA TAACCGCTAT AGCTCCTGGG ACGTAGTACG CGGTCAGGAG GGCGATCACT GGAAGGCGAG CGTTGGCGAG CCGCCCATTC CGGAAGTACT GCGCGTTCCA CAAAAACAAA CCGGAGGCGG CGTTTCCGGG CTATGGCGTC ATGACTGGGC GAACCGCGAA TACATCCAGC AGGAAGCCGA TTTTCCCCAG ACGAAAGTTT TTGACGCCGG GTGCGATTTT ATCCATAAAA ATCATGCCGA AGATAACTGG TTATTGCAGG TTGAGACGTT TGATCCGCAT GAGCCGTTTT ATACCACCGA GGAATATTTA TCGCTCTATG ACGATGAGTG GCAAGGCCCG CATTATGACT GGCCGCGCGG CAAAGTCAGT GAAAGTGAGG AGGCGATAGC GCATATTCGC TGTCGTTATC GGGCCCTGGT TTCCATGTGC GACCGCAATC TGGGACGTAT CCTTGATCTG ATGGATGAAC ACGATCTCTG GCGCGATACG ATGCTGATTG TCGGTACCGA TCACGGCTTC TTGCTGGGGG AGCACGGTTG GTGGGCTAAA AATCAAATGC CCTATTATAA CGAGGTGGCG AATAACCCGC TGTTTATCTG GGACCCGCGC AGCGCGGTAT GCGGAGCGCG ACGGCAGTCG CTGGTGCAGA TGATTGACTG GGCACCAACG CTACTGGATT ATTTTCAGCA ACCTATTCCC GCAGATATGC AGGGCCAACC GCTGGCGAAA GTCATTGCCA GTGATGAACC CGTCAGGGAA GGCGCGCTGT TTGGCGTGTT TAGCGGACAT GTTAATGTTA CCGACGGACG CTATGTTTAT ATGCGGGCCG CGCAGCCGGG GCGTGAGCAT GACATTGCGA ACTACACGTT AATGCCGATC AAGATGAATG CGCGTTATGA TGTGGATGAA CTGGGAAAAT TATCTCTGGC ACCTCCGTTT AACTTTACTA AAGGGCTTCA GGTATTACGT ATTCCGGCCA GGGAAAAATA TAAAGGTGTG AATAGCTTTG GTCATCTTTT GTTTGATCTC AGAGACGATC CGCAGCAGCA ACATCCTATT CATGATGAGG CCATCGAAGC AAGGATGATC AACTTACTTA TCCGTTTGAT GAAAGAAAAT GATGCTCCGG CGGAGCAGTA TCGCCGTCTG GGTCTGGATG TTGTCTAA
Protein sequence | MKAIILLFDS LNKNYLPPYG DLLTKAPNFQ RLAAHAATFD NSYVGSMPCM PARRELHTGR YNFLHREWGP LEPFDDSMPE LLKKAGIYTH LISDHLHYWE DGGGNYHNRY SSWDVVRGQE GDHWKASVGE PPIPEVLRVP QKQTGGGVSG LWRHDWANRE YIQQEADFPQ TKVFDAGCDF IHKNHAEDNW LLQVETFDPH EPFYTTEEYL SLYDDEWQGP HYDWPRGKVS ESEEAIAHIR CRYRALVSMC DRNLGRILDL MDEHDLWRDT MLIVGTDHGF LLGEHGWWAK NQMPYYNEVA NNPLFIWDPR SAVCGARRQS LVQMIDWAPT LLDYFQQPIP ADMQGQPLAK VIASDEPVRE GALFGVFSGH VNVTDGRYVY MRAAQPGREH DIANYTLMPI KMNARYDVDE LGKLSLAPPF NFTKGLQVLR IPAREKYKGV NSFGHLLFDL RDDPQQQHPI HDEAIEARMI NLLIRLMKEN DAPAEQYRRL GLDVV
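The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below, using only the Python standard library, shows one way to strip the spacing, estimate GC content, and translate a coding sequence. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code. It is demonstrated on the first 30 bases of the gene sequence only.

```python
# Hypothetical helpers; not part of the database record.
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI layout). Translation
# table 11 shares these codon-to-residue assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (sequence must be non-empty)."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop.

    Assumes only A, C, G, T are present; ambiguous bases raise KeyError.
    """
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
fragment = "ATGAAAGCGA TTATTCTGCT GTTTGACAGT"
print(translate(fragment))               # MKAIILLFDS
print(round(gc_fraction(fragment), 3))   # 0.367
```

The translated fragment matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` gives a way to compare against the GC content and protein sequence reported in the record.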