Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3371 |
Symbol | |
ID | 6484629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 3269920 |
End bp | 3271659 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642738662 |
Product | arylsulfatase |
Protein accession | YP_002042382 |
Protein GI | 194445259 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.64108 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 0.82619 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAAAAAC AAGTAACACT TGCCACACTC AGCATTATCT TCTCCGGTAC GGCACACAGT ACGCAAAACG AACGTCCTGA TATTATCGTG ATTATCGCTG ATGATATGGG ATATTCTGAT ATCACTCCCT TCGGTGGGGA AATCCCAACG CCTAATTTGC AGGCGATGGC TGAGAACGGC GTGCGGATGA GTCAATATTA CACGTCTCCC ATGTCTGCTC CCGCCCGTGC GATGCTATTA ACCGGGAACA CCAGTCAGCA AGCGGGTATA GGCGGTATGT GGTGGTATGA AAATACCATA GGTAAGGAAG GCTATGAATT GCGCCTGACT GATCGGGTGA CGACCATGGC TGAACGCTTT AAAGATGCTG GTTACAATAC GCTGATGGCG GGTAAATGGC ATCTTGGTTT TACGCCAGTC TCGACGCCAA AAGATCGGGG CTTTCGTCAT TCTTTCGCCT TGATGGGGGG AGGCGCCAGT CACTTTGATG ATGCCGTGCC GCTGGGAACC GTGGAGATAT TTCATACCTA TTATACCCGT GACAATCAGC GCATTTCACT GCCCTCCAGT TTTTACTCCA GCGAAGCCTA TGCCAGCCAG ATTAATCGCT GGATCAGCGA GACGCCACGG GAACAACCTA TCTTCGCGTG GTTGGCCTTT ACTGCGCCAC ATGATCCTCT GCAGGCGCCG GATGAATGGA TTAGTCGTTT TAAAAGTCAG TATGAACAGG GCTATGCAGA CGTCTATCGT CAGCGTATTG CTCGTTTGAA GAAACTGGGT TTCCTGCGTG ATGACATACC TCTGCCAGGA CTGGAACTTG ATAAAGAATG GCAGGCGATG ACCCCGGAAC AGCAGAAATA TACGGCGAAG GTGATGCAGG TTTACGCTGC TATGATCGCC AATATGGATG CACAGATCGG CACCGTTATT GAGACGTTAA AAAAGACCGG GCGCGATAAA AACACGATTC TGGTCTTCTT AAGTGATAAT GGTGTGAATC CGGCGGAGGG CTTTCACTAT GAATCTGAAC CGGATTTTTG GAAGCAATTC GATAATCGTT ACGAAAATAT TGGTCGTAAA AATTCATTTA TCTCTTATGG CCCCCACTGG GCTGATGTCA GCAATGCGCC TTATGGTCGC TATCACAAAA CGACCAGCGG TCAGGGGGGA ATTAATACCA GTTTTATGAT TTCCGGTCCT GGTATCATCC ATCATGGCGC CATAGATAAC GCCACGATGG CGGCGTATGA TGTTGCGCCC ACGCTCTATG AATTTGCAGG TATTGATGCC AGTAAATCAT TATCTGAAAG ACCGACACTG CCAATGATCG GCGTGAGTTT TAAACGCTAT CTGACCGGTG AAAGTCTGCA CGCGCCTCGC ACACAATATG GTGTTGAACT CCATAATCAG GCGGCCTGGA TAGATGGGGA ATGGAAATTG CGTCGTCTTG TCACAGTATT CCCACAGGCG GGTAATGCTC CATGGGAATT ATTCAACCTG CAACGTGACC CCCTGGAAAC GCATAATCTC GCAGCAGATT ATGTGGATAA AGTGAAAATA CTGAGCAGTG CATATGAGGC ATTTGCAAAA CAGACAATGG TGCTTTATGC CAAAGGCAAG CTTATTGATT ATGTGGGTAT CGACAGTAAA ACCGGGCGTT ATCTGGCTGT CGATCCACAG ACATTGCAGC CAGTTCCTGC TCCGTTAGCG ATTCCTTTAG ACACAAAATC GGACCAATAA
|
Protein sequence | MKKQVTLATL SIIFSGTAHS TQNERPDIIV IIADDMGYSD ITPFGGEIPT PNLQAMAENG VRMSQYYTSP MSAPARAMLL TGNTSQQAGI GGMWWYENTI GKEGYELRLT DRVTTMAERF KDAGYNTLMA GKWHLGFTPV STPKDRGFRH SFALMGGGAS HFDDAVPLGT VEIFHTYYTR DNQRISLPSS FYSSEAYASQ INRWISETPR EQPIFAWLAF TAPHDPLQAP DEWISRFKSQ YEQGYADVYR QRIARLKKLG FLRDDIPLPG LELDKEWQAM TPEQQKYTAK VMQVYAAMIA NMDAQIGTVI ETLKKTGRDK NTILVFLSDN GVNPAEGFHY ESEPDFWKQF DNRYENIGRK NSFISYGPHW ADVSNAPYGR YHKTTSGQGG INTSFMISGP GIIHHGAIDN ATMAAYDVAP TLYEFAGIDA SKSLSERPTL PMIGVSFKRY LTGESLHAPR TQYGVELHNQ AAWIDGEWKL RRLVTVFPQA GNAPWELFNL QRDPLETHNL AADYVDKVKI LSSAYEAFAK QTMVLYAKGK LIDYVGIDSK TGRYLAVDPQ TLQPVPAPLA IPLDTKSDQ
|
| |