Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C3364 |
Symbol | |
ID | 6488330 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 3272953 |
End bp | 3274692 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642743497 |
Product | arylsulfatase |
Protein accession | YP_002047112 |
Protein GI | 194449565 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.244969 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 0.949476 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAAAAAG AAGTAACACT TGCCACACTT AGCATTATCT TCTCCGGTAC GGCGCACAGT ACGCAAAACG AACGTCCTGA TATTATCGTG ATTATCGCTG ATGATATGGG ATATTCTGAT ATCACTCCCT TCGGTGGGGA AATCCCAACG CCTAATTTGC AGGCGATGGC TGAGAACGGC GTGCGGATGA GTCAATATTA CACGTCTCCC ATGTCTGCTC CCGCCCGTGC GATGCTATTA ACCGGGAACA CCAGTCAGCA AGCGGGTATA GGCGGTATGT GGTGGTATGA AAATACCATA GGTAAGGAAG GCTATGAATT GCGCCTGACT GATCGGGTGA CGACCATGGC TGAACGCTTT AAAGATGCTG GTTACAATAC GCTGATGGCG GGTAAATGGC ATCTTGGTTT TACGCCAGGC TCGACGCCAA AAGATCGGGG CTTTCGTCAT TCTTTCGCCT TGATGGGGGG AGGCGCCAGT CACTTTGATG ATGCCGTGCC GCTGGGAACC GTGGAGATAT TTCATACCTA TTATACCCGT GACAATCAGC GCATTTCACT GCCCTCCAGT TTTTACTCCA GCGAAGCCTA TGCCAGCCAG ATTAATCGCT GGATCAGCGA GACGCCACGG GAACAACCTA TCTTCGCGTG GTTGGCCTTT ACTGCGCCAC ATGATCCTCT GCAGGCGCCG GATGAATGGA TTAGTCGTTT TAAAAGTCAG TATGAACAGG GCTATGCAGA CGTCTATCGT CAGCGTATTG CTCGTTTGAA GAAACTGGGT TTCCTGCGTG ATGACATACC TCTGCCAGGA CTGGAACTTG ATAAAGAATG GCAGGCGATG ACCCCGGAAC AGCAGAAATA TACGGCGAAG GTGATGCAGG TTTACGCTGC TATGATCGCC AATATGGATG CACAGATCGG CACCGTTATT GAGACGTTAA AAAAGACCGG GCGCGATAAA AACACGATTC TGGTCTTCTT AAGTGATAAT GGTGTGAATC CGGCGGAGGG CTTTCACTAT GAATCTGAAC CGGATTTTTG GAAGCAATTC GATAATCGTT ACGAAAATAT TGGTCGTAAA AATTCATTTA TCTCTTATGG CCCCCACTGG GCTGATGTCA GCAATGCGCC TTATGGTCGC TATCACAAAA CGACCAGCGG TCAGGGGGGA ATTAATACCA GTTTTATGAT TTCCGGTCCT GGTATCATCC ATCATGGCGC CATAGATAAC GCCACGATGG CGGCGTATGA TGTTGCGCCC ACGCTCTATG AATTTGCAGG TATTGATGCC AGTAAATCAT TATCTGAAAG ACCGACACTG CCAATGATCG GCGTGAGTTT TAAACGCTAT CTGACCGGTG AAAGTCTGCA CGCGCCTCGC ACACAATATG GTGTTGAACT CCATAATCAG GCGGCCTGGA TAGATGGGGA ATGGAAATTG CGTCGTCTTG TCACAGTATT CCCACAGGCG GGTAATGCGC CATGGGAATT ATTCAACCTG CAACGTGACC CCCTGGAAAC GCATAATCTC GCAGCAAATT ATGCGGATAA AGTGAAAATA CTGAGCAGCG CATATGAGGC ATTTGCAAAA CAGACAATGG TGCTTTATGC CAAAGGCAAG CTTATCGATT ATGTGGGTAT CGACAGTAAA ACCGGGCGTT ATCTGGCTGT CGATCCACAG ACATTGCAGC CAGTTCCTGC TCCGTTAGCG ATTCCTTTAG ACACAAAATC GGACCAATAA
|
Protein sequence | MKKEVTLATL SIIFSGTAHS TQNERPDIIV IIADDMGYSD ITPFGGEIPT PNLQAMAENG VRMSQYYTSP MSAPARAMLL TGNTSQQAGI GGMWWYENTI GKEGYELRLT DRVTTMAERF KDAGYNTLMA GKWHLGFTPG STPKDRGFRH SFALMGGGAS HFDDAVPLGT VEIFHTYYTR DNQRISLPSS FYSSEAYASQ INRWISETPR EQPIFAWLAF TAPHDPLQAP DEWISRFKSQ YEQGYADVYR QRIARLKKLG FLRDDIPLPG LELDKEWQAM TPEQQKYTAK VMQVYAAMIA NMDAQIGTVI ETLKKTGRDK NTILVFLSDN GVNPAEGFHY ESEPDFWKQF DNRYENIGRK NSFISYGPHW ADVSNAPYGR YHKTTSGQGG INTSFMISGP GIIHHGAIDN ATMAAYDVAP TLYEFAGIDA SKSLSERPTL PMIGVSFKRY LTGESLHAPR TQYGVELHNQ AAWIDGEWKL RRLVTVFPQA GNAPWELFNL QRDPLETHNL AANYADKVKI LSSAYEAFAK QTMVLYAKGK LIDYVGIDSK TGRYLAVDPQ TLQPVPAPLA IPLDTKSDQ
|
| |