Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A0039 |
Symbol | |
ID | 6485400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 40122 |
End bp | 41615 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642735483 |
Product | sulfatase |
Protein accession | YP_002039265 |
Protein GI | 194443287 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 0.225946 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGAA CAGTCGTTGC CAGTATGATA GGGTTGGCGC TATGCGCTGG ATGCGTATTA TCAACCGCGC AAGCGGCAAC CGCAAAGCGT CCTAACTTAG TCATTATTCT GGCGGATGAT TTAGGGTATG GCGATCTCGC CACCTACGGG CACCGCATCG TTAAAACACC TAACATAGAC AAATTGGCGC AGGAGGGGGT GAAGTTTACC GACTATTATG CGCCAGCGCC TCTGTGTTCT CCTTCCCGCG CGGGCCTGTT AACCGGTCGT ATGCCGTTCC GTACCGGAAT CCGTTCCTGG ATACCGGAAG GCAAAGATGT TGCGCTGGGG CGTAATGAAC TGACTATCGC CAATCTGCTA AAACAGCAGG GCTACGATAC GGCGATGATG GGGAAATTAC ACCTGAATGC GGGCGGCGAT CGCACCGATC AGCCGCAGGC GAAAGACATG GGCTTTGACT ATACGTTGGT TAATCCGGCG GGATTTGTCA CCGATGCTAC GCTGGATAAC GCCAAGGAGC GCCCGCGCTA TGGCGTGGTG CATCCTACGG GGTGGATTCG TAATGGCCAA CATATTGGCC GCGCAGATAA GATGAGCGGC GAGTTTGTGA GCTCTGAAGT GGTGAACTGG CTGGATAATA AAAAAGACGA TAATCCGTTC TTCTTATATG TCGCCTTTAC CGAAGTCCAT AGCCCGCTGG CGTCGCCGAA AAAATACCTT GATATGTATT CGCAGTACAT GACCGACTAC CAGAAGCAGC ATCCGGATCT GTTCTACGGT GACTGGGCAG ACAAACCGTG GCGCGGTACC GGCGAATATT ACGCCAATAT CAGCTATATG GATGAGCAGG TCGGTAAAGT GCTGGATAAA ATTAAGGCGA TGGGCGAGGA AGATAACACT ATCGTCATCT TTACCAGCGA CAACGGCCCT GTCACGCGTG AAGCGCGTAA GGTATACGAG CTGAACCTGG CCGGGGAAAC CGACGGTCTG CGCGGGCGTA AAGACAACCT GTGGGAAGGT GGCATTCGCG TACCGGCAAT CATCAAATAC GGCAAGCACA TTCCACAGGG GATGGTAACG GACACGCCGG TATATGGTCT TGACTGGTTG CCGACGCTGG CCAACATGAT GGACTTTAAA CTTCCGACCG ATCGTACCTA CGACGGTCAG TCTTTAGTTC CGCTCCTGAA GGACAAGACG TTAAAACGCC AGAAACCGCT GATCTTCGGT ATCGATATGC CGTTCCAGGA CGATCCAACG GATGAGTGGG CGATCCGCGA CGGCGACTGG AAGATGATCA TCGATCGCCA GAATAAACCT AAATATCTCT ATAACCTGAA AACCGATCGT TTCGAGACGC TCAATCAAAT TGGTAAACAG CCGCAGATTG AGAAACAGCT TTACGGTAAG TTCCTGAAGT ATAAAAAGGA TATTGATAAC GATTCGCTGA TGAAAGCCCG TGGCGATAAG CCGACGCCTG TCACTTGGGG CTAA
|
Protein sequence | MKRTVVASMI GLALCAGCVL STAQAATAKR PNLVIILADD LGYGDLATYG HRIVKTPNID KLAQEGVKFT DYYAPAPLCS PSRAGLLTGR MPFRTGIRSW IPEGKDVALG RNELTIANLL KQQGYDTAMM GKLHLNAGGD RTDQPQAKDM GFDYTLVNPA GFVTDATLDN AKERPRYGVV HPTGWIRNGQ HIGRADKMSG EFVSSEVVNW LDNKKDDNPF FLYVAFTEVH SPLASPKKYL DMYSQYMTDY QKQHPDLFYG DWADKPWRGT GEYYANISYM DEQVGKVLDK IKAMGEEDNT IVIFTSDNGP VTREARKVYE LNLAGETDGL RGRKDNLWEG GIRVPAIIKY GKHIPQGMVT DTPVYGLDWL PTLANMMDFK LPTDRTYDGQ SLVPLLKDKT LKRQKPLIFG IDMPFQDDPT DEWAIRDGDW KMIIDRQNKP KYLYNLKTDR FETLNQIGKQ PQIEKQLYGK FLKYKKDIDN DSLMKARGDK PTPVTWG
|
| |