Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B0040 |
Symbol | |
ID | 6795888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 40128 |
End bp | 41621 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642774352 |
Product | sulfatase |
Protein accession | YP_002145016 |
Protein GI | 197249148 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.843312 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAA CAGTCGTTGC CAGTATGATA GGGTTGGCGC TATGCGCTGG AAGCGTATTA TCAACCGCGC AAGCGGCAAC CGCAAAGCGT CCTAACTTAG TCATTATTCT GGCGGATGAT TTAGGGTATG GCGATCTCGC CACCTACGGG CACCGCATCG TTAAAACACC TAACATAGAC AAATTGGCGC AGGAGGGGGT GAAGTTTACC GACTATTATG CGCCAGCGCC TCTGTGTTCT CCTTCCCGCG CGGGCCTGTT AACTGGTCGT ATGCCGTTCC GTACCGGAAT CCGTTCCTGG ATACCGGAAG GCAAAGATGT TGCGCTGGGG CGTAATGAAC TGACTATCGC CAATCTGCTA AAACAGCAGG GCTACGATAC GGCGATGATG GGGAAATTAC ACCTGAATGC GGGCGGCGAT CGCACCGATC AGCCGCAGGC GAAAGACATG GGCTTTGACT ATACGTTGGT TAATCCGGCG GGATTTGTCA CCGATGCTAC GCTGGACAAC GCCAAGGAGC GCCCGCGCTA TGGCGTGGTG CATCCTACGG GGTGGATTCG TAATGGCCAA CATATTGGCC GCGCAGATAA GATGAGCGGC GAGTTTGTGA GCTCTGAAGT GGTGAACTGG CTGGATAATA AAAAAGACGA TAATCCGTTC TTCTTATATG TCGCCTTTAC CGAAGTCCAT AGCCCGCTGG CGTCGCCGAA AAAATACCTT GATATGTATT CGCAGTACAT GACCGACTAC CAGAAGCAGC ATCCGGATCT GTTCTACGGC GACTGGGCAG ACAAACCGTG GCGCGGCACT GGCGAATATT ACGCCAATAT CAGCTACATG GATGAGCAGG TCGGTAAAGT GCTGGATAAA ATTAAGGCGA TGGGCGAGGA AGATAACACC ATCGTCATCT TTACCAGCGA CAACGGCCCT GTCACGCGTG AAGCGCGTAA GGTATACGAG CTGAACCTGG CCGGGGAAAC CGACGGTCTG CGCGGGCGTA AAGACAACCT GTGGGAAGGC GGCATTCGCG TACCGGCAAT CATCAAATAC GGCAAGCACA TTCCACAGGG GATGGTAACG GACACGCCGG TATATGGTCT TGACTGGTTG CCGACGCTGG CCAACATGAT GGACTTTAAA CTTCCGACAG ATCGTACCTA CGACGGTCAG TCTTTAGTTC CGCTCCTGAA GGACAAGACG TTAAAACGCC AGAAACCGCT GATCTTCGGT ATCGATATGC CGTTCCAGGA TGATCCGACG GATGAGTGGG CGATCCGCGA CGGCGACTGG AAGATGATCA TCGATCGCCA GAATAAACCT AAATATCTCT ATAACCTGAA AACCGATCGT TTCGAGACGC TCAATCAAAT TGGTAAACAG CCGCAGATTG AGAAACAGCT TTACGGTAAG TTCCTGAAGT ATAAAAAGGA TATTGATAAC GATTCGCTGA TGAAAGCCCG TGGCGATAAG CCGACGCCTG TCACCTGGGG CTAA
|
Protein sequence | MKRTVVASMI GLALCAGSVL STAQAATAKR PNLVIILADD LGYGDLATYG HRIVKTPNID KLAQEGVKFT DYYAPAPLCS PSRAGLLTGR MPFRTGIRSW IPEGKDVALG RNELTIANLL KQQGYDTAMM GKLHLNAGGD RTDQPQAKDM GFDYTLVNPA GFVTDATLDN AKERPRYGVV HPTGWIRNGQ HIGRADKMSG EFVSSEVVNW LDNKKDDNPF FLYVAFTEVH SPLASPKKYL DMYSQYMTDY QKQHPDLFYG DWADKPWRGT GEYYANISYM DEQVGKVLDK IKAMGEEDNT IVIFTSDNGP VTREARKVYE LNLAGETDGL RGRKDNLWEG GIRVPAIIKY GKHIPQGMVT DTPVYGLDWL PTLANMMDFK LPTDRTYDGQ SLVPLLKDKT LKRQKPLIFG IDMPFQDDPT DEWAIRDGDW KMIIDRQNKP KYLYNLKTDR FETLNQIGKQ PQIEKQLYGK FLKYKKDIDN DSLMKARGDK PTPVTWG
|
| |