Gene SeAg_B0040 details

Gene Information

Locus tag: SeAg_B0040
Symbol:
ID: 6795888
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 40128
End bp: 41621
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 52%
IMG OID: 642774352
Product: sulfatase
Protein accession: YP_002145016
Protein GI: 197249148
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
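
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (which is consistent with the stated 1494 bp) and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 40128
END_BP = 41621


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1494
print(protein_length(length_bp))  # 497
```

Both results match the Gene Length and Protein Length fields of the record.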


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.843312
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGAA CAGTCGTTGC CAGTATGATA GGGTTGGCGC TATGCGCTGG AAGCGTATTA 
TCAACCGCGC AAGCGGCAAC CGCAAAGCGT CCTAACTTAG TCATTATTCT GGCGGATGAT
TTAGGGTATG GCGATCTCGC CACCTACGGG CACCGCATCG TTAAAACACC TAACATAGAC
AAATTGGCGC AGGAGGGGGT GAAGTTTACC GACTATTATG CGCCAGCGCC TCTGTGTTCT
CCTTCCCGCG CGGGCCTGTT AACTGGTCGT ATGCCGTTCC GTACCGGAAT CCGTTCCTGG
ATACCGGAAG GCAAAGATGT TGCGCTGGGG CGTAATGAAC TGACTATCGC CAATCTGCTA
AAACAGCAGG GCTACGATAC GGCGATGATG GGGAAATTAC ACCTGAATGC GGGCGGCGAT
CGCACCGATC AGCCGCAGGC GAAAGACATG GGCTTTGACT ATACGTTGGT TAATCCGGCG
GGATTTGTCA CCGATGCTAC GCTGGACAAC GCCAAGGAGC GCCCGCGCTA TGGCGTGGTG
CATCCTACGG GGTGGATTCG TAATGGCCAA CATATTGGCC GCGCAGATAA GATGAGCGGC
GAGTTTGTGA GCTCTGAAGT GGTGAACTGG CTGGATAATA AAAAAGACGA TAATCCGTTC
TTCTTATATG TCGCCTTTAC CGAAGTCCAT AGCCCGCTGG CGTCGCCGAA AAAATACCTT
GATATGTATT CGCAGTACAT GACCGACTAC CAGAAGCAGC ATCCGGATCT GTTCTACGGC
GACTGGGCAG ACAAACCGTG GCGCGGCACT GGCGAATATT ACGCCAATAT CAGCTACATG
GATGAGCAGG TCGGTAAAGT GCTGGATAAA ATTAAGGCGA TGGGCGAGGA AGATAACACC
ATCGTCATCT TTACCAGCGA CAACGGCCCT GTCACGCGTG AAGCGCGTAA GGTATACGAG
CTGAACCTGG CCGGGGAAAC CGACGGTCTG CGCGGGCGTA AAGACAACCT GTGGGAAGGC
GGCATTCGCG TACCGGCAAT CATCAAATAC GGCAAGCACA TTCCACAGGG GATGGTAACG
GACACGCCGG TATATGGTCT TGACTGGTTG CCGACGCTGG CCAACATGAT GGACTTTAAA
CTTCCGACAG ATCGTACCTA CGACGGTCAG TCTTTAGTTC CGCTCCTGAA GGACAAGACG
TTAAAACGCC AGAAACCGCT GATCTTCGGT ATCGATATGC CGTTCCAGGA TGATCCGACG
GATGAGTGGG CGATCCGCGA CGGCGACTGG AAGATGATCA TCGATCGCCA GAATAAACCT
AAATATCTCT ATAACCTGAA AACCGATCGT TTCGAGACGC TCAATCAAAT TGGTAAACAG
CCGCAGATTG AGAAACAGCT TTACGGTAAG TTCCTGAAGT ATAAAAAGGA TATTGATAAC
GATTCGCTGA TGAAAGCCCG TGGCGATAAG CCGACGCCTG TCACCTGGGG CTAA
 
Protein sequence
MKRTVVASMI GLALCAGSVL STAQAATAKR PNLVIILADD LGYGDLATYG HRIVKTPNID 
KLAQEGVKFT DYYAPAPLCS PSRAGLLTGR MPFRTGIRSW IPEGKDVALG RNELTIANLL
KQQGYDTAMM GKLHLNAGGD RTDQPQAKDM GFDYTLVNPA GFVTDATLDN AKERPRYGVV
HPTGWIRNGQ HIGRADKMSG EFVSSEVVNW LDNKKDDNPF FLYVAFTEVH SPLASPKKYL
DMYSQYMTDY QKQHPDLFYG DWADKPWRGT GEYYANISYM DEQVGKVLDK IKAMGEEDNT
IVIFTSDNGP VTREARKVYE LNLAGETDGL RGRKDNLWEG GIRVPAIIKY GKHIPQGMVT
DTPVYGLDWL PTLANMMDFK LPTDRTYDGQ SLVPLLKDKT LKRQKPLIFG IDMPFQDDPT
DEWAIRDGDW KMIIDRQNKP KYLYNLKTDR FETLNQIGKQ PQIEKQLYGK FLKYKKDIDN
DSLMKARGDK PTPVTWG
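
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It builds the codon table from the NCBI base order `TCAG`; translation table 11 assigns the same amino acids to codons as the standard code, so this string applies here. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and only the first 30 bases and the last 15 bases of the gene are used as inputs to keep the example short.

```python
from itertools import product

# NCBI ordering: first, second, third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGAAAAGAA CAGTCGTTGC CAGTATGATA"))  # MKRTVVASMI
print(translate("GTCACCTGGG GCTAA"))                   # VTWG
```

The two outputs correspond to the first ten and last four residues of the protein sequence. Applying `gc_percent` and `translate` to the full gene sequence gives values that can be compared against the GC content and Protein Length fields of the record.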