Gene SNSL254_A4429 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4429 
Symbol 
ID6485518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4298511 
End bp4300208 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content48% 
IMG OID642739667 
Productarylsulfotransferase 
Protein accessionYP_002043361 
Protein GI194446105 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones83 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGCAT CAGCAGATGT CATTCCAGTA CATGAAGGGC CATTAGGTAT GGTCGATGTC 
GCTCCCTACG GCGGCGTTTT CCCATTAACA GCAATCATTA ATAAAGCCAA TCATAATGTA
CAGGACGTGA AGGTTACCGT TTTAGGGAAA GGGGAAAAAG GTATCCCGAT CAGTTATGAT
GTCGGCCCGC AGGCTATAAA TACCCATGAC GGCATACCTG TATTTGGCTT GTATCCAGAT
TATGTCAATA AGGTTAAAGT TGACTGGACT GAAGAAGGTA AAAAACAAAC ATATACGTGG
TCCATTTACG CCGCACCGGT ATCATTACCC TCTACTACCG GGCAAACTGC CGTTCTTCCT
ACAGTAGAAC CGGTTAAAGT CGATAGCTCG CTTAAAAATC GCTTATATCT TTTTAACCAT
ATAACAGGGA TGCCAAGAGC CGGCCACATT ATGCATGTCG CAGGCGGCGC GGCGAACTGG
GATTATACCG GTATCAACTG GATTAGCGAT ACGAATGGCG ATGTTCGTGG CTATATGAAT
ATTGATAAAT TCCGTAACCA GGATGATATA ACGCGTTTTG GTTCCATGAT GAGCTTCCAC
CAGGTTAACG ATGGCAATCT TATTTTTGGC CAGGGTCAAC GTTACTTTAA ATATGATTTC
TTAGGCCGCG TTATTTCTGA TAAACGACTG CCAAAAGGAT TTATTGATTT TTCGCACGCC
ATTACCGAAA CGCCGAAAGG CACCTACCTG CTGCGTGTCG CAAAAGAAAA TTATCCATTA
AATGGTAAAT ACACCATCAA TACTGTGCGT GATCATATTC TTGAAGTTGA CCAGAACGGC
GATACCGTCG ATTACTGGGA TCTGCCAAAA ATCCTCGACC CCTATCGTGA CGACGTTATT
CTGGCGATGG ATCAGGGAGC GGTATGTTTG AGCGTCGATG CCGAACATTC CGGTCAGGTC
ATGACCAAAG AGCAGCTTGC AAAACAACCC TTCGGCGATA TCGCGGGCTC CGGCCCGGGC
CGCAACTGGG CGCATGTTAA CTCCGTCAGC TACGATCCTC GCGACGACAG CATTATCATT
AGCTCGCGCC ACCAGTCTGC CATCATCAAA ATTGGTCGCG ATAAAAAAGT GAAATGGATA
CTTTCCGATC CATCCGGCTG GAAAGGCGAA CTGGCGAAAA AAGTGCTGAA ACCCGTAGAC
AGCAACGGTA AACCGCTAAC CTGCGAAGCG CACCACTGCG ACGGTGGATT TGACTGGACA
TGGACACAAC ATACCGGTTG GTTAGTGCCA TCCAAAAGCA CCGGAGGTAA AACCGTCGTG
ACCGCCTTTG ATAACGGCGA TGCGCGCGGC ATGGAACAAC CGGCCATGCC GTCAATGAAA
TATTCCCGCG GCGTGGAATA TCAGATTGAC GAAAAAAATA TGACGGTTTC CCAAATGTGG
GAATATGGTA AAGAGCGCGG TTTTGACTGG TACAGCGCCA TTACTTCCGT CACGGAATAT
CGCCCGGAAA CCAAAACGAT GTTCATGTAC TCGGCCACAG CGGGAATGAG CGGTACAAAA
CCGATCGTTT CCGTTCTGGA TGAAGTCAAA GACGGTACTC AGGATGTGAT GCTGGAGCTA
AAAGTACACA GTAACCGTGC CGGTATGCTG GGTTATCGGG CGCTGATTAT CGATCCAGAG
CAGATGTTTA AAAAATAA
 
Protein sequence
MVASADVIPV HEGPLGMVDV APYGGVFPLT AIINKANHNV QDVKVTVLGK GEKGIPISYD 
VGPQAINTHD GIPVFGLYPD YVNKVKVDWT EEGKKQTYTW SIYAAPVSLP STTGQTAVLP
TVEPVKVDSS LKNRLYLFNH ITGMPRAGHI MHVAGGAANW DYTGINWISD TNGDVRGYMN
IDKFRNQDDI TRFGSMMSFH QVNDGNLIFG QGQRYFKYDF LGRVISDKRL PKGFIDFSHA
ITETPKGTYL LRVAKENYPL NGKYTINTVR DHILEVDQNG DTVDYWDLPK ILDPYRDDVI
LAMDQGAVCL SVDAEHSGQV MTKEQLAKQP FGDIAGSGPG RNWAHVNSVS YDPRDDSIII
SSRHQSAIIK IGRDKKVKWI LSDPSGWKGE LAKKVLKPVD SNGKPLTCEA HHCDGGFDWT
WTQHTGWLVP SKSTGGKTVV TAFDNGDARG MEQPAMPSMK YSRGVEYQID EKNMTVSQMW
EYGKERGFDW YSAITSVTEY RPETKTMFMY SATAGMSGTK PIVSVLDEVK DGTQDVMLEL
KVHSNRAGML GYRALIIDPE QMFKK