Gene SNSL254_A3640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3640 
Symbol 
ID6483753 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3530139 
End bp3531143 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content51% 
IMG OID642738915 
Productputative sulfite oxidase subunit YedY 
Protein accessionYP_002042632 
Protein GI194445137 
COG category[R] General function prediction only 
COG ID[COG2041] Sulfite oxidase and related enzymes 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.145156 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.000496426 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAGA TACGTCCATT AACAGAAGCC GATGTGACTG CGGAATCGGC TTTTTTTATG 
CAGCGCCGAC AGGTGCTAAA AGCATTAGGC ATCAGCGCGG CCGCCTTATC CTTACCCTCA
ACGGCGCAGG CCGATCTCTT CAGTTGGTTT AAAGGCAACG ATCGTCCGAA AGCGCCTGCC
GGTAAACCGC TTGAGTTTAG TCAGCCTGCC GCCTGGCGAA GCGATTTAGC GTTAACGCCG
GAAGATAAAG TGACGGGCTA CAACAATTTC TATGAGTTTG GCCTTGATAA AGCCGACCCG
GCGGCCAATG CCGGAAGTCT GAAAACCGAA CCGTGGACGT TGAAAATCAG CGGGGAAGTC
GCGAAGCCAT TTACGCTGGA TTACGACGAT TTAACACATC GTTTCCCATT AGAAGAGCGT
ATCTATCGAA TGCGCTGCGT CGAAGCGTGG TCCATGGTCG TGCCGTGGAT TGGTTTCCCT
TTATATAAGC TACTCGCGCA GGCACAGCCC ACCAGCCACG CTAAATATGT GGCATTCGAA
ACGCTATACG CGCCGGATGA TATGCCAGGA CAGAAAGATC GCTTTATTGG CGGCGGACTG
AAATACCCTT ATGTCGAAGG GCTACGTCTG GATGAAGCCA TGCATCCGCT GACTCTGATG
ACCGTTGGCG TCTATGGTAA GGCGTTACCC CCGCAAAACG GCGCGCCCAT TCGACTCATC
GTTCCATGGA AGTATGGTTT TAAAGGTATT AAATCTATTG TCAGCATTAA ACTCACCCGC
GAACGTCCGC CAACCACCTG GAATTTGTCG GCTCCCAACG AATATGGTTT TTACGCCAAT
GTGAACCCGC ATGTGGATCA TCCACGCTGG TCTCAGGCTA CCGAACGCTT TATTGGTTCA
GGCGGTATCC TTGATGTGCA AAGGCAGCCG ACGCTGCTGT TTAACGGCTA CGCCAATGAA
GTCGCTTCGC TGTATCGCGG TCTCAATTTG CGGGAGAATT TTTAA
 
Protein sequence
MKKIRPLTEA DVTAESAFFM QRRQVLKALG ISAAALSLPS TAQADLFSWF KGNDRPKAPA 
GKPLEFSQPA AWRSDLALTP EDKVTGYNNF YEFGLDKADP AANAGSLKTE PWTLKISGEV
AKPFTLDYDD LTHRFPLEER IYRMRCVEAW SMVVPWIGFP LYKLLAQAQP TSHAKYVAFE
TLYAPDDMPG QKDRFIGGGL KYPYVEGLRL DEAMHPLTLM TVGVYGKALP PQNGAPIRLI
VPWKYGFKGI KSIVSIKLTR ERPPTTWNLS APNEYGFYAN VNPHVDHPRW SQATERFIGS
GGILDVQRQP TLLFNGYANE VASLYRGLNL RENF