Gene SeSA_A3569 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A3569 
Symbol 
ID6517051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp3446303 
End bp3447307 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content52% 
IMG OID642748555 
Productputative sulfite oxidase subunit YedY 
Protein accessionYP_002116325 
Protein GI194734390 
COG category[R] General function prediction only 
COG ID[COG2041] Sulfite oxidase and related enzymes 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0584327 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA TACGTCCATT AACAGAAGCC GATGTGACTG CGGAATCGGC TTTTTTTATG 
CAGCGCCGAC AGGTGCTAAA AGCATTAGGC ATCAGCGCGG CCGCCTTATC CTTACCCTCA
ACGGCGCAGG CCGATCTCTT CAGTTGGTTT AAAGGCAACG ATCGTCCGAA AGCGCCTGCC
GGTAAACCGC TTGAGTTTAG TCAGCCTGCC GCCTGGCGAA GCGATTTAGC GTTAACGCCG
GAAGATAAGG TGACGGGCTA CAACAATTTC TATGAGTTTG GCCTTGATAA AGCCGACCCG
GCGGCCAATG CCGGAAGTCT GAAAACCGAA CCGTGGACGT TGAAAATCAG CGGGGAAGTC
GCGAAGCCAT TTACGCTGGA TTACGACGAT TTAACACATC GTTTTCCATT AGAAGAGCGT
ATCTATCGAA TGCGCTGCGT CGAAGCGTGG TCCATGGTCG TGCCGTGGAT TGGTTTCCCT
TTATATAAGC TACTCGCGCA GGCACAGCCC ACCAGCCACG CTAAATATGT GGCATTCGAA
ACGCTATACG CGCCGGATGA TATGCCAGGA CAGAAAGATC GCTTTATTGG CGGCGGACTG
AAATACCCTT ATGTCGAAGG GCTACGTCTG GACGAAGCCA TGCATCCGCT GACTCTGATG
ACCGTTGGCG TCTATGGTAA GGCGTTACCC CCGCAAAACG GCGCGCCCAT TCGACTCATC
GTTCCATGGA AGTATGGTTT CAAAGGTATT AAATCTGTTG TTAGCATTAA ACTCACCCGC
GAACGTCCGC CAACCACCTG GAATTTGTCG GCTCCCAACG AATATGGTTT TTACGCCAAT
GTGAACCCGC ATGTGGATCA TCCACGCTGG TCTCAGGCTA CCGAACGCTT TATTGGTTCA
GGCGGTATCC TTGATGTGCA AAGGCAGCCG ACGCTGCTGT TTAACGGCTA CGCCAATGAA
GTCGCTTCGC TGTATCGCGG TCTCAATTTG CGGGAGAATT TTTAA
 
Protein sequence
MKKIRPLTEA DVTAESAFFM QRRQVLKALG ISAAALSLPS TAQADLFSWF KGNDRPKAPA 
GKPLEFSQPA AWRSDLALTP EDKVTGYNNF YEFGLDKADP AANAGSLKTE PWTLKISGEV
AKPFTLDYDD LTHRFPLEER IYRMRCVEAW SMVVPWIGFP LYKLLAQAQP TSHAKYVAFE
TLYAPDDMPG QKDRFIGGGL KYPYVEGLRL DEAMHPLTLM TVGVYGKALP PQNGAPIRLI
VPWKYGFKGI KSVVSIKLTR ERPPTTWNLS APNEYGFYAN VNPHVDHPRW SQATERFIGS
GGILDVQRQP TLLFNGYANE VASLYRGLNL RENF