Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3640 |
Symbol | |
ID | 6483753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 3530139 |
End bp | 3531143 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642738915 |
Product | putative sulfite oxidase subunit YedY |
Protein accession | YP_002042632 |
Protein GI | 194445137 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.145156 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.000496426 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAAGA TACGTCCATT AACAGAAGCC GATGTGACTG CGGAATCGGC TTTTTTTATG CAGCGCCGAC AGGTGCTAAA AGCATTAGGC ATCAGCGCGG CCGCCTTATC CTTACCCTCA ACGGCGCAGG CCGATCTCTT CAGTTGGTTT AAAGGCAACG ATCGTCCGAA AGCGCCTGCC GGTAAACCGC TTGAGTTTAG TCAGCCTGCC GCCTGGCGAA GCGATTTAGC GTTAACGCCG GAAGATAAAG TGACGGGCTA CAACAATTTC TATGAGTTTG GCCTTGATAA AGCCGACCCG GCGGCCAATG CCGGAAGTCT GAAAACCGAA CCGTGGACGT TGAAAATCAG CGGGGAAGTC GCGAAGCCAT TTACGCTGGA TTACGACGAT TTAACACATC GTTTCCCATT AGAAGAGCGT ATCTATCGAA TGCGCTGCGT CGAAGCGTGG TCCATGGTCG TGCCGTGGAT TGGTTTCCCT TTATATAAGC TACTCGCGCA GGCACAGCCC ACCAGCCACG CTAAATATGT GGCATTCGAA ACGCTATACG CGCCGGATGA TATGCCAGGA CAGAAAGATC GCTTTATTGG CGGCGGACTG AAATACCCTT ATGTCGAAGG GCTACGTCTG GATGAAGCCA TGCATCCGCT GACTCTGATG ACCGTTGGCG TCTATGGTAA GGCGTTACCC CCGCAAAACG GCGCGCCCAT TCGACTCATC GTTCCATGGA AGTATGGTTT TAAAGGTATT AAATCTATTG TCAGCATTAA ACTCACCCGC GAACGTCCGC CAACCACCTG GAATTTGTCG GCTCCCAACG AATATGGTTT TTACGCCAAT GTGAACCCGC ATGTGGATCA TCCACGCTGG TCTCAGGCTA CCGAACGCTT TATTGGTTCA GGCGGTATCC TTGATGTGCA AAGGCAGCCG ACGCTGCTGT TTAACGGCTA CGCCAATGAA GTCGCTTCGC TGTATCGCGG TCTCAATTTG CGGGAGAATT TTTAA
|
Protein sequence | MKKIRPLTEA DVTAESAFFM QRRQVLKALG ISAAALSLPS TAQADLFSWF KGNDRPKAPA GKPLEFSQPA AWRSDLALTP EDKVTGYNNF YEFGLDKADP AANAGSLKTE PWTLKISGEV AKPFTLDYDD LTHRFPLEER IYRMRCVEAW SMVVPWIGFP LYKLLAQAQP TSHAKYVAFE TLYAPDDMPG QKDRFIGGGL KYPYVEGLRL DEAMHPLTLM TVGVYGKALP PQNGAPIRLI VPWKYGFKGI KSIVSIKLTR ERPPTTWNLS APNEYGFYAN VNPHVDHPRW SQATERFIGS GGILDVQRQP TLLFNGYANE VASLYRGLNL RENF
|
| |