Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A3737 |
Symbol | |
ID | 6873535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 3588921 |
End bp | 3589925 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642786712 |
Product | putative sulfite oxidase subunit YedY |
Protein accession | YP_002217346 |
Protein GI | 198245414 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.256044 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.0204641 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGA TACGTCCATT AACAGAAGCC GATGTGACTG CGGAATCGGC TTTTTTTATG CAGCGCCGAC AGGTGCTAAA AGCATTAGGC ATCAGCGCGG CCGCCTTATC CTTACCCTCA ACGGCGCAGG CCGATCTCTT CAGTTGGTTT AAAGGCAACG ATCGTCCGAA AGCGCCTGCC GGTAAACCGC TTGAGTTTAG TCAGCCTGCC GCCTGGCGAA GCGATTTAGC GTTAACGCCG GAAGATAAGG TGACGGGCTA CAACAATTTC TATGAGTTTG GCCTTGATAA AGCCGACCCG GCGGCCAATG CCGGAAGTCT GAAAACCGAA CCGTGGACGT TGAAAATCAG CGGGGAAGTC GCGAAGCCAT TTACGCTGGA TTATGACGAT TTAACACATC GTTTCCCATT AGAAGAGCGT ATCTATCGAA TGCGCTGCGT CGAAGCGTGG TCCATGGTCG TGCCGTGGAT TGGTTTCCCT TTATATAAGC TACTCGCGCA GGCACAGCCC ACCAGCCACG CTAAATATGT GGCATTCGAA ACGCTATACG CGCCGGATGA TATGCCAGGA CAGAAAGATC GTTTTATTGG CGGCGGACTG AAATACCCTT ATGTCGAAGG GCTACGTCTG GACGAAGCCA TGCATCCGCT GACTCTGATG ACCGTTGGCG TCTATGGTAA GGCGTTACCC CCGCAAAACG GCGCGCCCAT TCGACTCATC GTTCCATGGA AGTATGGTTT TAAAGGTATT AAATCTATTG TCAGCATTAA ACTCACCCGC GAACGTCCGC TAACCACCTG GAATTTGTCG GCTCCCAACG AATATGGTTT TTACGCCAAT GTGAACCCGC ATGTGGATCA TCCACGCTGG TCTCAGGCTA CCGAACGCTT TATTGGTTCA GGCGGTATCC TCGATGTGCA AAGGCAGCCG ACGCTGCTGT TTAACGGCTA TGCCAATGAA GTCGCTTCGC TGTATCGCGG TCTCAATTTG CGGGAGAATT TTTAA
|
Protein sequence | MKKIRPLTEA DVTAESAFFM QRRQVLKALG ISAAALSLPS TAQADLFSWF KGNDRPKAPA GKPLEFSQPA AWRSDLALTP EDKVTGYNNF YEFGLDKADP AANAGSLKTE PWTLKISGEV AKPFTLDYDD LTHRFPLEER IYRMRCVEAW SMVVPWIGFP LYKLLAQAQP TSHAKYVAFE TLYAPDDMPG QKDRFIGGGL KYPYVEGLRL DEAMHPLTLM TVGVYGKALP PQNGAPIRLI VPWKYGFKGI KSIVSIKLTR ERPLTTWNLS APNEYGFYAN VNPHVDHPRW SQATERFIGS GGILDVQRQP TLLFNGYANE VASLYRGLNL RENF
|
| |