Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpsIP31758_1745 |
Symbol | sufS |
ID | 5386057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 2019270 |
End bp | 2020490 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640864725 |
Product | bifunctional cysteine desulfurase/selenocysteine lyase |
Protein accession | YP_001400720 |
Protein GI | 153947303 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0000031207 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTTC CTATTGAGCG TGTAAGAGCT GATTTTCCAC TGTTGAGCCG CCAGGTTAAT GGGCAGCCGT TGGTTTATCT GGACAGCGCC GCCAGTGCGC AAAAACCTCA GGCGGTCATT GACAAGGAGC TTCATTTTTA CCGTGATGGT TATGCGGCCG TTCATCGGGG CATTCACAGT TTAAGTGCTG AAGCGACTCA GCAAATGGAA GCAGTACGCA CTCAGGTGGC TGATTTTATT CACGCCGCAT CCGCAGAAGA AATTATCTTT GTCCGAGGCA CCACTGAAGC AATCAATTTG GTTGCTAACA GTTATGGCCG CCATTTCCTT GTCACGGGTG ATAGCATTAT CATTACCGAA ATGGAACATC ATGCCAATAT TGTGCCTTGG CAGATGTTGG CGCAAGATCT CGGTGTTGAA ATCCGTGTTT GGCCACTGAC GGCTACCGGT GAGTTAGAGA TAACCGACTT GGCAGCGTTG ATTGATGACA CCACGCGCTT ACTGGCGGTG ACTCAGATCT CCAACGTGTT GGGAACGGTA AACCCGATTA AGGATATTGT GGCTCAGGCA AAAGCTGCTG GTTTGGTGGT GCTGGTGGAT GGTGCGCAAG CGGTTATGCA TCAGCCAGTT GATGTTCAGG CGTTGGGCTG CGATTTTTAT GTTTTCTCAG GGCACAAACT GTACGGCCCA TCGGGTATTG GGATTCTGTA CGGCAAAAGT GCGTTGTTAC AACAGATGCC GCCATGGGAA GGGGGCGGGG CGATGATCAA AACAGTCAGT TTGACGCAAG GCACTACGTT TGCTGATGCC CCTTGGCGCT TTGAGGCTGG GTCACCTAAT ACTGCGGGGA TCATGGGGCT TGGCGCGGCC ATTGACTATG TCACTGAATT AGGGCTCTTG CAGATCCAAC AGTATGAACA ATCGCTGATG CATTACGCAT TGGCGCAACT GAGCCAGATT AAGAGCCTGA CACTGTATGG CCCAACAGAG CGTGCCGGGG TTATTGCCTT CAATCTGGGC CTGCACCATG CCTATGATGT GGGCAGCTTT CTTGACCAAT ACGGTATTGC TATTCGTACG GGTCATCACT GTGCGATGCC GCTGATGGCA TTCTATCAGG TACCGAGTAT GTGCCGTGCC TCACTGGCGC TGTATAATAC CCGCGAGGAT GTTGATCGGT TGGTGGCAGG ATTACAGCGT ATCGAAAAAT TGCTGGGGTG A
|
Protein sequence | MSFPIERVRA DFPLLSRQVN GQPLVYLDSA ASAQKPQAVI DKELHFYRDG YAAVHRGIHS LSAEATQQME AVRTQVADFI HAASAEEIIF VRGTTEAINL VANSYGRHFL VTGDSIIITE MEHHANIVPW QMLAQDLGVE IRVWPLTATG ELEITDLAAL IDDTTRLLAV TQISNVLGTV NPIKDIVAQA KAAGLVVLVD GAQAVMHQPV DVQALGCDFY VFSGHKLYGP SGIGILYGKS ALLQQMPPWE GGGAMIKTVS LTQGTTFADA PWRFEAGSPN TAGIMGLGAA IDYVTELGLL QIQQYEQSLM HYALAQLSQI KSLTLYGPTE RAGVIAFNLG LHHAYDVGSF LDQYGIAIRT GHHCAMPLMA FYQVPSMCRA SLALYNTRED VDRLVAGLQR IEKLLG
|
| |