Gene YpsIP31758_1745 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_1745 
SymbolsufS 
ID5386057 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp2019270 
End bp2020490 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content51% 
IMG OID640864725 
Productbifunctional cysteine desulfurase/selenocysteine lyase 
Protein accessionYP_001400720 
Protein GI153947303 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0000031207 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTTC CTATTGAGCG TGTAAGAGCT GATTTTCCAC TGTTGAGCCG CCAGGTTAAT 
GGGCAGCCGT TGGTTTATCT GGACAGCGCC GCCAGTGCGC AAAAACCTCA GGCGGTCATT
GACAAGGAGC TTCATTTTTA CCGTGATGGT TATGCGGCCG TTCATCGGGG CATTCACAGT
TTAAGTGCTG AAGCGACTCA GCAAATGGAA GCAGTACGCA CTCAGGTGGC TGATTTTATT
CACGCCGCAT CCGCAGAAGA AATTATCTTT GTCCGAGGCA CCACTGAAGC AATCAATTTG
GTTGCTAACA GTTATGGCCG CCATTTCCTT GTCACGGGTG ATAGCATTAT CATTACCGAA
ATGGAACATC ATGCCAATAT TGTGCCTTGG CAGATGTTGG CGCAAGATCT CGGTGTTGAA
ATCCGTGTTT GGCCACTGAC GGCTACCGGT GAGTTAGAGA TAACCGACTT GGCAGCGTTG
ATTGATGACA CCACGCGCTT ACTGGCGGTG ACTCAGATCT CCAACGTGTT GGGAACGGTA
AACCCGATTA AGGATATTGT GGCTCAGGCA AAAGCTGCTG GTTTGGTGGT GCTGGTGGAT
GGTGCGCAAG CGGTTATGCA TCAGCCAGTT GATGTTCAGG CGTTGGGCTG CGATTTTTAT
GTTTTCTCAG GGCACAAACT GTACGGCCCA TCGGGTATTG GGATTCTGTA CGGCAAAAGT
GCGTTGTTAC AACAGATGCC GCCATGGGAA GGGGGCGGGG CGATGATCAA AACAGTCAGT
TTGACGCAAG GCACTACGTT TGCTGATGCC CCTTGGCGCT TTGAGGCTGG GTCACCTAAT
ACTGCGGGGA TCATGGGGCT TGGCGCGGCC ATTGACTATG TCACTGAATT AGGGCTCTTG
CAGATCCAAC AGTATGAACA ATCGCTGATG CATTACGCAT TGGCGCAACT GAGCCAGATT
AAGAGCCTGA CACTGTATGG CCCAACAGAG CGTGCCGGGG TTATTGCCTT CAATCTGGGC
CTGCACCATG CCTATGATGT GGGCAGCTTT CTTGACCAAT ACGGTATTGC TATTCGTACG
GGTCATCACT GTGCGATGCC GCTGATGGCA TTCTATCAGG TACCGAGTAT GTGCCGTGCC
TCACTGGCGC TGTATAATAC CCGCGAGGAT GTTGATCGGT TGGTGGCAGG ATTACAGCGT
ATCGAAAAAT TGCTGGGGTG A
 
Protein sequence
MSFPIERVRA DFPLLSRQVN GQPLVYLDSA ASAQKPQAVI DKELHFYRDG YAAVHRGIHS 
LSAEATQQME AVRTQVADFI HAASAEEIIF VRGTTEAINL VANSYGRHFL VTGDSIIITE
MEHHANIVPW QMLAQDLGVE IRVWPLTATG ELEITDLAAL IDDTTRLLAV TQISNVLGTV
NPIKDIVAQA KAAGLVVLVD GAQAVMHQPV DVQALGCDFY VFSGHKLYGP SGIGILYGKS
ALLQQMPPWE GGGAMIKTVS LTQGTTFADA PWRFEAGSPN TAGIMGLGAA IDYVTELGLL
QIQQYEQSLM HYALAQLSQI KSLTLYGPTE RAGVIAFNLG LHHAYDVGSF LDQYGIAIRT
GHHCAMPLMA FYQVPSMCRA SLALYNTRED VDRLVAGLQR IEKLLG