Gene YPK_1851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_1851 
Symbol 
ID6088614 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp2056242 
End bp2057462 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content52% 
IMG OID641596919 
Productbifunctional cysteine desulfurase/selenocysteine lyase 
Protein accessionYP_001720595 
Protein GI170024090 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00184689 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTTC CTATTGAACG TGTAAGAGCT GATTTTCCAC TGTTGAGCCG CCAGGTTAAT 
GGGCAGCCGT TGGTTTATCT GGACAGCGCC GCCAGTGCGC AAAAACCTCA GGCGGTCATT
GACAAGGAGC TTCATTTTTA CCGTGATGGT TATGCGGCCG TTCATAGGGG CATTCACAGT
TTAAGTGCTG AAGCGACTCA GCAGATGGAA GCCGTACGCA CTCAGGTGGC TGATTTTATT
CACGCCGCCT CCGCAGAAGA AATTATTTTT GTCAGAGGCA CCACTGAAGC AATCAATTTG
GTTGCTAACA GTTATGGCCG CCATTTCCTT GCCGCGGGTG ATAGTATTAT CATTACCGAA
ATGGAACACC ATGCCAATAT TGTGCCTTGG CAGATGTTGG CGCAAGATCT TGGTGTTGAA
ATCCGTGTTT GGCCACTGAC GGCTACCGGT GAGTTGGAAA TAACCGCCCT GGCAGCGTTG
ATTGATGACA CCACGCGCTT ACTGGCGGTG ACTCAGGTCT CCAACGTGTT GGGAACGGTA
AACCCGATTA AGGATATTGT GGCCCAGGCA AAAGCCGCCG GTTTGGTGGT GTTGGTGGAT
GGTGCGCAAG CGGTTATGCA TCAGCCAGTT GATGTTCAGG CGTTGGGCTG CGATTTTTAT
GTTTTCTCAG GCCACAAACT GTACGGCCCA TCGGGTATTG GGATTCTGTA CGGCAAAAGT
GCGTTGTTAC AACAGATGCC GCCATGGGAA GGGGGCGGGG CGATGATCAA AACAGTCAGT
TTGACGCAAG GTACTACGTT TGCTGACGCC CCTTGGCGCT TTGAGGCTGG GTCACCTAAT
ACTGCGGGTA TCATGGGGCT TGGCGCGGCC ATTGACTATG TCACTGAATT GGGGCTCTTG
CCGATCCAAC AGTATGAGCA ATCGCTGATG CATTACGCAT TGGCGCAACT GAGCCAGATT
AAGAGCCTGA CACTGTATGG CCCAACAGAG CGTGCCGGGG TTATTGCCTT CAATCTGGGC
CAGCACCATG CCTATGATGT GGGCAGCTTT CTTGACCAAT ACGGTATTGC TATTCGGACG
GGTCATCACT GTGCGATGCC GCTGATGGCA TTCTATCAGG TACCGAGTAT GTGCCGTGCC
TCACTGGCGC TGTATAATAC CCGCGAGGAT GTTGATCGGT TGGTGGCAGG ATTACAGCGT
ATCGAAAAAT TGCTGGGGTG A
 
Protein sequence
MNFPIERVRA DFPLLSRQVN GQPLVYLDSA ASAQKPQAVI DKELHFYRDG YAAVHRGIHS 
LSAEATQQME AVRTQVADFI HAASAEEIIF VRGTTEAINL VANSYGRHFL AAGDSIIITE
MEHHANIVPW QMLAQDLGVE IRVWPLTATG ELEITALAAL IDDTTRLLAV TQVSNVLGTV
NPIKDIVAQA KAAGLVVLVD GAQAVMHQPV DVQALGCDFY VFSGHKLYGP SGIGILYGKS
ALLQQMPPWE GGGAMIKTVS LTQGTTFADA PWRFEAGSPN TAGIMGLGAA IDYVTELGLL
PIQQYEQSLM HYALAQLSQI KSLTLYGPTE RAGVIAFNLG QHHAYDVGSF LDQYGIAIRT
GHHCAMPLMA FYQVPSMCRA SLALYNTRED VDRLVAGLQR IEKLLG