Gene YpsIP31758_0998 details

Gene Information

Locus tag: YpsIP31758_0998
Symbol: csdA
ID: 5384973
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 1199043
End bp: 1200248
Gene length: 1206 bp
Protein length: 401 aa
Translation table: 11
GC content: 47%
IMG OID: 640863968
Product: cysteine sulfinate desulfinase
Protein accession: YP_001399982
Protein GI: 153950116
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
            [TIGR03392] cysteine desulfurase, catalytic subunit CsdA
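
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1199043
end_bp = 1200248

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed gene length of 1206 bp and protein length of 401 aa.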


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.000110695
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGTTT TTAATCCAAT GGATTTTCGT CGGGAATTCC CTGCGCTCAG TGATAAATTA 
ACCTATCTGG ACAGTGCGGC GACCGCCTTG AAACCACGTG CAATGATTGA CGCGACACAG
CAATTTTATC AGCAGGATTC AGCAACGGTA CACCGCAGCC AACATCAATC GGCGCTGTCA
TTAACGGTTC GCTTTGAAAA CACCCGCCAA CAAGTGGCTG ATTTTATTAA CTCATCTACA
GCAGAAAATA TTATCTGGAC GCGAGGAACA ACTGAAGCCA TCAATCTGAT CGCGCAAAGT
TATGCCCGCC CCCGTTTACA ACCTGAAGAT GAAATTATTG TCAGCGAAGC TGAACATCAT
GCGAATTTAA TTCCCTGGTT GATGGTAGCG GAGCAGACCG GTGCAAAAAT AGTCAAATTA
CCTCTTGGCC TTGATCATCT GCCAGATTTA CAGCAACTCC CTCAACTACT TAATGAAAAA
ACACGCATAT TAGCGCTGGG GCAGATGTCT AACGTAACAG GCGGTAGCCC TGATCTGGCT
CAGGCTATTA GGCTGGCTCA CCAATATGAC TGTGTTGTCG TGGTTGACGG TGCTCAGGGG
ATTGTTCATT GCCCAGCCGA TGTTCAGGCA TTGGATATTG ATTTTTATGC ATTCTCTTCC
CATAAATTGT ATGGCCCAAC CGGCATTGGC GTGCTGTATG GGAAGACTGA ATTATTAGAA
GAGATGCCCG CCTGGCAAGG CGGCGGTAAA ATGCTTACCC ATGTATCATT CGGGGGCTTT
ACACCTCATG AAGTGCCTTA TCGCTTTGAA GCGGGTACAC CCAATATTGC TGGCGTTATT
GGTTTATCAG CGGTACTCAA ATGGCTGGAA CATATTGATC TGGAAGAGGC CGAAGTTTAT
AGCCAAGGTT TAGCTACAAT GGCAGAAAAT AAGCTCGCAC AATTACCGGG TTTTCACAGT
TACCGTTGCC AGCAATCCAG TTTATTAGCA TTTACTTTCG ATGGTGTTCA TCACAGTGAT
TTAGTGGCGT TATTGGCCGA GCAAGGTATC GCACTACGTG CTGGGCAACA CTGCGCACAG
CCACTGATGG CCGCTCTGGG AGTCAATGGC AGTCTACGGG CTTCTTTTGC GCCTTATAAT
ACCCCCCAAG ATGTTGAAAT GCTTTGCTCG GCGCTTGGTA AGGCATTGGA ACTGCTTCAA
GACTAA
 
Protein sequence
MKVFNPMDFR REFPALSDKL TYLDSAATAL KPRAMIDATQ QFYQQDSATV HRSQHQSALS 
LTVRFENTRQ QVADFINSST AENIIWTRGT TEAINLIAQS YARPRLQPED EIIVSEAEHH
ANLIPWLMVA EQTGAKIVKL PLGLDHLPDL QQLPQLLNEK TRILALGQMS NVTGGSPDLA
QAIRLAHQYD CVVVVDGAQG IVHCPADVQA LDIDFYAFSS HKLYGPTGIG VLYGKTELLE
EMPAWQGGGK MLTHVSFGGF TPHEVPYRFE AGTPNIAGVI GLSAVLKWLE HIDLEEAEVY
SQGLATMAEN KLAQLPGFHS YRCQQSSLLA FTFDGVHHSD LVALLAEQGI ALRAGQHCAQ
PLMAALGVNG SLRASFAPYN TPQDVEMLCS ALGKALELLQ D
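
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any further analysis. The following is a minimal sketch of that step, using only the first line of the gene sequence as input; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of the source database.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence shown above (not the full gene).
first_line = "ATGAAAGTTT TTAATCCAAT GGATTTTCGT CGGGAATTCC CTGCGCTCAG TGATAAATTA"

seq = clean_sequence(first_line)
print(len(seq))                    # number of bases in this line
print(seq[:3])                     # first codon of the gene
print(round(gc_fraction(seq), 3))  # GC fraction of this line only
```

Applying the same functions to the full gene sequence would give a figure comparable to the GC content field in the record; the value printed here covers only the first 60 bases.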