Gene YpsIP31758_0354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_0354 
SymbolssuD 
ID5387716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp413769 
End bp414917 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content55% 
IMG OID640863323 
Productalkanesulfonate monooxygenase 
Protein accessionYP_001399347 
Protein GI153950813 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03565] alkanesulfonate monooxygenase, FMNH(2)-dependent 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATTA ACGTATTCTG GTTTTTACCT ACCCATGGCG ATGGCCACTA TCTGGGCAGT 
TCTGAAGGGG CACGCGCGGT CGATTATAGT TATCTACAGC AAATTGCCCA AGCGGCTGAC
CGGCTTGGGT TCGGTGGTGT GTTGATCCCA ACGGGCCGTT CTTGTGAGGA TTCCTGGTTA
GTTGCCGCCT CGCTGATCCC CGTCACCCAA CGGCTGAAAT TTCTGGTCGC GCTGCGCCCT
GGCATCATCT CCCCCACGCT GGCCGCCAGG CAGGCCGCTA CGTTGGATCG GCTTTCTAAT
GGCCGGGCGC TGTTTAATCT GGTCACCGGT GGCGATCCAG AAGAGTTGGC CGCAGAAGGA
TTACACCTTA ATCACACTGA GCGCTACGAG GCTTCTGCCG AATTTACTCA TGTGTGGCGC
AAGGTGCTGG AAGGTGAAAC CGTTGATTTT GCTGGCAAGC ATATTCAGGT AAAAGGCGCC
AAGCTGTTAT TTCCGCCGGT TCAACATCCA CGGCCTCCAC TGTATTTTGG CGGCTCCTCG
GCAGCAGCTC AAGATTTGGC CGCTGAACAG GTTGAGCTGT ATCTGACTTG GGGAGAAACC
CCCGAACAGG TGAAGGAAAA AGTTGAGGAG GTGCGAGCTA AAGCAGCGGC AAAAGGGCGC
ACAGTACGTT TCGGCATCCG CTTGCATGTC ATCGTCCGTG AAACGACCGA AGAGGCTTGG
CGCGCAGCCA ACCGTTTGAT AGCTAATCTG GATGATAAAA CCATTGCCGA TGCGCAGCAG
GCTTTCGCTC GTTTCGATTC TGTTGGTCAG CAGCGCATGG CTGCACTGCA CGGCGGTAAG
AAAGATAATT TGGAAATCAG CCCGAATCTG TGGGCCGGTG TTGGGCTGGT GAGGGGCGGG
GCGGGTACCG CATTGGTAGG CGATGGCCCG ACTGTCGCAC AACGGATCCA GGAATATGCT
GACCTGGGGA TTGATACCTT TGTTTTCTCC GGTTATCCGC ACCTTGAAGA AGCTTATCGT
GTGAGTGAAT TGTTGTTCCC ACATCTTGAT TTGGCTACCA CCGAGCTACC GACGCAGCGG
CCTGCAACCC AACCGCAAGG TGAAGTAGTC GCCAATATTT ATGTGCCACA AAAAGTCTCG
CAAAGCTGA
 
Protein sequence
MSINVFWFLP THGDGHYLGS SEGARAVDYS YLQQIAQAAD RLGFGGVLIP TGRSCEDSWL 
VAASLIPVTQ RLKFLVALRP GIISPTLAAR QAATLDRLSN GRALFNLVTG GDPEELAAEG
LHLNHTERYE ASAEFTHVWR KVLEGETVDF AGKHIQVKGA KLLFPPVQHP RPPLYFGGSS
AAAQDLAAEQ VELYLTWGET PEQVKEKVEE VRAKAAAKGR TVRFGIRLHV IVRETTEEAW
RAANRLIANL DDKTIADAQQ AFARFDSVGQ QRMAALHGGK KDNLEISPNL WAGVGLVRGG
AGTALVGDGP TVAQRIQEYA DLGIDTFVFS GYPHLEEAYR VSELLFPHLD LATTELPTQR
PATQPQGEVV ANIYVPQKVS QS