Gene YpsIP31758_1058 details


Gene Information

Locus tag: YpsIP31758_1058
Symbol:
ID: 5387264
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 1262448
End bp: 1263398
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 50%
IMG OID: 640864034
Product: polysaccharide deacetylase family protein
Protein accession: YP_001400039
Protein GI: 153950026
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0726] Predicted xylanase/chitin deacetylase
TIGRFAM ID: [TIGR03212] putative urate catabolism protein
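
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1262448
end_bp = 1263398

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is a stop and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 951
print(protein_length)  # 316
```

Both results agree with the listed Gene Length (951 bp) and Protein Length (316 aa).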


Plasmid Coverage information

Num covering plasmid clones: 48
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATGAGA CTGAACTTAA TCATGATTAT CCACGGGATT TGGCCGGTTA CGGTGGCCAA 
CCCCCAGTCG CTAACTGGCC GGGGCAGGCA CGTATTGCGG TGCAATTTGT CCTGAATATT
GAAGAAGGTG CGGAAAATAA CGTTCTACAT GGCGATGCGG GATCTGAACA ATTTCTTTCC
GATATTATCG GTGCGGACAG TTATCCCGAT AGGCATATGT CGATGGAGTC ACTTTACGAA
TATGGAACTC GTGCTGGCTT CTGGCGTATC CATCAGGAGT TTGTTTCTCG GGGGCTGCCT
ATGACGGTCT TTGGTGTCGC CATGGCATTG GAGCGTAATC CATTGATTGT CGAAGCGATT
AAGTGTGCGG GTTATGATGT GGTTTGCCAT GGCTGGCGTT GGCTCCATTA TCAACACGTC
GATGAGCAAA CTGAACGTGA GCATATGCAG CGGGCCATCA AGATATTACA TGATTTATTC
GGCCAACCCC CTGCAGGCTG GTATACCGGG CGTGATAGCC CAAATACCCG GCGGCTGGTG
GTGGAGAATG GTCACCTTCT GTACGACAGC GATTATTATG GCGATGATTT GCCCTTTTGG
TCGCAGGTCA GGGGAGTTGA TGGCAGTACC ACCCCACATC TGGTCGTGCC TTATACACTG
GATGCCAATG ATATGCGTTT TGCCTCAGCA CAGGGATTTA ACTCCAGTGA GCAGTTTTAT
ACCTATTTAA AAGACAGCTT CGATGTGTTG TACGCAGAGG GTGAAACTGC ACCTAAAATG
ATGTCAGTGG GGATGCACTG TCGGTTATTA GGGCGCCCTG GACGTTTCCG GGCTTTGCAG
CGCTTTTTGG ATTATATCCA GCAACACGAA AGGGTGTGGG TTTGTCGGCG TCAAGAGATT
GCGGAGCATT GGGTTAAACA TCACCCGTTT GAAGGTATCA ATGGCCGGTA G
 
Protein sequence
MHETELNHDY PRDLAGYGGQ PPVANWPGQA RIAVQFVLNI EEGAENNVLH GDAGSEQFLS 
DIIGADSYPD RHMSMESLYE YGTRAGFWRI HQEFVSRGLP MTVFGVAMAL ERNPLIVEAI
KCAGYDVVCH GWRWLHYQHV DEQTEREHMQ RAIKILHDLF GQPPAGWYTG RDSPNTRRLV
VENGHLLYDS DYYGDDLPFW SQVRGVDGST TPHLVVPYTL DANDMRFASA QGFNSSEQFY
TYLKDSFDVL YAEGETAPKM MSVGMHCRLL GRPGRFRALQ RFLDYIQQHE RVWVCRRQEI
AEHWVKHHPF EGINGR
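
The protein sequence is the conceptual translation of the gene sequence under translation table 11. The sketch below shows one way to reproduce it and to compute a GC fraction. It assumes that the amino-acid assignments of table 11 are those of the standard code (the two differ in permitted start codons, which this sketch does not model). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last in-frame fragments of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (assumption: table 11
# shares the standard code's amino-acid assignments).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases; expects a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 21 bases of the gene sequence, both in frame.
first_block = "ATGCATGAGA CTGAACTTAA TCATGATTAT"
last_block = "GAAGGTATCA ATGGCCGGTA G"

print(translate(first_block))  # MHETELNHDY
print(translate(last_block))   # EGINGR
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison against the listed protein sequence and GC content.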