Gene YpsIP31758_2130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_2130 
SymbolhmsF 
ID5386159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp2447619 
End bp2449640 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content48% 
IMG OID640865116 
Productouter membrane N-deacetylase 
Protein accessionYP_001401103 
Protein GI153950323 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAAGC TATTAATTTT TATCAAATCG CTCATTGTTG GGATGATGAT CGTGTCCACT 
ATGGGGTGTG CGGAGAAGCC AACGTTCGTT CCGCCTGCAC AACGTGCATT GCCGCAAAGT
GAAAGACCGT GGCAAAAGAA TACATTTGTT GTGATTGCTT ATCATGACGT TGAAGACGAT
TCAGCCGATC AACGCTATCT GTCGGTAAGA AGCAGCGCAT TAAATGAGCA GTTCGTGTGG
TTGCGTGATA ACGGTTACCA CGTGGTTTCT GTTGATCAAA TTTTGGCAGC CCGTAATGGT
GGCCCTACAT TGCCGGATAA GGCGGTGCTC CTCACCTTTG ACGATGGTTA CAGCAGTTTT
TATCGGCGCG TTTATCCCTT ACTGAAAGCA TACAAATGGA GTGCCGTATT AGCGCCAGTG
GGCACTTGGA TTGATACCGC CACCGATAAA AAAGTGGATT TCGGTGGCTT GAGCACTGAT
CGGGATCGTT TTGCCACCTG GAAGCAGATT ACTGAAATGT CTAAATCAGG GTTGGTTGAA
ATCGGGGCGC ATACTTACGC TTCTCACTAT GGTGTGATTG CTAACCCACA GGGGAATACC
GAACCGGCGG CGGCCAATCT GCAATATGAT CCCAAAACAA AACAGTATGA AACTGTTGAA
GCCTTTAAGC AGCGAATGGA GAAAGACGTT GCGTTGATAA CTCAGCGTAT TGTCCAGGCA
ACAGGGAAAC AACCACGGGT TTGGGTTTGG CCATACGGTG CGCCGAATGG TACGGTGCTA
AATATTTTAC GCCAGCATGG TTATCAACTC GCCATGACCC TAGATCCTGG TGTGGCTAAT
ATTAATGACT TGATGAATAT ACCGCGCATT TTAATCAGTA ATAATCCGTC ACTGAAGGAC
TTTGCCCTCA CGGTTACCAG CGTGCAGGAA AAAAATATCA TGCGGGTGGC GCATGTTGAT
TTGGATTATC TCTACGATCC AGACCCGGCT CAGGAAAAAG AGAATCTTGA TAAGTTGGTT
CAGCGAATTT CTGACCTACG TGTCACTCAC GTTTTCCTGC AAGCATTTTC TGATCCGAAG
GGGGATGGCA ACATTCGCCA GGTTTACTTC CCGAACCGTT GGATCCCGAT GCGTCAGGAT
TTGTTTAACC GGGTGGTATG GCAATTGGCT TCACGGCCTG ATGTTGAAGT CTATGCCTGG
ATGCCGGTAT TGGCGTTTGA TATGGACCCG TCTTTACCTC GGATCACGCG TATTGACCCT
AAAACCGGTA AAACGAGTAT CGACCCAGAT CAATATCGCC GTTTGTCACC TTTTAACCCT
GAAGTAAGAC AGCGCATTAT TGATATCTAT CGTGATATGG CCTATAGCGC GCCGATTGAC
GGAATTATCT ATCACGATGA TGCGGTGATG TCTGATTTTG AAGATGCTTC CCCCGATGCT
ATCCGGGCCT ATGAAAAAGC GGGCTTCCCT GGTTCGATTA CCACGATACG CCAAGATCCA
GAGATGATGC AGCGGTGGAC TCGTTACAAG AGTAAGTATC TGATTGATTT TACTAATGAG
TTGACCCGCG AAGTCCGTGA TATCCGTGGC CCACAGGTAA AATCGGCGCG TAATATTTTC
GCTATGCCTA TTTTAGAACC AGAAAGCGAA GCATGGTTTG CACAGAATCT TGATGATTTC
CTCGCTAACT ATGATTGGGT CGCACCAATG GCCATGCCGT TAATGGAGAA AGTGCCGCTA
TCTGAATCTA ATGAATGGTT AGCGGAGCTA GTCAATAAAG TTGCGCAACG CCCCGGTGCT
TTGGAGAAAA CCGTGTTTGA ACTGCAATCT AAGGACTGGA CCCAACCTGA GGGCAATAAC
GCCATCAGCG GCCCAATACT GGCGGGATGG ATGCGCCAGT TGCAGTTAAG TGGTGCGCAA
AGTTTTGGTT ACTACCCGGA TAACTTTATT ACTGGCGAAC CACCGCTAAA AGATGTCCGC
CCTGTGCTGT CTTCTGCCTG GTACCCTTTA TATGATCGAT AG
 
Protein sequence
MAKLLIFIKS LIVGMMIVST MGCAEKPTFV PPAQRALPQS ERPWQKNTFV VIAYHDVEDD 
SADQRYLSVR SSALNEQFVW LRDNGYHVVS VDQILAARNG GPTLPDKAVL LTFDDGYSSF
YRRVYPLLKA YKWSAVLAPV GTWIDTATDK KVDFGGLSTD RDRFATWKQI TEMSKSGLVE
IGAHTYASHY GVIANPQGNT EPAAANLQYD PKTKQYETVE AFKQRMEKDV ALITQRIVQA
TGKQPRVWVW PYGAPNGTVL NILRQHGYQL AMTLDPGVAN INDLMNIPRI LISNNPSLKD
FALTVTSVQE KNIMRVAHVD LDYLYDPDPA QEKENLDKLV QRISDLRVTH VFLQAFSDPK
GDGNIRQVYF PNRWIPMRQD LFNRVVWQLA SRPDVEVYAW MPVLAFDMDP SLPRITRIDP
KTGKTSIDPD QYRRLSPFNP EVRQRIIDIY RDMAYSAPID GIIYHDDAVM SDFEDASPDA
IRAYEKAGFP GSITTIRQDP EMMQRWTRYK SKYLIDFTNE LTREVRDIRG PQVKSARNIF
AMPILEPESE AWFAQNLDDF LANYDWVAPM AMPLMEKVPL SESNEWLAEL VNKVAQRPGA
LEKTVFELQS KDWTQPEGNN AISGPILAGW MRQLQLSGAQ SFGYYPDNFI TGEPPLKDVR
PVLSSAWYPL YDR