Gene YpAngola_A2142 details


Gene Information

Locus tag: YpAngola_A2142
Symbol: hmsF
ID: 5800612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 2241133
End bp: 2243154
Gene Length: 2022 bp
Protein Length: 673 aa
Translation table: 11
GC content: 47%
IMG OID: 641340050
Product: outer membrane N-deacetylase
Protein accession: YP_001606595
Protein GI: 162420113
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0726] Predicted xylanase/chitin deacetylase
TIGRFAM ID:
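
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2241133
end_bp = 2243154

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2022 0 673
```

Under these assumptions the result agrees with the listed gene length of 2022 bp and protein length of 673 aa.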


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.0474034
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAAGC TATTAATTTT TATCAAATCG CTCATTGTTG GGATGATGAT CGTGTCCACT 
ATGGGGTGTG CGGAGAAGCC AACGTTCGTT CCGCCTGCAC AACGTGCATT GCCGCAAAGT
GAAAGACCGT GGCAAAAGAA TACATTTGTT GTGATTGCTT ATCATGACGT TGAAGACGAT
TCAGCCGATC AACGCTATCT GTCGGTAAGA AGCAGCGCAT TAAATGAGCA GTTCGTGTGG
TTGCGTGATA ACGGTTACCA CGTGGTTTCT GTTGATCAAA TTTTGGCAGC CCGTAATGGT
GGCCCTACAT TGCCGGATAA GGCGGTGCTC CTCACCTTTG ACGATGGTTA CAGCAGTTTT
TATCGGCGCG TTTATCCCTT ACTGAAAGCA TACAAATGGA GTGCCGTATT AGCGCCAGTG
GGCACTTGGA TTGATACTGC CACCGATAAA AAAGTGGATT TTGGTGGCTT GAGCACTGAT
CGGGATCGTT TTGCCACATG GAAGCAGATT ACTGAAATGT CTAAATCAGG GTTGGTTGAA
ATCGGGGCGC ATACTTACGC TTCTCACTAT GGTGTGATTG CTAACCCACA GGGGAATACC
GAACCGGCGG CGGCCAATCT GCAATATGAT CCCAAAACAA AACAGTATGA AACTGTTGAA
GCCTTTAAGC AGCGAATGGA GAAAGACGTT GCGTTGATAA CTCAGCGTAT TGTCCAGGCA
ACAGGGAAAC AACCACGGGT TTGGGTTTGG CCATACGGTG CGCCGAATGG TACGGTGCTA
AATATTTTAC GCCAGCATGG TTATCAACTC GCCATGACCC TAGATCCTGG TGTGGCTAAT
ATTAATGACT TGATGAATAT ACCGCGCATT TTAATCAGTA ATAATCCGTC ACTGAAGGAC
TTTGCCCTCA CGGTTACCAG CGTGCAGGAA AAAAATATCA TGCGGGTGGC GCATGTTGAT
TTGGATTATC TCTACGATCC AGACCCGGCT CAGGAAAAAG AGAATCTTGA TAAGTTGGTT
CAGCGAATTT CTGACCTACG TGTCACTCAC GTTTTCCTGC AAGCATTTTC TGATCCTAAG
GGGGATGGCA ACATTCGCCA GGTTTACTTC CCGAACCGTT GGATCCCGAT GCGTCAGGAT
TTGTTTAACC GGGTGGTATG GCAATTGGCT TCACGGCCTG ATGTTGAAGT CTATGCCTGG
ATGCCGGTAT TGGCGTTTGA TATGGACCCG TCTTTACCTC GGATCACGCG TATTGACCCT
AAAACCGGTA AAACGAGTAT CGACCCAGAT CAATATCGCC GTTTATCACC TTTTAACCCT
GAAGTAAGAC AGCGCATTAT TGATATCTAT CGTGATATGG CCTATAGCGC GCCGATTGAC
GGAATTATCT ATCACGATGA TGCGGTGATG TCTGATTTTG AAGATGCTTC CCCCGATGCT
ATCCGGGCCT ATGAAAAAGC GGGCTTCCCT GGTTCGATTA CCACGATACG CCAAGATCCA
GAGATGATGC AGCGGTGGAC TCGTTACAAG AGTAAGTATC TGATTGATTT TACTAATGAG
TTGACCCGCG AAGTCCGTGA TATCCGTGGC CCACAGGTAA AATCGGCGCG TAATATTTTC
GCTATGCCTA TTTTAGAACC AGAAAGCGAA GCATGGTTTG CACAGAATCT TGATGATTTC
CTCGCTAACT ATGATTGGGT CGCACCAATG GCCATGCCGT TAATGGAGAA AGTGCCGCTA
TCTGAATCTA ATGAATGGTT AGCGGAGCTA GTCAATAAAG TTGCGCAACG CCCCGGTGCT
TTGGAGAAAA CCGTGTTTGA ACTGCAATCT AAGGACTGGA CCCAACCTGA GGGCAATAAC
GCCATCAGCG GCCCAATACT GGCGGGATGG ATGCGCCAGT TGCAGTTAAG TGGTGCGCAA
AGTTTTGGTT ACTACCCGGA TAACTTTATT ACTGGCGAAC CACCGCTAAA AGATGTCCGC
CCTGTGCTGT CTTCTGCCTG GTACCCTTTA TATGATCGAT AG
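
A GC content figure such as the one in the Gene Information fields can be recomputed from a sequence displayed in this spaced, 10-base block layout. The following is a minimal sketch, not the database's own method: `gc_content` is a hypothetical helper name, and it assumes an unambiguous A/C/G/T sequence in which whitespace is only formatting.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Example on the first 10-base block of the gene sequence.
print(f"{gc_content('ATGGCGAAGC'):.0%}")  # 60%
```

Passing the full gene sequence as a single string, line breaks and spaces included, gives the value for the whole gene.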
 
Protein sequence
MAKLLIFIKS LIVGMMIVST MGCAEKPTFV PPAQRALPQS ERPWQKNTFV VIAYHDVEDD 
SADQRYLSVR SSALNEQFVW LRDNGYHVVS VDQILAARNG GPTLPDKAVL LTFDDGYSSF
YRRVYPLLKA YKWSAVLAPV GTWIDTATDK KVDFGGLSTD RDRFATWKQI TEMSKSGLVE
IGAHTYASHY GVIANPQGNT EPAAANLQYD PKTKQYETVE AFKQRMEKDV ALITQRIVQA
TGKQPRVWVW PYGAPNGTVL NILRQHGYQL AMTLDPGVAN INDLMNIPRI LISNNPSLKD
FALTVTSVQE KNIMRVAHVD LDYLYDPDPA QEKENLDKLV QRISDLRVTH VFLQAFSDPK
GDGNIRQVYF PNRWIPMRQD LFNRVVWQLA SRPDVEVYAW MPVLAFDMDP SLPRITRIDP
KTGKTSIDPD QYRRLSPFNP EVRQRIIDIY RDMAYSAPID GIIYHDDAVM SDFEDASPDA
IRAYEKAGFP GSITTIRQDP EMMQRWTRYK SKYLIDFTNE LTREVRDIRG PQVKSARNIF
AMPILEPESE AWFAQNLDDF LANYDWVAPM AMPLMEKVPL SESNEWLAEL VNKVAQRPGA
LEKTVFELQS KDWTQPEGNN AISGPILAGW MRQLQLSGAQ SFGYYPDNFI TGEPPLKDVR
PVLSSAWYPL YDR
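
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions: `translate` and the table names are hypothetical, the input is read in frame from its first base, and only A/C/G/T are expected (any other character raises `KeyError`). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits, so the standard codon layout is used here.

```python
# Codon table in the conventional T, C, A, G ordering of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGGCGAAGC TATTAATTTT TATCAAATCG"))  # MAKLLIFIKS
print(translate("TATGATCGAT AG"))                     # YDR
```

The two fragments reproduce the first ten and last three residues of the protein sequence listed in this record.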