Gene YPK_2241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_2241 
SymbolhmsH 
ID6088562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp2486293 
End bp2488767 
Gene Length2475 bp 
Protein Length824 aa 
Translation table11 
GC content48% 
IMG OID641597306 
Productouter membrane protein 
Protein accessionYP_001720975 
Protein GI170024470 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATAACG CATTTACAAC ACTACTGCGC CCGTTGCACT GGCACAGAAT GACTTTATTA 
GCGCTATTTA TTTCTGGCTT TGTCGTTAAT CCTGCTATGG CGGATACGCA GTATGACTCA
TTAATTATTC GTGCCAGAGC CGGTGATACT GCGCCAGTCT TGGACTATTT ACAGGAAGAA
TCTAAAGCGG GTCCATTGCA GAGTGGGCAG GTTGATGACT GGCTGCAAAT TGCAGGTTGG
GCAGGGCGCG ATCAAGAAGT GATTGATGTT TATGAACGAT ATCAATCTTC CATGAATATC
TCTTCCAGAG GTTTGGCTTC TGCCGCTCGA GCTTACCGCA ACGAAAAGCG TTGGGATCAG
GCTTTGGCCT TGTGGCAAAG CAGCTTAAAG AAAGACCCCA CCAATCCTGA TCTTATCAGC
GGTATGATCA TGACTCAGGC TGATGCTGGA CGCGGTGGGG TTGTTTTAAA ACAGGCAACA
GAGTTAGCCG AGCGTGATCC CACGGTACAA AATTACATGA CGCTGTCTTA TCTAAACCGG
GCTACTGACC GTAATTATGA TGCTTTGCAA GCCTCCAGCG AAGCGGTGCG ATTGGCACCA
ACGTCTGAAG AAGTGCTCAA AAATCATCTC GAAATCTTAC AGAGAAACCG TATTGTTGAG
CCAGCGTTAC GGCTTGCTAA AGAGAACCCA AATTTGGTTT CTGCTGAGCA TTATCGCCAA
CTTGAGCGCG ATGCTGCTGC GGAACAAGTT CGGATGGCGG TGTTGCCAAC GCGCAGCGAA
ACCGAGCGTT TTGACATAGC GGATAAGGCG TTAGCGGATT ATCAGAATTT ATTGACCCGT
TGGGGCAAAG ATCCAGAGGC ACAGGCTGAC TACCAACGCG CACGCATTGA CCGATTAGGG
GCGTTATTGG TCCGTCATCA GACGGCTGAT CTGATCAAAG AGTATGAAGC GATGGAGGCT
GAGGGCTATA AAATGCCTGA TTATGCCCGC CGTTGGGCGG CTTCTGCCTA TATTGACCGA
AGGTTGCCGG AAAAAGCAGC GCCTATCCTC TCCAGCCTCT ATTATTCAGA CGGTAAAACC
TTCCGTAACA GCGACGATTT GCTTGATGCC GATGATCTTT ACTATTCACT GAATGAAAGC
GAACAACTTG ATAAAGCCTA CCAATTTGCG GTCAATTATA GCGAGCAAAC GCCGTATCAG
GTTGGGGTTT ATGGCCTGCC GGGCAAAGAA CCGAATGATG ATTGGATTGA AGGCCAAACG
CTATTGGTTC AATCATTGGT GGCATTGAAT GATCTGCCCA CCGCACAAAA AAAATTGGAA
GATTTATCCA GTACTGCACC CGCTAATCAG AATTTAAGGA TCGCATTAGC CAGTATTTAT
CTGGCCCGGG ATTTACCGCG TAAAGCGGAG CAAGAATTGA AGGCCGTGGA ATCATTAGCA
CCACGCAGCC TTATTCTGGA ACGTGCGCAG GCTGAAACTG CGATGGCGCT ACAAGAATGG
CATCAGATGG AGTTACTGAC TGATGATGTG ATTAGCCGTT CCCCAGAAGA TATCCCCTCT
CAAGAATTAG ATCGCCAACG TAAAGTCCAT AACATGTATG AACTACGGGT GAGTGGTAAC
CGCGTTATCT CCTCCAACAG CCCGGTGAGC GGCAATAAAG ATTATGGGGT CGAAACTCTC
CTTTACAGCC CACCTATTGC AGAGAATTGG CGCGTATTTG GTGGAGGCAG TTATAACAAT
GGGCAGTTTG AAGAGGGGAC GGGGATTAGC CGGATTTTGC GCTTAGGTGG TGAATGGACC
TCCCGTGACC ATTGGGTTGA AGGGGAAATT TCCAATCAGA ATTATGGCAA TGGCAATGGC
AATAAAGTCG GGGCCCGTTT ATCAACCTGG TACGACCTTA ATGACCACTG GCGTGTAGGG
GGGCAGGTTG AACGTTTAGC TAAAGACACA CCACTACGGG CACTGAAGAA TAAAGTGACC
GCGAATAGCG CGTCTGCTTA TGTTTTCTGG AAAGCAGATG ATAAGCGTGA TGCTGAACTT
AGCGTGACGC CGTCGCGTTT TTCTGACGGT AACAACCGTT GGGAATATGA ATTTAACGGG
CGTCAGCGTA TCTGGACTGG GCCTTACCTA ACGGCAGATT TTAATCTGGG GTTGGCAGCC
AGCCAAAACA GTAAAGAAGA TGTGATTTAC TACAACCCGA AACGTGATTT TGCTTACGTT
CCGGCAGTGA CTCTCAATCA CATTATGTAC CGGCGGTACA AAACCATCTG GAGCCAGCAA
GTCCAACTGG GTGTGGGAGG GTACTGGGAG AAAAATTACG GTAATGGCTT GGTGACCACG
GCGGGCTATG GCCAACGTGT TCAATGGAAT GATGTTATCG ATACCGGTGT TGCTGTGGTT
TATGACAAGC GTCCTTACGA TGGTAAACGT GAGCACGATG TTACGCTTTC TTTCGATTTA
AATTATCGTT TTTAA
 
Protein sequence
MYNAFTTLLR PLHWHRMTLL ALFISGFVVN PAMADTQYDS LIIRARAGDT APVLDYLQEE 
SKAGPLQSGQ VDDWLQIAGW AGRDQEVIDV YERYQSSMNI SSRGLASAAR AYRNEKRWDQ
ALALWQSSLK KDPTNPDLIS GMIMTQADAG RGGVVLKQAT ELAERDPTVQ NYMTLSYLNR
ATDRNYDALQ ASSEAVRLAP TSEEVLKNHL EILQRNRIVE PALRLAKENP NLVSAEHYRQ
LERDAAAEQV RMAVLPTRSE TERFDIADKA LADYQNLLTR WGKDPEAQAD YQRARIDRLG
ALLVRHQTAD LIKEYEAMEA EGYKMPDYAR RWAASAYIDR RLPEKAAPIL SSLYYSDGKT
FRNSDDLLDA DDLYYSLNES EQLDKAYQFA VNYSEQTPYQ VGVYGLPGKE PNDDWIEGQT
LLVQSLVALN DLPTAQKKLE DLSSTAPANQ NLRIALASIY LARDLPRKAE QELKAVESLA
PRSLILERAQ AETAMALQEW HQMELLTDDV ISRSPEDIPS QELDRQRKVH NMYELRVSGN
RVISSNSPVS GNKDYGVETL LYSPPIAENW RVFGGGSYNN GQFEEGTGIS RILRLGGEWT
SRDHWVEGEI SNQNYGNGNG NKVGARLSTW YDLNDHWRVG GQVERLAKDT PLRALKNKVT
ANSASAYVFW KADDKRDAEL SVTPSRFSDG NNRWEYEFNG RQRIWTGPYL TADFNLGLAA
SQNSKEDVIY YNPKRDFAYV PAVTLNHIMY RRYKTIWSQQ VQLGVGGYWE KNYGNGLVTT
AGYGQRVQWN DVIDTGVAVV YDKRPYDGKR EHDVTLSFDL NYRF