Gene YpsIP31758_2131 details

Gene Information

Locus tag: YpsIP31758_2131
Symbol: hmsH
ID: 5387125
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 2449653
End bp: 2452121
Gene Length: 2469 bp
Protein Length: 822 aa
Translation table: 11
GC content: 48%
IMG OID: 640865117
Product: outer membrane protein
Protein accession: YP_001401104
Protein GI: 153950032
COG category:
COG ID:
TIGRFAM ID:
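
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2449653
end_bp = 2452121

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Each codon is 3 bases; the last codon is assumed to be a stop codon
# and therefore contributes no amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 2469
print(protein_length_aa)  # 822
```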


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATAACG CATTTACAAC ACTACTGCGC CCGTTGCACT GGCACAGAAT GACTTTATTA 
GCGCTATTTA TTTCTGGCTT TGTCGTTAAT CCTGCTATGG CGGATACGCA GTATGACTCA
TTAATTATTC GTGCCAGAGC CGGTGATACT GCGCCAGTCT TGGACTATTT ACAGGAAGAA
TCTAAAGCGG GTCCATTGCA GAGTGGGCAG GTTGATGACT GGCTGCAAAT TGCAGGTTGG
GCAGGGCGCG ATCAAGAAGT GATTGATGTT TATGAACGAT ATCAATCTTC CATGAATATC
TCTTCCAGAG GTTTGGCTTC TGCCGCTCGA GCTTACCGCA ACGAAAAGCG CTGGGATCAG
GCTTTGGCCT TGTGGCAAAG CAGCTTAAAG AAAGACCCCA CCAATCCTGA TCTTATCAGC
GGTATGATCA TGACTCAGGC TGATGCTGGA CGCGGTGGGG TTGTTTTAAA ACAGGCAACA
GAGTTAGCCG AGCGTGATCC CACGGTACAA AATTACATGA CGCTGTCTTA TTTAAACCGG
GCTACTGACC GTAATTATGA TGCTTTGCAA GCCTCCAGCG AAGCGGTGCG ATTGGCACCA
ACATCTGAAG AAGTGCTCAA AAATCATCTC GAAATATTAC AGAGAAACCG TATTGTTGAG
CCAGCGTTAC GGCTTGCTAA AGAGAACCCA AATTTGGTTT CTGCTGAGCA TTATCGCCAA
CTTGAGCGCG ATGCTGCTGC GGAACAAGTT CGGATGGCGG TGTTGCCAAC GCGCAGCGAA
ACCGAGCGTT TTGACATAGC GGATAAGGCG TTAGCGGATT ATCAGAATTT ATTGACCCGT
TGGGGCAAAG ATCCAGAGGC ACAGGCTGAC TACCAACGCG CACGCATTGA CCGATTAGGG
GCGTTATTGG TCCGTCATCA GACGGCTGAT CTGATCAAAG AGTATGAAGC GATGGAGGCT
GAGGGCTATA AAATGCCTGA TTATGCCCGC CGTTGGGCGG CTTCTGCCTA TATTGACCGA
AGGTTGCCGG AAAAAGCAGC GCCTATCCTC TCCAGCCTCT ATTATTCAGA CGGTAAAACC
TTCCGTAACA GCGACGATTT GCTTGATGCC GATGATCTTT ACTATTCACT GAATGAAAGC
GAACAACTTG ATAAAGCCTA CCAATTTGCG GTCAATTATA GCGAGCAAAC GCCGTATCAG
GTTGGGGTTT ATGGCCTGCC GGGCAAAGAA CCGAATGATG ATTGGATTGA AGGCCAAACG
CTATTGGTTC AATCATTGGT GGCATTGAAT GATCTGCCCA CCGCACAAAA AAAATTGGAA
GATTTATCCA GTACTGCACC CGCTAATCAG AATTTAAGGA TCGCATTAGC CAGTATTTAT
CTGGCCCGGG ATTTACCGCG TAAAGCGGAG CAAGAATTGA AGGCCGTGGA ATCATTAGCA
CCACGCAGCC TTATTCTGGA ACGTGCGCAG GCTGAAACTG CGATGGCGCT ACAAGAATGG
CATCAGATGG CGTTACTGAC TGATGATGTG ATTAGCCGTT CCCCAGAAGA TATCCCCTCT
CAAGAATTAG ATCGCCAACG TAAAGTCCAT AACATGTATG AACTACGGGT GAGTGGTAAC
CGCGTTATCT CCTCCAACAG CCCGGTGAGC GGCAATAAAG ATTATGGGGT CGAAACTCTC
CTTTACAGCC CACCTATTGC AGAGAATTGG CGCGTATTTG GTGGCGGCAG TTATAACAAT
GGGCAGTTTG AAGAGGGGAC GGGGATTAGC CGGATTTTGC GCTTAGGTGG TGAATGGACC
TCCCGTGACC ATTGGGTTGA AGGGGAAATT TCCAATCAGA ATTATGGCAA TGGCAATAAA
GTCGGGGCCC GTTTATCAAC CTGGTACGAC CTTAATGACC ACTGGCGTGT AGGGGGGCAG
GTTGAACGTT TAGCTAAAGA CACACCACTA CGGGCACTGA AGAATAAAGT GACAGCGAAT
AGCGCGTCTG CTTATGTTTT CTGGAAAGCA GATGATAAGC GTGATGCTGA ACTTAGCGTG
ACGCCGTCGC GTTTTTCTGA CGGTAACAAC CGTTGGGAAT ATGAATTTAA CGGGCGTCAG
CGTATCTGGA CTGGGCCTTA CCTAACGGCA GATTTTAATC TGGGGTTGGC AGCCAGCCAA
AACAGTAAAG AAGATGTGAT TTACTACAAC CCGAAACGTG ATTTTGCTTA CGTTCCGGCA
GTGACTCTCA ATCACATTAT GTACCGGCGG TACAAAACCA TCTGGAGCCA GCAAGTCCAA
CTGGGTGTGG GAGGGTACTG GGAGAAAAAT TACGGTAATG GCTTGGTGAC CACGGCGGGC
TATGGCCAAC GTGTTCAATG GAATGATGTT ATCGATACCG GTGTTGCTGT GGTTTATGAC
AAGCGTCCTT ACGATGGTAA ACGTGAGCAC GATGTTACGC TTTCTTTCGA TTTAAATTAT
CGTTTTTAA
 
Protein sequence
MYNAFTTLLR PLHWHRMTLL ALFISGFVVN PAMADTQYDS LIIRARAGDT APVLDYLQEE 
SKAGPLQSGQ VDDWLQIAGW AGRDQEVIDV YERYQSSMNI SSRGLASAAR AYRNEKRWDQ
ALALWQSSLK KDPTNPDLIS GMIMTQADAG RGGVVLKQAT ELAERDPTVQ NYMTLSYLNR
ATDRNYDALQ ASSEAVRLAP TSEEVLKNHL EILQRNRIVE PALRLAKENP NLVSAEHYRQ
LERDAAAEQV RMAVLPTRSE TERFDIADKA LADYQNLLTR WGKDPEAQAD YQRARIDRLG
ALLVRHQTAD LIKEYEAMEA EGYKMPDYAR RWAASAYIDR RLPEKAAPIL SSLYYSDGKT
FRNSDDLLDA DDLYYSLNES EQLDKAYQFA VNYSEQTPYQ VGVYGLPGKE PNDDWIEGQT
LLVQSLVALN DLPTAQKKLE DLSSTAPANQ NLRIALASIY LARDLPRKAE QELKAVESLA
PRSLILERAQ AETAMALQEW HQMALLTDDV ISRSPEDIPS QELDRQRKVH NMYELRVSGN
RVISSNSPVS GNKDYGVETL LYSPPIAENW RVFGGGSYNN GQFEEGTGIS RILRLGGEWT
SRDHWVEGEI SNQNYGNGNK VGARLSTWYD LNDHWRVGGQ VERLAKDTPL RALKNKVTAN
SASAYVFWKA DDKRDAELSV TPSRFSDGNN RWEYEFNGRQ RIWTGPYLTA DFNLGLAASQ
NSKEDVIYYN PKRDFAYVPA VTLNHIMYRR YKTIWSQQVQ LGVGGYWEKN YGNGLVTTAG
YGQRVQWNDV IDTGVAVVYD KRPYDGKREH DVTLSFDLNY RF
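
The protein sequence relates to the gene sequence through codon-by-codon translation. The following is a minimal sketch, assuming the standard codon-to-amino-acid assignments (translation table 11 uses the same assignments as the standard code and differs only in which start codons are permitted). `CODON_TABLE` and `translate` are hypothetical names introduced for illustration, and only the first 60 bases of the gene sequence are used.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above (spaces are stripped).
first_60_bases = (
    "ATGTATAACG CATTTACAAC ACTACTGCGC CCGTTGCACT GGCACAGAAT GACTTTATTA"
)
print(translate(first_60_bases))  # MYNAFTTLLRPLHWHRMTLL
```

The output matches the first 20 residues of the protein sequence listed above.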