Gene YpAngola_A2141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2141 
SymbolhmsH 
ID5800611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2238652 
End bp2241120 
Gene Length2469 bp 
Protein Length822 aa 
Translation table11 
GC content48% 
IMG OID641340049 
Productouter membrane protein 
Protein accessionYP_001606594 
Protein GI162421638 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.115838 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATAACG CATTTACAAC ACTACTGCGC CCGTTGCACT GGCACAGAAT GACTTTATTA 
GCGCTATTTA TTTCTGGCTT TGTCGTTAAT CCTGCTATGG CGGATACGCA GTATGACTCA
TTAATTATTC GTGCCAGAGC CGGTGATACT GCGCCAGTCT TGGACTATTT ACAGGAAGAA
TCTAAAGCGG GTCCATTGCA GAGTGGGCAG GTTGATGACT GGCTGCAAAT TGCAGGTTGG
GCAGGGCGCG ATCAAGAAGT GATTGATGTT TATGAACGAT ATCAATCTTC CATGAATATC
TCTTCCAGAG GTTTGGCTTC TGCCGCTCGA GCTTACCGCA ACGAAAAGCG CTGGGATCAG
GCTTTGGCCT TGTGGCAAAG CAGCTTAAAG AAAGACCCCA CCAATCCTGA TCTTATCAGC
GGTATGATCA TGACTCAGGC TGATGCTGGA CGCGGTGGGG TTGTTTTAAA ACAGGCAACA
GAGTTAGCCG AGCGTGATCC CACGGTACAA AATTACATGA CGCTGTCTTA TTTAAACCGG
GCTACTGACC GTAATTATGA TGCTTTGCAA GCCTCCAGCG AAGCGGTGCG ATTGGCACCA
ACATCTGAAG AAGTGCTCAA AAATCATCTC GAAATATTAC AGAGAAACCG TATTGTTGAG
CCAGCGTTAC GGCTTGCTAA AGAGAACCCA AATTTGGTTT CTGCTGAGCA TTATCGCCAA
CTTGAGCGCG ATGCTGCTGC GGAACAAGTT CGGATGGCGG TGTTGCCAAC GCGCAGCGAA
ACCGAGCGTT TTGACATAGC GGATAAGGCG TTAGCGGATT ATCAGAATTT ATTGACCCGT
TGGGGCAAAG ATCCAGAGGC ACAGGCTGAC TACCAACGCG CACGCATTGA CCGATTAGGG
GCGTTATTGG TCCGTCATCA GACGGCTGAT CTGATCAAAG AGTATGAAGC GATGGAGGCT
GAGGGCTATA AAATGCCTGA TTATGCCCGC CGTTGGGCGG CTTCTGCCTA TATTGACCGA
AGGTTGCCGG AAAAAGCAGC GCCTATCCTC TCCAGCCTCT ATTATTCAGA CGGTAAAACC
TTCCGTAACA GCGACGATTT GCTTGATGCC GATGATCTTT ACTATTCACT GAATGAAAGC
GAACAACTTG ATAAAGCCTA CCAATTTGCG GTCAATTATA GCGAGCAAAC GCCGTATCAG
GTTGGGGTTT ATGGCCTGCC GGGCAAAGAA CCGAATGATG ATTGGATTGA AGGCCAAACG
CTATTGGTTC AATCATTGGT GGCATTGAAT GATCTGCCCA CCGCACAAAA AAAATTGGAA
GATTTATCCA GTACTGCACC CGCTAATCAG AATTTAAGGA TCGCATTAGC CAGTATTTAT
CTGGCCCGGG ATTTACCGCG TAAAGCGGAG CAAGAATTGA AGGCCGTGGA ATCATTAGCA
CCACGCAGCC TTATTCTGGA ACGTGCGCAG GCTGAAACTG CGATGGCGCT ACAAGAATGG
CATCAGATGG AGTTACTGAC TGATGATGTG ATTAGCCGTT CCCCAGAAGA TATCCCCTCT
CAAGAATTAG ATCGCCAACG TAAAGTCCAT AACATGTATG AACTACGGGT GAGTGGTAAC
CGCGTTATCT CCTCCAACAG CCCGGTGAGC GGCAATAAAG ATTATGGGGT CGAAACTCTC
CTTTACAGCC CACCTATTGC AGAGAATTGG CGCGTATTTG GTGGCGGCAG TTATAACAAT
GGGCAGTTTG AAGAGGGGAC GGGGATTAGC CGGATTCTGC GCTTAGGTGG TGAATGGACC
TCCCGTGACC ATTGGGTTGA AGGGGAAATT TCCAATCAGA ATTATGGCAA TGGCAATAAA
GTCGGGGCCC GTTTATCAAC CTGGTACGAC CTTAATGACC ACTGGCGTGT AGGGGGGCAG
GTTGAACGTT TAGCTAAAGA CACACCACTA CGGGCACTGA AGAATAAAGT GACCGCGAAT
AGCGCGTCTG CTTATGTTTT CTGGAAAGCA GATGATAAGC GTGATGCTGA ACTTAGCGTG
ACGCCGTCGC GTTTTTCTGA CGGTAACAAC CGTTGGGAAT ATGAATTTAA CGGGCGTCAG
CGTATCTGGA CTGGGCCTTA CCTAACGGCA GATTTTAATC TGGGGTTGGC AGCCAGCCAA
AACAGTAAAG AAGATGTGAT TTACTACAAC CCGAAACGTG ATTTTGCTTA CGTTCCGGCA
GTGACTCTCA ATCACATTAT GTACCGGCGG TACAAAACCA TCTGGAGCCA GCAAGTCCAA
CTGGGTGTGG GAGGGTACTG GGAGAAAAAT TACGGTAATG GCTTGGTGAC CACGGCGGGC
TATGGCCAAC GTGTTCAATG GAATGATGTT ATCGATACCG GTGTTGCTGT GGTTTATGAC
AAGCGTCCTT ACGATGGTAA ACGTGAGCAC GATGTTACGC TTTCTTTCGA TTTAAATTAT
CGTTTTTAA
 
Protein sequence
MYNAFTTLLR PLHWHRMTLL ALFISGFVVN PAMADTQYDS LIIRARAGDT APVLDYLQEE 
SKAGPLQSGQ VDDWLQIAGW AGRDQEVIDV YERYQSSMNI SSRGLASAAR AYRNEKRWDQ
ALALWQSSLK KDPTNPDLIS GMIMTQADAG RGGVVLKQAT ELAERDPTVQ NYMTLSYLNR
ATDRNYDALQ ASSEAVRLAP TSEEVLKNHL EILQRNRIVE PALRLAKENP NLVSAEHYRQ
LERDAAAEQV RMAVLPTRSE TERFDIADKA LADYQNLLTR WGKDPEAQAD YQRARIDRLG
ALLVRHQTAD LIKEYEAMEA EGYKMPDYAR RWAASAYIDR RLPEKAAPIL SSLYYSDGKT
FRNSDDLLDA DDLYYSLNES EQLDKAYQFA VNYSEQTPYQ VGVYGLPGKE PNDDWIEGQT
LLVQSLVALN DLPTAQKKLE DLSSTAPANQ NLRIALASIY LARDLPRKAE QELKAVESLA
PRSLILERAQ AETAMALQEW HQMELLTDDV ISRSPEDIPS QELDRQRKVH NMYELRVSGN
RVISSNSPVS GNKDYGVETL LYSPPIAENW RVFGGGSYNN GQFEEGTGIS RILRLGGEWT
SRDHWVEGEI SNQNYGNGNK VGARLSTWYD LNDHWRVGGQ VERLAKDTPL RALKNKVTAN
SASAYVFWKA DDKRDAELSV TPSRFSDGNN RWEYEFNGRQ RIWTGPYLTA DFNLGLAASQ
NSKEDVIYYN PKRDFAYVPA VTLNHIMYRR YKTIWSQQVQ LGVGGYWEKN YGNGLVTTAG
YGQRVQWNDV IDTGVAVVYD KRPYDGKREH DVTLSFDLNY RF