Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpAngola_A2143 |
Symbol | hmsR |
ID | 5800613 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | + |
Start bp | 2243144 |
End bp | 2244478 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641340051 |
Product | N-glycosyltransferase |
Protein accession | YP_001606596 |
Protein GI | 162420902 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.382592 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0135691 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGATA GAATTATTGC TTTACTCATC CTATGTCTGG TGCTGAGTAT CCCTGCGGGA ATGATTTTGC TCTTTACCGG CGATGTACTG TTGAACTTTG TTTTCTTCTA TCCACTTTTT ATGTCCGGTA TTTGGATAAC CGGGGGGGTC TATTTCTGGC TACGCCGAGA GAGGCACTGG CCGTGGGGCG ATGATGTACC CGCACCAGAG CTGAAAGGTC ATCCGCTCGT GTCTATCTTA GTCCCTTGTT TCAATGAGGG GTTGAATGCA CGGGAAACCA TTCACGCAGC ACTGGCACAG ACCTACACCA ATATTGAAGT TATTGCTATT AACGATGGCT CCAGTGACGA TACCGCACAA GTCCTAGATG CGCTGCTTGC TGAAGATCCA CGCTTGCGCG TCATTCATCT GGCTCATAAT CAGGGTAAAG CGATTGCCCT GCGTATGGGG GCCGCCGCCG CCCGCAGTGA ATATCTGGTC TGTATCGATG GTGATGCCTT GTTGGACAAG AATGCGGTGC CTTATCTGGT CGCGCCACTG ATTGCGAATC CGCGTACTGG CGCGGTGACC GGTAACCCAC GTATTCGTAC CCGTTCGACA TTGATTGGCC GCGTTCAGGT CGGGGAGTTC TCTTCGATCA TTGGTTTGAT TAAGCGGACA CAGCGAGTCT ACGGCCAAGT GTTTACCGTC TCCGGTGTGG TCGCCGCTTT TCGCCGTAGA GCACTGGCGG ATGTTGGCTA CTGGAGCCCG GATATGATCA CGGAAGATAT TGATATTAGT TGGAAACTAC AGCTGAAGCA CTGGTCGGTA TTCTTCGAAC CCCGTGGGTT GTGTTGGATT TTGATGCCTG AAACCCTGCG CGGCTTGTGG AAACAGCGTC TCCGCTGGGC GCAAGGCGGT GCGGAAGTAT TTTTGAAAAA TATGTTCAAA CTTTGGCGCT GGCGTAACCG CCGGATGTGG CTGCTGTTTC TGGAGTATTC GTTGTCGATC ACTTGGGCAT TCACTTACCT GTTTAGCATC ACATTATACC TATTGGGGCT GGTTATCACT CTGCCGCCGG GTATTCATGT TCAAAGTGTC TTCCCGCCAG CCTTTACCGG GATGGTATTG GCACTGACAT GTCTATTGCA ATTTGCTATT AGTCTGGTGA TCGAACGCCG CTATGAACCT AAACTGGGTC ATTCCCTGTT TTGGATTATC TGGTATCCCA TGGTTTATTG GATGCTTAAC TTGTTTACCA CGGTGGTGTC GTTCCCGAAA GTGATGCTTA TCACCAAGCG TAAGCGTGCG CGTTGGGTAA GCCCTGATCG CGGCATTGGG AGGGTGAAAT CATGA
|
Protein sequence | MIDRIIALLI LCLVLSIPAG MILLFTGDVL LNFVFFYPLF MSGIWITGGV YFWLRRERHW PWGDDVPAPE LKGHPLVSIL VPCFNEGLNA RETIHAALAQ TYTNIEVIAI NDGSSDDTAQ VLDALLAEDP RLRVIHLAHN QGKAIALRMG AAAARSEYLV CIDGDALLDK NAVPYLVAPL IANPRTGAVT GNPRIRTRST LIGRVQVGEF SSIIGLIKRT QRVYGQVFTV SGVVAAFRRR ALADVGYWSP DMITEDIDIS WKLQLKHWSV FFEPRGLCWI LMPETLRGLW KQRLRWAQGG AEVFLKNMFK LWRWRNRRMW LLFLEYSLSI TWAFTYLFSI TLYLLGLVIT LPPGIHVQSV FPPAFTGMVL ALTCLLQFAI SLVIERRYEP KLGHSLFWII WYPMVYWMLN LFTTVVSFPK VMLITKRKRA RWVSPDRGIG RVKS
|
| |