Gene YpsIP31758_2129 details


Gene Information

Locus tag: YpsIP31758_2129
Symbol: hmsR
ID: 5387318
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 2446295
End bp: 2447629
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 51%
IMG OID: 640865115
Product: N-glycosyltransferase
Protein accession: YP_001401102
Protein GI: 153947690
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
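
The gene and protein lengths in this record can be cross-checked from the start and end coordinates. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical and not part of any database tool.

```python
# Coordinates copied from the record above.
START_BP = 2446295
END_BP = 2447629


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1335
print(protein_length(length_bp))  # 444
```

Both values agree with the Gene Length and Protein Length fields of the record.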


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value: 0.772993
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCGATA GAATTATTGC TTTACTCATC CTATGTCTGG TGCTGAGTAT CCCTGCGGGA 
ATGATTTTGC TCTTTACCGG CGATGTACTG TTGAACTTTG TTTTCTTCTA TCCACTTTTT
ATGTCCGGTA TTTGGATAAC CGGGGGGGTC TATTTCTGGC TACGCCGAGA GAGGCACTGG
CCGTGGGGCG ATGATGTACC CGCACCAGAG CTGAAAGGTC ATCCGCTCGT GTCTATCTTA
GTCCCTTGTT TCAATGAGGG GTTGAATGCA CGGGAAACCA TTCACGCAGC ACTGGCACAG
ACCTACACCA ATATTGAAGT TATTGCTATT AACGATGGCT CCAGTGACGA TACCGCACAA
GTCCTAGATG CGCTGCTTGC TGAAGATCCA CGCTTGCGCG TCATTCATCT GGCTCATAAT
CAGGGTAAAG CGATTGCCCT GCGTATGGGG GCCGCCGCCG CCCGCAGTGA ATATCTGGTC
TGTATCGATG GTGATGCCTT GTTGGACAAG AATGCGGTGC CTTATCTGGT CGCGCCACTG
ATTGCGAATC CGCGTACTGG CGCGGTGACC GGTAACCCAC GTATTCGTAC CCGTTCGACA
TTGATTGGCC GCGTTCAGGT CGGGGAGTTC TCTTCGATCA TTGGTTTGAT TAAGCGGACA
CAGCGAGTCT ACGGCCAAGT GTTTACCGTC TCCGGTGTGG TCGCCGCTTT TCGCCGTAGA
GCACTGGCGG ATGTTGGCTA CTGGAGCCCG GATATGATCA CGGAAGATAT TGATATTAGT
TGGAAACTAC AGCTGAAGCA CTGGTCGGTA TTCTTCGAAC CCCGTGGGTT GTGTTGGATT
TTGATGCCTG AAACCCTGCG CGGCTTGTGG AAACAGCGTC TCCGCTGGGC GCAAGGCGGT
GCGGAAGTAT TTTTGAAAAA TATGTTCAAA CTTTGGCGCT GGCGTAACCG CCGGATGTGG
CTGCTGTTTC TGGAGTATTC GTTGTCGATC ACTTGGGCAT TCACTTACCT GTTTAGCATC
ACATTATACC TATTGGGGCT GGTTATCACT CTGCCGCCGG GTATTCATGT TCAAAGTGTC
TTCCCGCCAG CCTTTACCGG GATGGTATTG GCACTGACAT GTCTATTGCA ATTTGCTATT
AGTCTGGTGA TCGAACGCCG CTATGAACCT AAACTGGGTC ATTCCCTGTT TTGGATTATC
TGGTATCCCA TGGTTTATTG GATGCTTAAC TTGTTTACCA CGGTGGTGTC GTTCCCGAAA
GTGATGCTTA TCACCAAGCG TAAGCGTGCG CGTTGGGTAA GCCCTGATCG CGGCATTGGG
AGGGTGAAAT CATGA
 
Protein sequence
MIDRIIALLI LCLVLSIPAG MILLFTGDVL LNFVFFYPLF MSGIWITGGV YFWLRRERHW 
PWGDDVPAPE LKGHPLVSIL VPCFNEGLNA RETIHAALAQ TYTNIEVIAI NDGSSDDTAQ
VLDALLAEDP RLRVIHLAHN QGKAIALRMG AAAARSEYLV CIDGDALLDK NAVPYLVAPL
IANPRTGAVT GNPRIRTRST LIGRVQVGEF SSIIGLIKRT QRVYGQVFTV SGVVAAFRRR
ALADVGYWSP DMITEDIDIS WKLQLKHWSV FFEPRGLCWI LMPETLRGLW KQRLRWAQGG
AEVFLKNMFK LWRWRNRRMW LLFLEYSLSI TWAFTYLFSI TLYLLGLVIT LPPGIHVQSV
FPPAFTGMVL ALTCLLQFAI SLVIERRYEP KLGHSLFWII WYPMVYWMLN LFTTVVSFPK
VMLITKRKRA RWVSPDRGIG RVKS
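
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, not the database's own pipeline. It relies on the fact that the amino acid assignments of table 11 match those of the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper name `translate` is hypothetical, and only the first and last rows of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # remove the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence shown above.
first_row = "ATGATCGATA GAATTATTGC TTTACTCATC CTATGTCTGG TGCTGAGTAT CCCTGCGGGA"
last_row = "AGGGTGAAAT CATGA"

print(translate(first_row))  # MIDRIIALLILCLVLSIPAG
print(translate(last_row))   # RVKS
```

The two outputs match the first 20 and last 4 residues of the protein sequence. The last row is in frame because the 22 preceding rows of 60 bases total 1320 bases, a multiple of 3.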