Gene YpAngola_A3120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3120 
SymbolpurM 
ID5801594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3307987 
End bp3309030 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content52% 
IMG OID641340954 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_001607482 
Protein GI162419605 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCAACA AAACCTCTCT CAGTTATAAA GACGCAGGTG TAGATATTGA TGCCGGCAAT 
GACCTTGTTG ATCGCATAAA AGGTGTGGTT AAACAAACCC GTCGACCAGA AGTCATGGGC
GGATTAGGAG GGTTCGGTGC CCTGTGCGCG TTGCCGCAAA AATACCGTGA ACCTATTCTG
GTTTCAGGCA CGGATGGCGT CGGCACCAAG CTGCGTCTGG CGATGGACCT GAAACGTCAC
GATACTATCG GTATCGATTT AGTCGCGATG TGTGTCAACG ATCTGGTGGT TCAGGGCGCA
GAGCCGCTGT TCTTCCTCGA CTACTTTGCG ACCGGTAAAC TGGATGTGGA TACTGCCGCC
AGTGTGATTA CCGGGATTGC CGAAGGCTGT AAACAATCAG GTTGTGCGTT GGTTGGCGGT
GAAACCGCAG AAATGCCGGG CATGTACCAC GGCGATGATT ATGACGTTGC TGGCTTCTGT
GTGGGTGTCG TAGAGAAATC TGAAATCATT GATGGCAGTA AAGTTACACC AGGTGATGTC
TTGGTCGCCT TAGGTGCTAG CGGCCCACAC TCCAATGGTT ATTCATTGGT GCGCAAAATT
CTGGACGTCA GCAACACCAA TCCAGAACAG ACCTCGTTGG AAGGCAAATC TCTGGCCGAT
CATTTGCTAG AACCGACCAA AATCTATGTG AAATCCATTC TCAGCCTGAT TGAACAGTTA
GATATCCACG CCATTGCGCA TCTGACCGGT GGTGGCTTCT GGGAAAATAT CCCGCGCGTG
CTACCGCAAG GCATGCAAGC CGTTATCGAC GAAGCCAGTT GGCAGTGGCC AGCGGTATTC
AGTTGGCTGC AACAAGCTGG CAATGTCAGC CGCCATGAGA TGTACCGCAC CTTTAACTGT
GGCGTCGGTA TGGTTGTTGC CTTGCCTGCA GAACTGGCAG ATAAAGCGGT TGAGTTGCTG
ACAGCTTCTG GCGAAAAAGC CTGGAAAATC GGTGTCATTG CCGCGGCAAC TGAGGGTGCT
GAGCAAGTCA TCATTAATCC GTAA
 
Protein sequence
MTNKTSLSYK DAGVDIDAGN DLVDRIKGVV KQTRRPEVMG GLGGFGALCA LPQKYREPIL 
VSGTDGVGTK LRLAMDLKRH DTIGIDLVAM CVNDLVVQGA EPLFFLDYFA TGKLDVDTAA
SVITGIAEGC KQSGCALVGG ETAEMPGMYH GDDYDVAGFC VGVVEKSEII DGSKVTPGDV
LVALGASGPH SNGYSLVRKI LDVSNTNPEQ TSLEGKSLAD HLLEPTKIYV KSILSLIEQL
DIHAIAHLTG GGFWENIPRV LPQGMQAVID EASWQWPAVF SWLQQAGNVS RHEMYRTFNC
GVGMVVALPA ELADKAVELL TASGEKAWKI GVIAAATEGA EQVIINP