Gene HMPREF0424_1033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_1033 
SymbolpurM 
ID8709867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp1172277 
End bp1173311 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content44% 
IMG OID646483126 
Productphosphoribosylformylglycinamidine cyclo-ligase 
Protein accessionYP_003374238 
Protein GI283783484 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACATG CATATGAAGA AGCTGGCGTA AGCGTAGAAG CAGGATACGA AGTAGTACGT 
CGTATTAAAT CTCATGTAAA TCGCACAAAG CGCCCAGGCG TTGTAGGTGG CATTGGCGGA
TTTGGCGGCT TATTTGATTT GGCGTCTCTT GGTTACAAAG AGCCAGTGCT GATTTCTGGC
ACGGATGGCG TTGGAACCAA GCTTGTGATT GCAAAAATGA TGAATAAGCA TAACACTATT
GGCATTGATT GCGTTGCAAT GTGCGTAAAC GATATTGCAG CTCAAGGTGC CGAGCCGCTT
TTCTTCCTCG ATTATATTGC ATGCGGAAAA AACAATCCTG AAATACTTGA GCAAGTAGTT
TCAGGCGTTG CAGACGGTTG CGTGCAATCA GAAGCAGGTT TAATTGGCGG CGAAACTGCT
GAAATGCCTG GAATGTATGA CGAAGACGAG TACGATCTTG CAGGTTTTGC AGTAGGCGTT
GCAGAACGTT CTAATATTGT TGATGGATCC ACTATTACTG CCGGCGACGT GCTTATCGGA
CTTCCTTCTT CAGGAGTTCA TTCAAACGGA TTCTCTCTTG TTCGCAAAGC TTTGTTTGAA
GAAGCAGGTT TTAGCGTTGA CACCAAGCTA GATGAGCTTA ATGGCAAAAC ACTTGGCGAA
GTTCTTCTTG AACCCACTCG AATCTACGTA AAAGCTTTAA AGCCACTTTT TGCTGAACAT
CTTATTAAGG GAGTTGCTCA TATTACAGGC GGCGGATTTA TTGAAAATGT TCCGCGCATG
TACGCAGATG ATTTAGCTGC AAAAATTGAT ACCACTAGCT GGAGCGTTCC ACCTATTTTT
GGCGTTATTG AACAAGCAGG CAAAGTTGCT CACGCTGAAA TGTTCAACGT TTTTAATATG
GGCATTGGCA TGGTTTTGGC AGTTGATGAA AGCCGAGCAA ACGAGGCAAT GCGAGTACTA
AACAAGCATG ACGAAACCGC TTATATTATT GGCAAAATGG CTAAGCGAGA AAACGTTGCA
GTTGAGTTGT TGTAA
 
Protein sequence
MPHAYEEAGV SVEAGYEVVR RIKSHVNRTK RPGVVGGIGG FGGLFDLASL GYKEPVLISG 
TDGVGTKLVI AKMMNKHNTI GIDCVAMCVN DIAAQGAEPL FFLDYIACGK NNPEILEQVV
SGVADGCVQS EAGLIGGETA EMPGMYDEDE YDLAGFAVGV AERSNIVDGS TITAGDVLIG
LPSSGVHSNG FSLVRKALFE EAGFSVDTKL DELNGKTLGE VLLEPTRIYV KALKPLFAEH
LIKGVAHITG GGFIENVPRM YADDLAAKID TTSWSVPPIF GVIEQAGKVA HAEMFNVFNM
GIGMVLAVDE SRANEAMRVL NKHDETAYII GKMAKRENVA VELL