Gene HMPREF0424_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_1039 
SymbolpurK 
ID8709661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp1181765 
End bp1182937 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content44% 
IMG OID646483132 
Productphosphoribosylaminoimidazole carboxylase, ATPase subunit 
Protein accessionYP_003374244 
Protein GI283783490 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAACCT TATCTGAAGT AACTAATGGT GCTGTTGAAC GTTTAATGCC AGGATCCACT 
ATTGGAATTA TTGGCGGCGG CCAGTTAGGG CGCATGATGG CTATTGCAGC TCGCCATATG
GGTTTTCGTA TTGGTGTTCT TGACCCTACG CTTGACTGCC CTGTGTTCCA AGTAGCAGAT
TTGCAGGTTG AAGCTAATTA CGACGATCCT GAAGGCTTAC GTGAGCTTGC TGAACGTTGC
GATGTATTAA CTTACGAATT TGAAAATGTT AACGCAGACG CACTTGATAA AGTTCGTCAT
TTAACCGCAA TCCCACAAGG AACTGACCTG TTACGCGTAA CTCAAGATCG CGTCAGTGAA
AAAACGTTCA TTAATAGTCA TAAAATCGAA ACAGCTCCGT GGCGTGAAGT AAATAATTTG
GACGACTTAG ATGCTGCTAT TGACGAAATA GGCTTGCCAG CAATTCTTAA AACTCGTCGC
GGCGGCTACG ACGGTCATGG TCAAGATGTT TTGCGCACAG AAGAAGACGT TGCTAACATT
CACCATCGCT CGGATCGCGG AGGAAAATTC CCTCCTTCAA TTCTCGAGGG TTTCGTCGAT
TTTGCTTTTG AAGCATCAAT CCTGGTTTCT GGAAATGGTA AGGATTTCGT AACTTATCCT
CTAGTAAAAA ACGTGCATCA CAATAGTATT TTGCACATGA CTTTAGCTCC TGCAGTAGTT
GATCCTGAAG TTGAAAAAAC AGCTCACGAA TTAGCTTTGC GCTTAGCTAA AGGATTCGAA
CTAGCAGGAA CATTAGGAAT CGAGCTTTTC ATCACTAAAG ATAATCGCGT AGTAGTAAAC
GAACTTGCTC CTCGCCCTCA CAATTCCGGG CATTATACGA TTGAAGCTTG CGATATGGAT
CAATTTGAAG CACATATTCG CGGTATTGTT GGTTGGCCTT TAAAGAAGCC TAAGCTACTT
TCCCCTGCTG TTATGGTAAA TGTTCTTGGG CAACATGTGG CTCCTACACG TTCGCTGATT
TTGGAACATC CAGAATGGCA TATACATGAT TATGGAAAAG CTGAAGTTCG TAAGAATCGC
AAAATGGGTC ATATTACTGT GCTATGCGAT AATCCTGTTG ACGCTGCTGC AGCATTAGAT
GCAACAGGCT GCTGGGACGA CGAGCTAGAC TAA
 
Protein sequence
MPTLSEVTNG AVERLMPGST IGIIGGGQLG RMMAIAARHM GFRIGVLDPT LDCPVFQVAD 
LQVEANYDDP EGLRELAERC DVLTYEFENV NADALDKVRH LTAIPQGTDL LRVTQDRVSE
KTFINSHKIE TAPWREVNNL DDLDAAIDEI GLPAILKTRR GGYDGHGQDV LRTEEDVANI
HHRSDRGGKF PPSILEGFVD FAFEASILVS GNGKDFVTYP LVKNVHHNSI LHMTLAPAVV
DPEVEKTAHE LALRLAKGFE LAGTLGIELF ITKDNRVVVN ELAPRPHNSG HYTIEACDMD
QFEAHIRGIV GWPLKKPKLL SPAVMVNVLG QHVAPTRSLI LEHPEWHIHD YGKAEVRKNR
KMGHITVLCD NPVDAAAALD ATGCWDDELD