Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HMPREF0424_1039 |
Symbol | purK |
ID | 8709661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | + |
Start bp | 1181765 |
End bp | 1182937 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 646483132 |
Product | phosphoribosylaminoimidazole carboxylase, ATPase subunit |
Protein accession | YP_003374244 |
Protein GI | 283783490 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAACCT TATCTGAAGT AACTAATGGT GCTGTTGAAC GTTTAATGCC AGGATCCACT ATTGGAATTA TTGGCGGCGG CCAGTTAGGG CGCATGATGG CTATTGCAGC TCGCCATATG GGTTTTCGTA TTGGTGTTCT TGACCCTACG CTTGACTGCC CTGTGTTCCA AGTAGCAGAT TTGCAGGTTG AAGCTAATTA CGACGATCCT GAAGGCTTAC GTGAGCTTGC TGAACGTTGC GATGTATTAA CTTACGAATT TGAAAATGTT AACGCAGACG CACTTGATAA AGTTCGTCAT TTAACCGCAA TCCCACAAGG AACTGACCTG TTACGCGTAA CTCAAGATCG CGTCAGTGAA AAAACGTTCA TTAATAGTCA TAAAATCGAA ACAGCTCCGT GGCGTGAAGT AAATAATTTG GACGACTTAG ATGCTGCTAT TGACGAAATA GGCTTGCCAG CAATTCTTAA AACTCGTCGC GGCGGCTACG ACGGTCATGG TCAAGATGTT TTGCGCACAG AAGAAGACGT TGCTAACATT CACCATCGCT CGGATCGCGG AGGAAAATTC CCTCCTTCAA TTCTCGAGGG TTTCGTCGAT TTTGCTTTTG AAGCATCAAT CCTGGTTTCT GGAAATGGTA AGGATTTCGT AACTTATCCT CTAGTAAAAA ACGTGCATCA CAATAGTATT TTGCACATGA CTTTAGCTCC TGCAGTAGTT GATCCTGAAG TTGAAAAAAC AGCTCACGAA TTAGCTTTGC GCTTAGCTAA AGGATTCGAA CTAGCAGGAA CATTAGGAAT CGAGCTTTTC ATCACTAAAG ATAATCGCGT AGTAGTAAAC GAACTTGCTC CTCGCCCTCA CAATTCCGGG CATTATACGA TTGAAGCTTG CGATATGGAT CAATTTGAAG CACATATTCG CGGTATTGTT GGTTGGCCTT TAAAGAAGCC TAAGCTACTT TCCCCTGCTG TTATGGTAAA TGTTCTTGGG CAACATGTGG CTCCTACACG TTCGCTGATT TTGGAACATC CAGAATGGCA TATACATGAT TATGGAAAAG CTGAAGTTCG TAAGAATCGC AAAATGGGTC ATATTACTGT GCTATGCGAT AATCCTGTTG ACGCTGCTGC AGCATTAGAT GCAACAGGCT GCTGGGACGA CGAGCTAGAC TAA
|
Protein sequence | MPTLSEVTNG AVERLMPGST IGIIGGGQLG RMMAIAARHM GFRIGVLDPT LDCPVFQVAD LQVEANYDDP EGLRELAERC DVLTYEFENV NADALDKVRH LTAIPQGTDL LRVTQDRVSE KTFINSHKIE TAPWREVNNL DDLDAAIDEI GLPAILKTRR GGYDGHGQDV LRTEEDVANI HHRSDRGGKF PPSILEGFVD FAFEASILVS GNGKDFVTYP LVKNVHHNSI LHMTLAPAVV DPEVEKTAHE LALRLAKGFE LAGTLGIELF ITKDNRVVVN ELAPRPHNSG HYTIEACDMD QFEAHIRGIV GWPLKKPKLL SPAVMVNVLG QHVAPTRSLI LEHPEWHIHD YGKAEVRKNR KMGHITVLCD NPVDAAAALD ATGCWDDELD
|
| |