Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_1570 |
Symbol | purK |
ID | 3718597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | - |
Start bp | 163084 |
End bp | 164166 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640069719 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_351613 |
Protein GI | 77462109 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGACC GCCTGCCCCC CGGCTCGACC ATCGGCATCC TCGGCGGCGG CCAGCTCGGC CGGATGCTTT CGGTCGCGGC GGCGCGGCTG GGCTTCCGCA CCCATATCTT CGAGCCGAGC GCCAACCCGC CCGCCGCCGA CGTGGCCCAT GCGGTCACGA CCGCGCCCTA CGAGGACGAG GCCGCACTGC GGGCCTTCGC GACCTCGGTC GATGTCATCA CCTACGAATT CGAGAACATC CCGACCTCGG CCCTCGACCT GCTGGAGGCG CTGAAACCCC TCCACCCGAA CCGCCGCGCC CTCGCGGTCA GCCAGGACCG GCTCGAGGAG AAGGGCTTCC TGACCGGGCT CGGCCTCGCC GTGGCCCCCT ACCGCCCCGT CGGCAGCCGA GAGGATCTCG AGGCCGCGAT CCACGGCATC GGCACGCCCG CCATCCTCAA GACCACGCGC CTCGGCTATG ACGGCAAGGG GCAGGCCCGC CTCATGGAGC CCGACGACGC GGCCGAGGCC TTCGCGGCCA TGGGCGGCCA GCCTGCCGTG CTCGAGGGCT TCGTCCGCTT CACCCACGAG GTCTCGGTCA TCGCGGCGCG CGGCCGCGAC GGATCGGTCG CGGTCTATGA GCCCGGCGAG AACGTGCATC TCTCGGGCAT CCTGCACACG ACCACGGTGC CCGCCCGCCT CACCGCCTCG CAGCGCACCG ACGCGGTGCT GCTGGCCGGG CGGATCCTCA ATGCTCTCGA CTATGTGGGC GTGATGGGGG TCGAGCTCTT CGTGACGCCC GAGGCGCTGC TGGTGAACGA GATCGCGCCG CGGGTCCACA ATTCCGGGCA CTGGACGCAG AACGGCTGCG CGGTGGACCA GTTCGAGCAG CATATCCGTG CGATCACCGG CTGGCCGCTC GGCGACGGCT CGCGCTTCGC CGATGTCGAG ATGGAGAATC TGATCGGCCA TGATGTGGCC CGGGTGCCGG CGCTCGCGCT CGAGAAGCAC ACGGCGATCC ATCTCTATGG CAAGGCCGAA GCGCGCCCGG GGCGCAAGAT GGGCCATGTG AACCGCATCC TCCGCCCGGT GACCGGCGCA TGA
|
Protein sequence | MTDRLPPGST IGILGGGQLG RMLSVAAARL GFRTHIFEPS ANPPAADVAH AVTTAPYEDE AALRAFATSV DVITYEFENI PTSALDLLEA LKPLHPNRRA LAVSQDRLEE KGFLTGLGLA VAPYRPVGSR EDLEAAIHGI GTPAILKTTR LGYDGKGQAR LMEPDDAAEA FAAMGGQPAV LEGFVRFTHE VSVIAARGRD GSVAVYEPGE NVHLSGILHT TTVPARLTAS QRTDAVLLAG RILNALDYVG VMGVELFVTP EALLVNEIAP RVHNSGHWTQ NGCAVDQFEQ HIRAITGWPL GDGSRFADVE MENLIGHDVA RVPALALEKH TAIHLYGKAE ARPGRKMGHV NRILRPVTGA
|
| |