Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_0221 |
Symbol | |
ID | 4897094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 243281 |
End bp | 244363 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640110804 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_001042112 |
Protein GI | 126460998 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGACC GCCTGCCCCC CGGCTCGACC ATCGGCATCC TCGGCGGCGG CCAGCTCGGC CGGATGCTTT CGGTCGCGGC GGCGCGGCTG GGCTTCCGCA CCCATATCTT CGAGCCGAGC GCCAACCCGC CCGCCGCCGA CGTGGCCCAT GCGGTCACGA CCGCGCCCTA CGAGGACGAG GCCGCGCTGC GGGCCTTCGC GGCCTCGGTC GATGTCATCA CCTACGAGTT CGAGAACATC CCGACCTCCG CCCTCGACCT GCTCGAGGCG CTGAAACCCC TCCACCCGAA CCGCCGCGCC CTCGCGGTCA GCCAGGACCG GCTCGAGGAG AAGGGCTTCC TGACCGGGCT CGGCCTCGCC GTGGCCCCCT ACCGCCCCGT CGGCAGCCGC GAGGATCTCG AGGCCGCGAT CCACGGCATC GGCACGCCCG CCATCCTCAA GACCACGCGG CTTGGCTATG ACGGCAAGGG GCAGGCCCGC CTCATGGAGC CGGACGACGC GGCCGAGGCC TTTGCGGCCA TGAACGGCCA GCCCGCCGTG CTCGAGGGCT TCGTCCGCTT CACCCACGAG GTCTCGGTCA TCGCGGCGCG CGGCCGCGAC GGCTCGGTCG CGGTCTATGA GCCGGGCGAG AACGTTCATC TCTCGGGCAT ACTGCACACG ACCACGGTGC CCGCCCGCCT CACCGCCTCG CAGCGCACCG ACGCGGTGCT GCTGGCCGGG CGGATCCTCA ATGCGCTCGA TTATGTGGGC GTGATGGGGG TCGAGCTCTT CGTGACGCCC GAGGCGCTGC TGGTGAACGA GATCGCGCCG CGGGTCCACA ATTCCGGGCA CTGGACGCAG AACGGCTGCG CGGTGGACCA GTTCGAGCAG CATATCCGTG CGATCACCGG CTGGCCGCTC GGCGACGGCT CGCGCTTCGC CGATGTCGAG ATGGAGAATC TGATCGGCCA TGATGTGGCC CGGGTGCCGG CCCTCGCGCT CGAGAAGCAC ACGGCGATCC ATCTCTATGG CAAATCCGAA GCGCGCCCCG GGCGCAAGAT GGGCCATGTG AACCGCATCC TCCGCCCGGT GACCGGCGCA TGA
|
Protein sequence | MTDRLPPGST IGILGGGQLG RMLSVAAARL GFRTHIFEPS ANPPAADVAH AVTTAPYEDE AALRAFAASV DVITYEFENI PTSALDLLEA LKPLHPNRRA LAVSQDRLEE KGFLTGLGLA VAPYRPVGSR EDLEAAIHGI GTPAILKTTR LGYDGKGQAR LMEPDDAAEA FAAMNGQPAV LEGFVRFTHE VSVIAARGRD GSVAVYEPGE NVHLSGILHT TTVPARLTAS QRTDAVLLAG RILNALDYVG VMGVELFVTP EALLVNEIAP RVHNSGHWTQ NGCAVDQFEQ HIRAITGWPL GDGSRFADVE MENLIGHDVA RVPALALEKH TAIHLYGKSE ARPGRKMGHV NRILRPVTGA
|
| |