Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1446 |
Symbol | |
ID | 3908396 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1633396 |
End bp | 1634490 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637883340 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_485067 |
Protein GI | 86748571 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGGTC CCGCTCGGCA AGTGCTCAAA CCCGGCGACA CCATCGGCAT TCTCGGCGGC GGCCAGCTCG GCCGGATGCT GGCGATGGCC GCAGCAAGGC TCGGCCTGCG CTGCAATGTG TTCTCGCCGG ACCCGGATTC GCCGGCCTTC GACGTGGTGC AGAACGCCGT CTGCGCCGAA TATGCCGATG TCGAGGCGCT GGAGATGTTC GCCGCCGACG TCGACGTCAT CACCTATGAA TTCGAGAACG TGCCGGCCTC GGCGGCGCTG GTGCTGGCGG CGCGCAAGCC GGTGCTGCCG GACTACAAGA TCCTGGAGAC CACCCAGGAT CGCCTCGCCG AGAAGGATTT CGTCACCGGC CTCGGCATCG GCACCGCCGC CTATGCCGAC GTCACCTCGG CGCAGACGCT ACGCGCCGCG ATCGCCAAGC TCGGCCTGCC CGCAGTGCTG AAGACGCGGC GGTTCGGCTA TGACGGCAAG GGCCAGGTGA TCATCCGCGA GGGCGACGAT CCCGATGCGG CCTGGGAGAA GCTGGAGACC CGCGCGGCGA TTCTCGAGGC CTTCGTGCCG TTCGAGCGCG AAGTCTCGGT GATCGCCGCG CGCGGCGCCG ACGGCCAGGT GGTGTGCTAC GATGTCACCG AGAACGAGCA CCGCGACCAC ATCCTCAAAG TGTCGCGGGT GCCGGCGCCG GTGAGCGACT CCGTCGCCGG CGAGGCACGG CGGATCGCCA CCAGCATCGC CGATGCGCTG AACTATGTCG GCGTGCTGGC GGTCGAGATG TTCGTGGTGC CGGGCGACGG CGGCGCGACC GTGCTGGTCA ACGAGATCGC GCCCCGGGTG CACAATTCCG GGCACTGGAC GCTCGACGGC GCCTCGGTGT CGCAGTTCGA GCAGCACATC CGGGCGATCG CCGGCTGGCC GCTGGCGGAA CCGCTACGCC ACGGCGCCGT CACCATGACC AACCTGATCG GCCACGATGT CGACGATTAT GCCCGCTGGC TGACGGTTCC CGGCGCCACG GTGCATCTCT ACGGCAAGCG GACGGCTTTG CCGGGCCGTA AGATGGGCCA CGTCACCGTG ATCGAGCCAC GATGA
|
Protein sequence | MTGPARQVLK PGDTIGILGG GQLGRMLAMA AARLGLRCNV FSPDPDSPAF DVVQNAVCAE YADVEALEMF AADVDVITYE FENVPASAAL VLAARKPVLP DYKILETTQD RLAEKDFVTG LGIGTAAYAD VTSAQTLRAA IAKLGLPAVL KTRRFGYDGK GQVIIREGDD PDAAWEKLET RAAILEAFVP FEREVSVIAA RGADGQVVCY DVTENEHRDH ILKVSRVPAP VSDSVAGEAR RIATSIADAL NYVGVLAVEM FVVPGDGGAT VLVNEIAPRV HNSGHWTLDG ASVSQFEQHI RAIAGWPLAE PLRHGAVTMT NLIGHDVDDY ARWLTVPGAT VHLYGKRTAL PGRKMGHVTV IEPR
|
| |