Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3961 |
Symbol | |
ID | 3969248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 4413897 |
End bp | 4414997 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637927065 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_533806 |
Protein GI | 90425436 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00399053 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.845863 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTGAAC TGAAACGCGT CACGCTGAAG CCCGGCGACA CCATCGGAAT TCTCGGCGGC GGCCAGCTCG GCCGGATGCT GGCGCTGGCC GCCGCACGGC TCGGGCTGAA GTGCCAGGTG TTTTCGCCTG ACCCGGATTC GCCGGCGTTC GACGTGGTGC AATACGCCAC CTGCGCCGAA TATGCCGACG TCGAGGCGCT GGAGCTGTTC GCCAACGACG TCGACGTCAT CACCTATGAA TTCGAGAACA TCCCCTCCTC GGTGGCGGCG ATCCTGGCGT CGCGCCGCCC GGTGGTGCCG GATCCCAAGA TCCTGGAACT GACCCAGGAC CGGCTGGTGG AGAAGGACTT CGTCACCAAG CTCGGCATCC CCACCGCCGC CTATGCGGAC GTCTCCTCGC CGCAGACACT GCACGCCGCG GTGGCGCGGA TCGGCTTGCC GGCGGTGATC AAGACCCGCC GCTTCGGCTA TGACGGCAAG GGCCAGGCGA TCATCCGCGA AGGCGACGAC CTCGACGCGG TGTGGGCCGA TCTCGACACC CGCTCGGCGA TTCTCGAAGC CTTCGTGCCG TTCGAGCGGG AAATCTCGGT GATCGCCGCG CGCGGCTTCG ATGGCCAGGT GGTGTGCTAC GACGTCACCG AGAACGAGCA TCGCGATCAC ATTCTGAAGG TGTCGCGGGT GCCGGCGGCG ATTTCCGACG AGCTCGCGGC GCGGGCTCGC GGCATCGCCG AGACCATTGC GAACGCGGTC GGCTATGTCG GGGTGCTGGC GGTGGAACTG TTCGTGGCGC CGGGCCATGA CGGCCCGCTG CTGCTGGTCA ACGAGGTCGC GCCGCGGGTA CATAATTCCG GGCATTGGAC GCTGGACGGC GCCTCGGTGT CGCAGTTCGA GCAGCACATC CGCGCGGTGG CCGGCTGGCC GCTGGCGCAG CCGGTGCGGC ACGGCGCGGT CACCATGACC AATCTGATCG GCGAAGAGAT CGACGACTAT CCGAAATGGC TGTCGGAACC CGGCGCCACC GTGCATCTGT ACGGCAAACG CAGCGCCCGG CCGGGCCGCA AGATGGGCCA CGTCACCGTG GTGCAGCGGG CCAAGTCTTG A
|
Protein sequence | MTELKRVTLK PGDTIGILGG GQLGRMLALA AARLGLKCQV FSPDPDSPAF DVVQYATCAE YADVEALELF ANDVDVITYE FENIPSSVAA ILASRRPVVP DPKILELTQD RLVEKDFVTK LGIPTAAYAD VSSPQTLHAA VARIGLPAVI KTRRFGYDGK GQAIIREGDD LDAVWADLDT RSAILEAFVP FEREISVIAA RGFDGQVVCY DVTENEHRDH ILKVSRVPAA ISDELAARAR GIAETIANAV GYVGVLAVEL FVAPGHDGPL LLVNEVAPRV HNSGHWTLDG ASVSQFEQHI RAVAGWPLAQ PVRHGAVTMT NLIGEEIDDY PKWLSEPGAT VHLYGKRSAR PGRKMGHVTV VQRAKS
|
| |