Gene RPC_3961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3961 
Symbol 
ID3969248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4413897 
End bp4414997 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content67% 
IMG OID637927065 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionYP_533806 
Protein GI90425436 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00399053 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.845863 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGAAC TGAAACGCGT CACGCTGAAG CCCGGCGACA CCATCGGAAT TCTCGGCGGC 
GGCCAGCTCG GCCGGATGCT GGCGCTGGCC GCCGCACGGC TCGGGCTGAA GTGCCAGGTG
TTTTCGCCTG ACCCGGATTC GCCGGCGTTC GACGTGGTGC AATACGCCAC CTGCGCCGAA
TATGCCGACG TCGAGGCGCT GGAGCTGTTC GCCAACGACG TCGACGTCAT CACCTATGAA
TTCGAGAACA TCCCCTCCTC GGTGGCGGCG ATCCTGGCGT CGCGCCGCCC GGTGGTGCCG
GATCCCAAGA TCCTGGAACT GACCCAGGAC CGGCTGGTGG AGAAGGACTT CGTCACCAAG
CTCGGCATCC CCACCGCCGC CTATGCGGAC GTCTCCTCGC CGCAGACACT GCACGCCGCG
GTGGCGCGGA TCGGCTTGCC GGCGGTGATC AAGACCCGCC GCTTCGGCTA TGACGGCAAG
GGCCAGGCGA TCATCCGCGA AGGCGACGAC CTCGACGCGG TGTGGGCCGA TCTCGACACC
CGCTCGGCGA TTCTCGAAGC CTTCGTGCCG TTCGAGCGGG AAATCTCGGT GATCGCCGCG
CGCGGCTTCG ATGGCCAGGT GGTGTGCTAC GACGTCACCG AGAACGAGCA TCGCGATCAC
ATTCTGAAGG TGTCGCGGGT GCCGGCGGCG ATTTCCGACG AGCTCGCGGC GCGGGCTCGC
GGCATCGCCG AGACCATTGC GAACGCGGTC GGCTATGTCG GGGTGCTGGC GGTGGAACTG
TTCGTGGCGC CGGGCCATGA CGGCCCGCTG CTGCTGGTCA ACGAGGTCGC GCCGCGGGTA
CATAATTCCG GGCATTGGAC GCTGGACGGC GCCTCGGTGT CGCAGTTCGA GCAGCACATC
CGCGCGGTGG CCGGCTGGCC GCTGGCGCAG CCGGTGCGGC ACGGCGCGGT CACCATGACC
AATCTGATCG GCGAAGAGAT CGACGACTAT CCGAAATGGC TGTCGGAACC CGGCGCCACC
GTGCATCTGT ACGGCAAACG CAGCGCCCGG CCGGGCCGCA AGATGGGCCA CGTCACCGTG
GTGCAGCGGG CCAAGTCTTG A
 
Protein sequence
MTELKRVTLK PGDTIGILGG GQLGRMLALA AARLGLKCQV FSPDPDSPAF DVVQYATCAE 
YADVEALELF ANDVDVITYE FENIPSSVAA ILASRRPVVP DPKILELTQD RLVEKDFVTK
LGIPTAAYAD VSSPQTLHAA VARIGLPAVI KTRRFGYDGK GQAIIREGDD LDAVWADLDT
RSAILEAFVP FEREISVIAA RGFDGQVVCY DVTENEHRDH ILKVSRVPAA ISDELAARAR
GIAETIANAV GYVGVLAVEL FVAPGHDGPL LLVNEVAPRV HNSGHWTLDG ASVSQFEQHI
RAVAGWPLAQ PVRHGAVTMT NLIGEEIDDY PKWLSEPGAT VHLYGKRSAR PGRKMGHVTV
VQRAKS