Gene RPB_1446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1446 
Symbol 
ID3908396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1633396 
End bp1634490 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content69% 
IMG OID637883340 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionYP_485067 
Protein GI86748571 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGGTC CCGCTCGGCA AGTGCTCAAA CCCGGCGACA CCATCGGCAT TCTCGGCGGC 
GGCCAGCTCG GCCGGATGCT GGCGATGGCC GCAGCAAGGC TCGGCCTGCG CTGCAATGTG
TTCTCGCCGG ACCCGGATTC GCCGGCCTTC GACGTGGTGC AGAACGCCGT CTGCGCCGAA
TATGCCGATG TCGAGGCGCT GGAGATGTTC GCCGCCGACG TCGACGTCAT CACCTATGAA
TTCGAGAACG TGCCGGCCTC GGCGGCGCTG GTGCTGGCGG CGCGCAAGCC GGTGCTGCCG
GACTACAAGA TCCTGGAGAC CACCCAGGAT CGCCTCGCCG AGAAGGATTT CGTCACCGGC
CTCGGCATCG GCACCGCCGC CTATGCCGAC GTCACCTCGG CGCAGACGCT ACGCGCCGCG
ATCGCCAAGC TCGGCCTGCC CGCAGTGCTG AAGACGCGGC GGTTCGGCTA TGACGGCAAG
GGCCAGGTGA TCATCCGCGA GGGCGACGAT CCCGATGCGG CCTGGGAGAA GCTGGAGACC
CGCGCGGCGA TTCTCGAGGC CTTCGTGCCG TTCGAGCGCG AAGTCTCGGT GATCGCCGCG
CGCGGCGCCG ACGGCCAGGT GGTGTGCTAC GATGTCACCG AGAACGAGCA CCGCGACCAC
ATCCTCAAAG TGTCGCGGGT GCCGGCGCCG GTGAGCGACT CCGTCGCCGG CGAGGCACGG
CGGATCGCCA CCAGCATCGC CGATGCGCTG AACTATGTCG GCGTGCTGGC GGTCGAGATG
TTCGTGGTGC CGGGCGACGG CGGCGCGACC GTGCTGGTCA ACGAGATCGC GCCCCGGGTG
CACAATTCCG GGCACTGGAC GCTCGACGGC GCCTCGGTGT CGCAGTTCGA GCAGCACATC
CGGGCGATCG CCGGCTGGCC GCTGGCGGAA CCGCTACGCC ACGGCGCCGT CACCATGACC
AACCTGATCG GCCACGATGT CGACGATTAT GCCCGCTGGC TGACGGTTCC CGGCGCCACG
GTGCATCTCT ACGGCAAGCG GACGGCTTTG CCGGGCCGTA AGATGGGCCA CGTCACCGTG
ATCGAGCCAC GATGA
 
Protein sequence
MTGPARQVLK PGDTIGILGG GQLGRMLAMA AARLGLRCNV FSPDPDSPAF DVVQNAVCAE 
YADVEALEMF AADVDVITYE FENVPASAAL VLAARKPVLP DYKILETTQD RLAEKDFVTG
LGIGTAAYAD VTSAQTLRAA IAKLGLPAVL KTRRFGYDGK GQVIIREGDD PDAAWEKLET
RAAILEAFVP FEREVSVIAA RGADGQVVCY DVTENEHRDH ILKVSRVPAP VSDSVAGEAR
RIATSIADAL NYVGVLAVEM FVVPGDGGAT VLVNEIAPRV HNSGHWTLDG ASVSQFEQHI
RAIAGWPLAE PLRHGAVTMT NLIGHDVDDY ARWLTVPGAT VHLYGKRTAL PGRKMGHVTV
IEPR