Gene RPB_0951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0951 
Symbol 
ID3909306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1100131 
End bp1101516 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content64% 
IMG OID637882844 
Productribulose bisphosphate carboxylase 
Protein accessionYP_484572 
Protein GI86748076 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1850] Ribulose 1,5-bisphosphate carboxylase, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCAGT CGAGCCGCTA CGCCAACCTC AACCTCAAAG AAAGCGATCT GATCGCGGGC 
GGGCGGCATG TGCTGTGCGC CTACATCATG AAGCCGAAGG ACGGCTTCGG CAATTTCCTG
CAGACCGCCG CACATTTTTC GGCCGAATCC TCGACTGGTA CCAATGTCGA AGTCTCCACC
ACCGACGACT TCACCCGCGG CGTCGATGCG CTGGTCTACG AGATCGACGA AGCCAACAAC
GTGATGAAGA TCGCCTACCC GATCGAACTG TTCGATCGCA ACGTGATCGA TGGCCGCGCG
ATGATCGCCT CGTTCCTGAC GCTGACGATC GGCAACAACC AGGGCATGGG CGACGTCGAA
TACGCCAAGA TGCACGATTT CTACGTGCCG CCCGCGTATC TGCGGCTGTT CGACGGCCCC
TCGACCACGA TCCGGGATCT GTGGCGCGTG CTCGGCCGGC CGGTGGTCGA CGGCGGCTTC
ATCGTCGGCA CCATCATCAA GCCCAAGCTC GGCCTGCGGC CGCAGCCTTT CGCCGATGCC
TGCTACGATT TCTGGCTCGG CGGCGATTTC ATCAAGAACG ACGAACCGCA GGGCAATCAG
GTGTTTGCGC CGTTCAAGGA GACGGTGCGG GCGGTCAACG AGGCGATGCG CCGCGCCCAG
GACAAGACCG GCGAGCCGAA GCTGTTCTCG TTCAACATCA CCGCCGACGA TCACTACGAG
ATGGTGGCGC GCGGCGAATA CATCCTCGAG ACCTTCGCCG ACAACGCCGA CCACGTCGCC
TTCCTGGTCG ACGGCTATGT CGCCGGCCCC GCCGCGGTGA CCACGGCGCG CCGCGCGTTC
CCGAAGCAGT ATCTGCACTA TCATCGCGCC GGCCACGGCG CGGTGACCTC GCCGCAGTCA
AAGCGCGGCT ACACCGCATT CGTGCTGTCG AAGATGGCCC GGCTGCAGGG AGCCTCCGGC
ATCCACACCG GCACCATGGG CTTCGGCAAG ATGGAAGGCG AAGCCGCCGA TCGCGCCATG
GCCTACATGA TCACCGAAGA CTCGGCGGAC GGACCGTTCT TCCACCAGGA ATGGCTCGGC
ATGAATCCGA CCACGCCGAT CATCTCCGGC GGCATGAACG CGCTGCGGAT GCCCGGCTTC
TTCGACAATC TCGGCCACTC CAACCTGATC ATGACCGCGG GCGGCGGCGC CTTCGGCCAT
ATCGACGGCG GCGCGGCGGG CGCCAAGTCG CTGCGGCAGG CTGAGCAGTG CTGGAAGGCT
GGCGCCGATC CGGTCGAATT CGCCAAGGAT CATCGCGAAT TCGCCCGCGC CTTCGAGAGC
TTCCCGCACG ATGCCGATGC GCTGTACCCG AACTGGCGCA ATTCGCTCAA GCTCGCAGCC
GCGTAA
 
Protein sequence
MDQSSRYANL NLKESDLIAG GRHVLCAYIM KPKDGFGNFL QTAAHFSAES STGTNVEVST 
TDDFTRGVDA LVYEIDEANN VMKIAYPIEL FDRNVIDGRA MIASFLTLTI GNNQGMGDVE
YAKMHDFYVP PAYLRLFDGP STTIRDLWRV LGRPVVDGGF IVGTIIKPKL GLRPQPFADA
CYDFWLGGDF IKNDEPQGNQ VFAPFKETVR AVNEAMRRAQ DKTGEPKLFS FNITADDHYE
MVARGEYILE TFADNADHVA FLVDGYVAGP AAVTTARRAF PKQYLHYHRA GHGAVTSPQS
KRGYTAFVLS KMARLQGASG IHTGTMGFGK MEGEAADRAM AYMITEDSAD GPFFHQEWLG
MNPTTPIISG GMNALRMPGF FDNLGHSNLI MTAGGGAFGH IDGGAAGAKS LRQAEQCWKA
GADPVEFAKD HREFARAFES FPHDADALYP NWRNSLKLAA A