Gene RPB_3964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3964 
SymbolrbcL 
ID3911771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4525057 
End bp4526514 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content64% 
IMG OID637885868 
Productribulose bisophosphate carboxylase 
Protein accessionYP_487568 
Protein GI86751072 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1850] Ribulose 1,5-bisphosphate carboxylase, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.909404 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.170199 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAGT CAGTCACCGT CCGCGGCAAG GATCGCTACA AATCCGGCGT GATGGAATAC 
AAGAAGATGG GCTATTGGGA GCCCGACTAC GAGCCCAAAG ACACCGACGT CATCGCGCTG
TTCCGTGTCA CGCCGCAGGA CGGCGTGGAT CCGATCGAGG CATCGGCAGC GGTGGCCGGC
GAGTCTTCGA CCGCGACCTG GACCGTGGTG TGGACCGACC GTCTGACCGC GGCGGAGAAG
TACCGCGCGA AGTGCTATCG CGTCGATCCG GTGCCGAATT CGCCCGGCCA GTATTTCGCT
TACATCGCCT ACGATCTCGA CCTGTTCGAG AATGGCTCGA TCGCCAATCT GTCGGCGTCG
ATCATCGGCA ACGTGTTCGG CTTCAAGCCA CTGAAAGCGC TGCGGCTCGA GGACATGCGG
CTGCCGGTCG CCTATGTGAA GACGTTCCAG GGCCCCGCCA CCGGCATCGT GGTCGAGCGC
GAGCGGATGG ACAAGTTCGG ACGTCCCTTG CTCGGCGCCA CCGTCAAGCC GAAGCTCGGC
CTGTCGGGCC GCAACTACGG CCGCGTCGTC TACGAGGCGC TGAAGGGCGG GCTCGACTTC
ACCAAGGACG ACGAGAACAT CAACTCGCAG CCGTTCATGC ATTGGCGCGA GCGCTTCCTG
TATTGCATGG AGGCGGTGAA CAAGGCGCAG GCGGCGTCGG GCGAGATCAA GGGCACCTAT
CTCAACGTCA CCGCCGGCAC CATGGAGGAG ATGTACGAGC GCGCCGAATT CGCCAAACAG
CTCGGCTCGG TGATCATCAT GATCGACCTG GTGATCGGCT ACACCGCGAT CCAGTCGATG
GCGAAATGGG CACGCAAGAA CGACATGATC TTGCATCTGC ATCGCGCCGG CCATTCGACC
TACACCCGCC AGCGCAATCA CGGCGTGTCG TTCCGCGTCA TCGCCAAATG GATGCGGCTC
GCCGGCGTCG ATCACATCCA TGCCGGCACC GTGGTCGGCA AGCTCGAAGG CGATCCGGCG
ACCACCAAGG GCTACTACGA CATCTGCCGC GAGGACTACA ACCCGGCGAA TCTCGAGCAC
GGCCTGTTCT TCGACCAGCA CTGGGCCAGC CTGAACAAGC TGATGCCGGT GGCCTCGGGC
GGCATCCATG CCGGCCAGAT GCACCAGCTG CTCGACCTGC TCGGTGAGGA CGTCGTGCTG
CAGTTCGGCG GCGGCACCAT CGGCCACCCG ATGGGCATCG CGGCCGGCGC CACCGCCAAT
CGCGTCGCGC TGGAGGCGAT GATCCTCGCT CGCAACGAGG GCCGCGACTA CGTGCACGAA
GGCCCGGAGA TTCTCGCCAA GGCGGCGCAG ACCTGCACGC CGCTGAAGGC TGCGCTCGAC
ACCTGGAAGA ACGTGTCCTT CAATTACGAA TCCACCGACA CCCCCGACTA CGCGCCGACC
CCCAGCGTCT CGGTCTAA
 
Protein sequence
MNESVTVRGK DRYKSGVMEY KKMGYWEPDY EPKDTDVIAL FRVTPQDGVD PIEASAAVAG 
ESSTATWTVV WTDRLTAAEK YRAKCYRVDP VPNSPGQYFA YIAYDLDLFE NGSIANLSAS
IIGNVFGFKP LKALRLEDMR LPVAYVKTFQ GPATGIVVER ERMDKFGRPL LGATVKPKLG
LSGRNYGRVV YEALKGGLDF TKDDENINSQ PFMHWRERFL YCMEAVNKAQ AASGEIKGTY
LNVTAGTMEE MYERAEFAKQ LGSVIIMIDL VIGYTAIQSM AKWARKNDMI LHLHRAGHST
YTRQRNHGVS FRVIAKWMRL AGVDHIHAGT VVGKLEGDPA TTKGYYDICR EDYNPANLEH
GLFFDQHWAS LNKLMPVASG GIHAGQMHQL LDLLGEDVVL QFGGGTIGHP MGIAAGATAN
RVALEAMILA RNEGRDYVHE GPEILAKAAQ TCTPLKAALD TWKNVSFNYE STDTPDYAPT
PSVSV