Gene RPD_3722 details

Gene Information

Locus tag: RPD_3722
Symbol: rbcL
ID: 4024238
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4156303
End bp: 4157760
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table: 11
GC content: 63%
IMG OID: 637963926
Product: ribulose bisphosphate carboxylase
Protein accession: YP_570844
Protein GI: 91978185
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1850] Ribulose 1,5-bisphosphate carboxylase, large subunit
TIGRFAM ID:
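
The coordinates and lengths in this record are mutually consistent, which a short calculation can confirm. The following is a minimal sketch, assuming the start and end positions are inclusive 1-based coordinates and that the protein length excludes a single terminal stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS, assuming one stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4156303, 4157760)
print(length)                  # 1458
print(protein_length(length))  # 485
```

Both values agree with the Gene Length and Protein Length fields listed in the record.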


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0880816
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00193394
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACGACT CAATCACGGT CCGCGGCAAG GATCGCTACA AATCCGGCGT GATGGAATAC 
AAGAAGATGG GCTATTGGGA GCCTGACTAC GTGCCCAAGG ACACCGACGT CATCGCTCTG
TTCCGCGTCA CCCCGCAGGA CGGCGTCGAT CCGATCGAAG CTTCCGCTGC CGTCGCGGGC
GAATCCTCGA CCGCCACCTG GACCGTGGTG TGGACCGATC GCTTGACCGC GGCCGAGAAG
TATCGTGCGA AGTGCTATCG CGTCGATCCC GTGCCGAATT CGCCCGGCCA GTATTTTGCT
TACATCGCCT ACGATCTCGA CCTGTTCGAG AACGGCTCGA TCGCCAATCT GTCGGCGTCG
ATCATCGGCA ACGTGTTCGG ATTCAAGCCG CTGAAGGCGT TGCGGCTCGA GGACATGCGG
CTGCCGATCG CTTACGTCAA GACGTTCCAG GGGCCGGCCA CCGGCATCGT GGTCGAGCGT
GAGCGCATGG ACAAGTTTGG CCGGCCGCTG CTCGGCGCCA CCGTCAAACC GAAGCTCGGC
CTCTCCGGTC GCAACTACGG CCGCGTGGTC TATGAAGCGC TGAAGGGCGG CCTCGACTTC
ACCAAGGACG ACGAGAACAT CAACTCGCAG CCGTTCATGC ATTGGCGCGA GCGCTTCCTG
TACTGCATGG AGGCGGTCAA CAAGGCGCAG GCAGCGTCGG GCGAGATCAA GGGCACCTAT
CTCAACGTCA CCGCCGGCAC CATGGAGGAG ATGTACGAGC GCGCTGAATT CGCCAAGCAG
CTCGGCTCGG TCATCATCAT GATCGATCTG GTGATCGGCT ACACCGCGAT CCAGTCGATG
GCGAAGTGGG CCCGCAGGAA CGACATGATC CTGCATCTGC ACCGCGCCGG TCATTCCACC
TACACCCGCC AGCGCAATCA TGGCGTGTCG TTCCGCGTTA TCGCCAAGTG GATGCGGCTC
GCCGGTGTCG ATCACATCCA TGCCGGCACC GTGGTCGGCA AGCTGGAGGG CGATCCGTCG
ACCACCAAGG GCTACTACGA CATCTGCCGC GAAGACTACA ACCCGGCCAA TCTCGAGCAC
GGCCTGTTCT TCGACCAGCC CTGGGCGAGC CTGAACAAGC TGATGCCGGT CGCTTCCGGC
GGCATCCATG CCGGCCAGAT GCACCAGTTG CTCGATCTGC TCGGCGAGGA CGTCGTGCTG
CAGTTCGGCG GCGGCACGAT CGGCCATCCG ATGGGCATCG CAGCGGGCGC AACCGCCAAC
CGCGTCGCGC TCGAAGCCAT GATCCTCGCT CGCAACGAGG GCCGCGACTA TGTGCACGAA
GGCCCGGAAA TTCTCGCCAA GGCGGCGCAG ACCTGCACGC CGTTGAAGGC GGCGCTCGAC
ACCTGGAAGA ACGTCTCCTT CAACTACGAA TCCACCGATA CCCCCGACTA TGCGCCGACA
CCCAGCGTCT CGATGTAA
 
Protein sequence
MNDSITVRGK DRYKSGVMEY KKMGYWEPDY VPKDTDVIAL FRVTPQDGVD PIEASAAVAG 
ESSTATWTVV WTDRLTAAEK YRAKCYRVDP VPNSPGQYFA YIAYDLDLFE NGSIANLSAS
IIGNVFGFKP LKALRLEDMR LPIAYVKTFQ GPATGIVVER ERMDKFGRPL LGATVKPKLG
LSGRNYGRVV YEALKGGLDF TKDDENINSQ PFMHWRERFL YCMEAVNKAQ AASGEIKGTY
LNVTAGTMEE MYERAEFAKQ LGSVIIMIDL VIGYTAIQSM AKWARRNDMI LHLHRAGHST
YTRQRNHGVS FRVIAKWMRL AGVDHIHAGT VVGKLEGDPS TTKGYYDICR EDYNPANLEH
GLFFDQPWAS LNKLMPVASG GIHAGQMHQL LDLLGEDVVL QFGGGTIGHP MGIAAGATAN
RVALEAMILA RNEGRDYVHE GPEILAKAAQ TCTPLKAALD TWKNVSFNYE STDTPDYAPT
PSVSM
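
The protein sequence can be related to the gene sequence by codon-by-codon translation. The following is a minimal sketch, assuming the standard amino acid assignments (translation table 11 shares them with the standard code and differs only in which start codons are permitted) and assuming translation stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative, and the code does not handle ambiguous bases or alternative start codons.

```python
from itertools import product

# Codons in the conventional T, C, A, G order, paired with the amino acid
# string used for the standard genetic code ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence as listed in this record.
print(translate("ATGAACGACT CAATCACGGT CCGCGGCAAG"))  # MNDSITVRGK
print(translate("CCCAGCGTCT CGATGTAA"))               # PSVSM
```

The two fragments reproduce the first ten and last five residues of the protein sequence; passing the full gene sequence to `translate` in the same way would be the natural next check.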