Gene RPD_1054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1054 
Symbol 
ID4021530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1208614 
End bp1209999 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content64% 
IMG OID637961246 
Productribulose bisphosphate carboxylase 
Protein accessionYP_568193 
Protein GI91975534 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1850] Ribulose 1,5-bisphosphate carboxylase, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.559684 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCAAT CGAACCGCTA CGCCAACCTC AACCTCAAAG AAAGCGATCT GATCGCCGGC 
GGGCGGCATG TGCTGTGCGC CTACATCATG AAGCCGAAGG CCGGGTTCGG TAATTTCGTG
GAAACGGCGG CGCATTTCGC CGCCGAGTCC TCCACCGGAA CCAATGTCGA AGTCTCGACC
ACCGACGACT TCACCCGCGG CGTCGACGCG CTCGTCTACG AGGTCGACGA AGCCAAGGAA
CTGATGAAGA TCGCCTATCC GATCGAGCTG TTCGACCGCA ACGTGATCGA CGGCCGCGCG
ATGATCGCCT CGTTCCTGAC GCTGACGATC GGCAACAACC AGGGCATGGG CGACGTCGAA
TACGCCAAGA TGCACGACTT CTACGTGCCG CCGGCCTATC TGCGGCTGTT CGACGGTCCG
TCGACCACGA TCAAGGATCT GTGGCGCGTG CTCGGCCGGC CGGTGGTCGA TGGCGGCTTC
ATCGTCGGCA CCATCATCAA GCCGAAGCTC GGCCTGCGGC CGCAGCCGTT CGCCGACGCC
TGCTACGACT TCTGGCTCGG CGGCGACTTC ATCAAGAACG ACGAGCCGCA GGGCAATCAG
GTGTTCGCGC CGTTCAAGGA CACCGTGCGC GCGGTCAACG ACGCGATGCG CCGCGCTCAG
GATGCGACCG GTCAGCCCAA GCTGTTCTCG TTCAACATCA CCGCCGACGA TCACTACGAG
ATGCTGGCGC GTGGCGAGTA CATCCTGGAG ACGTTCGGCG AGAACGCCGA TCACGTCGCC
TTCCTGGTCG ACGGTTACGT CGCCGGTCCG GCCGCGGTGA CCACCGCGCG CCGTGCGTTC
CCGAAGCAGT ATCTGCACTA TCATCGCGCC GGCCATGGCG CGGTGACCTC GCCGCAGAGC
AAGCGCGGCT ACACCGCTTT CGTGCTGTCG AAGATGGCGC GACTGCAGGG CGCCTCCGGC
ATCCACGTCG GCACCATGGG CTATGGCAAG ATGGAAGGCG AAGCTTCCGA TCGCGATTCC
GCTTTCATGA TCACCCAGGA TTCAGCCGAG GGTCCGTACT TCAAGCAGGA GTGGCTCGGC
ATGAACCCGA CCACGCCGAT CATCTCCGGC GGCATGAACG CGCTGCGGAT GCCCGGCTTC
TTCGCCAATC TCGGCCACTC CAACCTGATC ATGACTGCAG GCGGCGGCGC CTTCGGTCAT
ATCGATGGCG GCGCGGCCGG CGCCAGGTCG CTGCGGCAGG CCGAGCAGTG CTGGAAGCAG
GGCGCCGATC CGGTCGCCTT CGCCAAGGAC CACCGCGAAT TCGCCCGCGC CTTCGAGAGC
TTCCCTAACG ACGCCGACAA GCTGTATCCG AACTGGCGCA ACATGCTGAA GCTCGCTGCC
GCGTGA
 
Protein sequence
MDQSNRYANL NLKESDLIAG GRHVLCAYIM KPKAGFGNFV ETAAHFAAES STGTNVEVST 
TDDFTRGVDA LVYEVDEAKE LMKIAYPIEL FDRNVIDGRA MIASFLTLTI GNNQGMGDVE
YAKMHDFYVP PAYLRLFDGP STTIKDLWRV LGRPVVDGGF IVGTIIKPKL GLRPQPFADA
CYDFWLGGDF IKNDEPQGNQ VFAPFKDTVR AVNDAMRRAQ DATGQPKLFS FNITADDHYE
MLARGEYILE TFGENADHVA FLVDGYVAGP AAVTTARRAF PKQYLHYHRA GHGAVTSPQS
KRGYTAFVLS KMARLQGASG IHVGTMGYGK MEGEASDRDS AFMITQDSAE GPYFKQEWLG
MNPTTPIISG GMNALRMPGF FANLGHSNLI MTAGGGAFGH IDGGAAGARS LRQAEQCWKQ
GADPVAFAKD HREFARAFES FPNDADKLYP NWRNMLKLAA A