Gene Rsph17029_4004 details

Gene Information

Locus tag: Rsph17029_4004
Symbol:
ID: 4899144
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 1148065
End bp: 1149444
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 66%
IMG OID: 640114607
Product: ribulose bisphosphate carboxylase
Protein accession: YP_001045854
Protein GI: 126464741
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1850] Ribulose 1,5-bisphosphate carboxylase, large subunit
TIGRFAM ID:
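
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable and function names are illustrative and are not part of the record.

```python
# Values copied from the record; names are illustrative only.
start_bp = 1148065
end_bp = 1149444
gene_length_bp = 1380
protein_length_aa = 459


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```

Under these assumptions the span 1148065 to 1149444 covers 1380 bp, which is 460 codons, one of which is the stop codon.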


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.988794
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCAGT CCAACCGCTA CGCCCGGCTT GATCTGAAGG AAGCCGATCT GATCGCAGGC 
GGCCGTCACG TTCTCTGCGC CTATGTCATG AAGCCCAAGG CGGGCTACGG CTATCTGGAG
ACGGCGGCCC ACTTCGCGGC CGAGAGCTCG ACCGGCACCA ACGTCGAGGT CTCGACCACC
GACGATTTCA CCCGCGGCGT CGATGCGCTC GTCTACGAGA TCGACCCGGA GAAGGAGATC
ATGAAGATCG CCTATCCGGT CGAGCTCTTC GACCGCAACA TCATCGACGG GCGGGCGATG
CTCTGCTCGT TCCTGACGCT GACGATCGGC AACAACCAGG GCATGGGCGA TGTCGAATAT
GCCAAGATGC ACGATTTCTA CGTGCCGCCC TGCTATCTGC GCCTGTTCGA CGGCCCCTCG
ATGAACATCG CCGACATGTG GCGCGTGCTG GGGCGCGACG TGCGCAACGG CGGCATGGTG
GTGGGCACGA TCATCAAGCC GAAGCTCGGG CTGCGGCCGA AACCCTTCGC AGATGCCTGC
CACGAGTTCT GGCTGGGCGG CGACTTCATC AAGAACGACG AGCCGCAGGG CAACCAGACC
TTCGCGCCGC TGAAGGAGAC GATCCGCCTC GTGGCCGATG CGATGAAGCG CGCGCAGGAC
GAGACCGGCG AGGCCAAGCT CTTCTCGGCC AACATCACCG CGGACGACCA TTACGAGATG
GTGGCGCGCG GGGAATACAT CCTCGAGACC TTCGGCGAGA ATGCCGACCA TGTGGCCTTC
CTCGTCGACG GCTATGTGAC GGGCCCCGCG GCCATCACCA CCGCGCGGCG CCAGTTCCCG
CGCCAGTTCC TGCATTATCA CCGGGCGGGG CACGGCGCCG TCACCTCGCC GCAGTCGATG
CGGGGCTATA CGGCCTTCGT GCTCTCGAAG ATGGCGCGCC TGCAGGGGGC CTCGGGCATC
CACACCGGCA CCATGGGCTA TGGCAAGATG GAGGGCGAGG CGGCCGACAA GATCATGGCC
TACATGCTGA CCGACGAGGC GGCCGAGGGG CCCTTCTACC GTCAGGACTG GCTGGGGCTG
AAGGCCACGA CGCCCATCAT CTCGGGCGGC ATGAACGCGC TGCGGCTGCC GGGCTTCTTC
GACAATCTCG GCCATTCCAA CGTGATCCAG ACCTCGGGCG GCGGCGCCTT CGGCCATCTC
GACGGCGGCA CGGCGGGGGC GAAGTCGCTG CGCCAGTCGC ACGAGGCCTG GATGGCGGGG
GTGGATCTCG TGACCTATGC CCGCGAGCAT CGCGAGCTTG CCCGTGCCTT CGAGAGCTTC
CCCGCGGATG CCGACAAGTT CTATCCGGGC TGGCGCGACC GGCTGCAGCG CGCAGCCTGA
 
Protein sequence
MDQSNRYARL DLKEADLIAG GRHVLCAYVM KPKAGYGYLE TAAHFAAESS TGTNVEVSTT 
DDFTRGVDAL VYEIDPEKEI MKIAYPVELF DRNIIDGRAM LCSFLTLTIG NNQGMGDVEY
AKMHDFYVPP CYLRLFDGPS MNIADMWRVL GRDVRNGGMV VGTIIKPKLG LRPKPFADAC
HEFWLGGDFI KNDEPQGNQT FAPLKETIRL VADAMKRAQD ETGEAKLFSA NITADDHYEM
VARGEYILET FGENADHVAF LVDGYVTGPA AITTARRQFP RQFLHYHRAG HGAVTSPQSM
RGYTAFVLSK MARLQGASGI HTGTMGYGKM EGEAADKIMA YMLTDEAAEG PFYRQDWLGL
KATTPIISGG MNALRLPGFF DNLGHSNVIQ TSGGGAFGHL DGGTAGAKSL RQSHEAWMAG
VDLVTYAREH RELARAFESF PADADKFYPG WRDRLQRAA
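
The sequences above are laid out in blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the gene sequence could be stripped of whitespace, checked for GC content, and translated. It assumes only the Python standard library; the function names are illustrative. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code, so a single lookup table is used here. Only the first row of the gene sequence is included as sample input.

```python
from itertools import product

# Amino acids for the 64 codons, enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from this record (60 bases).
first_row = "ATGGACCAGT CCAACCGCTA CGCCCGGCTT GATCTGAAGG AAGCCGATCT GATCGCAGGC"
print(translate(first_row))
print(round(gc_percent(first_row), 1))
```

Translating the first row yields the first 20 residues of the protein sequence listed above (MDQSNRYARL DLKEADLIAG). Passing the full gene sequence to the same functions would allow a comparison against the GC content and protein length fields of the record.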