Gene Rsph17025_4020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_4020 
Symbol 
ID5086194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009430 
Strand
Start bp50571 
End bp51956 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content67% 
IMG OID640485578 
Productribulose bisphosphate carboxylase 
Protein accessionYP_001170178 
Protein GI146280021 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1850] Ribulose 1,5-bisphosphate carboxylase, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCAGT CCAACCGCTA CGCCCGGCTT GACCTCAAGG AAGCCGATCT GATCGCCGGC 
GGCCGTCACG TCCTCTGCGC CTATGTCATG AAGCCCAAGG CCGGCTACGG CTACCTTGAA
ACGGCGGCCC ATTTCGCCGC CGAAAGCTCC ACCGGCACCA ACGTCGAGGT CTCGACCACC
GACGATTTCA CCCGCGGCGT CGATGCGCTC GTCTATGAGA TCGACGAGGC GCGCGAGCTG
ATGAAGATCG CCTATCCGGT CGAGCTGTTC GACCGCAACA TCATCGACGG GCGGGCGATG
CTCTGCTCGT TCCTGACGCT GACGATCGGC AACAACCAGG GCATGGGCGA CGTCGAATAC
GCCAAGATGC ACGACTTCTA CGTGCCGCCC TGCTACCTGC GCCTGTTCGA CGGCCCCTCG
ATGAACATCG CCGACATGTG GCGGGTGCTG GGGCGCAGCG TGACCGACGG CGGCATGGTC
GTGGGCACCA TCATCAAGCC GAAGCTGGGC TTGCGGCCCA AGCCCTTCGC GGACGCCTGC
TACGAATTCT GGCTGGGCGG CGACTTCATC AAGAACGACG AGCCCCAGGG CAACCAGACC
TTCGCGCCGC TGAAAGAGAC GATCCGTCTG GTCGCCGACG CCATGAAGCG GGCGCAGGAC
GAGACCGGCG AGGCGAAGCT CTTCTCGGCC AACATCACCG CCGACGACCA TTACGAGATG
GTGGCGCGCG GCGAATACAT CCTTGAGACC TTCGGCGAGA ACGCCGACCA TGTGGCCTTC
CTGGTGGATG GCTATGTGAC CGGCCCCGCG GCCATCACCA CCGCGCGGCG CCAGTTCCCG
CGGCAGTTCC TGCACTACCA CCGCGCGGGT CACGGGGCGG TGACCTCGCC GCAGTCGATG
CGCGGCTACA CCGCCTTCGT GCTCTCGAAG ATGGCGCGCC TGCAGGGCGC CTCCGGCATC
CACACCGGCA CGATGGGCTT TGGCAAGATG GAGGGCGAGG CCGCCGACAG GATCATGGCC
TTCATGCTGA CCGACGATGC GGCGCAGGGG CCCTTCTACC CGCAGGACTG GCTGGGGATG
AAGGCCACGA CGCCGATCAT CTCGGGGGGC ATGAACGCGC TCAGGCTGCC GGGCTTCTTC
GACAACCTCG GCCATTCGAA TGTGATCCAG ACCTCGGGCG GCGGCGCCTT CGGCCATCTC
GACGGCGGCA CGGCGGGCGC GAAGTCGCTG CGCCAGGCGC ATGACGCCTG GAAGGCGGGC
GTGGATCTCG TCACCTATGC CCGCGAGCAT CGCGAACTGG CCCGCGCCTT CGAAAGCTTC
CCGGCCGATG CCGACCGGTT CCACCCCGGC TGGCGCGAGA AGCTGCAGCT CGCCGGCGCG
GCCTGA
 
Protein sequence
MDQSNRYARL DLKEADLIAG GRHVLCAYVM KPKAGYGYLE TAAHFAAESS TGTNVEVSTT 
DDFTRGVDAL VYEIDEAREL MKIAYPVELF DRNIIDGRAM LCSFLTLTIG NNQGMGDVEY
AKMHDFYVPP CYLRLFDGPS MNIADMWRVL GRSVTDGGMV VGTIIKPKLG LRPKPFADAC
YEFWLGGDFI KNDEPQGNQT FAPLKETIRL VADAMKRAQD ETGEAKLFSA NITADDHYEM
VARGEYILET FGENADHVAF LVDGYVTGPA AITTARRQFP RQFLHYHRAG HGAVTSPQSM
RGYTAFVLSK MARLQGASGI HTGTMGFGKM EGEAADRIMA FMLTDDAAQG PFYPQDWLGM
KATTPIISGG MNALRLPGFF DNLGHSNVIQ TSGGGAFGHL DGGTAGAKSL RQAHDAWKAG
VDLVTYAREH RELARAFESF PADADRFHPG WREKLQLAGA A