Gene Rsph17029_1962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1962 
Symbol 
ID4895077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2078146 
End bp2079504 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content67% 
IMG OID640112556 
Productcoproporphyrinogen III oxidase 
Protein accessionYP_001043838 
Protein GI126462724 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00538] oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.429779 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAACA TCGCCCTCCT CCAGAGTCTC GGACTCTTCG ATGCGCGCGT GCCGCGCTAC 
ACGAGCTATC CTGCGGCGCC GGTCTTCTCG GGCGCCGTCG GAGCGGACTT TCAGGCACAG
GCGATCGAAG CCCTCGATCC CGCCGTGCCG ATCTCGGTCT ATGTCCACGT CCCCTTCTGC
GAGCGGCTCT GCTGGTTCTG CGCCTGCCGC ACGCAGGGGA CCCAGACGCT GGCCCCGGTC
GAAGCCTATG TGGGCACGCT CCTGCAGGAG CTGGAGCTGG TGAAGCAGCA CCTGCCCGCC
GGCGTAAAGG CCGGGCGGCT GCACTGGGGC GGAGGCACGC CCACGATCCT GTCGCCCGAG
CTGATCCACA AGCTCGCGCA GGCGATCAAG GCCGTCATCC CCTTCGCCGA GGACTACGAA
TTCTCGGTCG AAATCGATCC GATGATGGTC GATGAGCCCA AGATCCGGGC GCTGAGCGAG
GAGGGCATGA ACCGCGCCTC GATCGGCATC CAGGACTTCA CCGACATCGT GCAGAATGCG
ATCGGGCGCG AGCAGCCCTT CGAGAACACC AAGGCCTGCG TCGAGACGCT GCGCCGCTAC
GGCGTCCATT CGCTGAACAC CGACCTCGTC TACGGGCTGC CGCACCAGAA CCGCGAGAGC
CTTGCCGCCA CCATCGACAA GGTGCTCTCG CTGAGGCCCG ACCGCGTGGC GATCTTCGGC
TATGCCCATG TGCCGTGGAT GGCCAAGCGC CAGAAGCTGA TCGACGAGAC CGTGCTGCCC
CCCGACATCG AGCGGCACGA GCTGGCCAAT CTGGCGGCGC GGCTCTTCAC CGAAGGCGGG
TTCGAGCGCA TCGGGATCGA CCATTTCGCG CTGCCCGACG ACAGCATGGC GGTGGCTGCC
CGCAGCGGAA AGCTGCGCCG CAACTTCCAG GGCTACACCG ACGACACCTG CCCGACGCTT
CTGGGCATCG GCGCCTCGTC GATCTCGAAG TTCGAGCAGG GCTATCTGCA GAACACGGCC
GCCACCGCCG CCTATATCAA GTCGATCGAG GAGGGGCGGC TGCCCGGCTA CCGGGGCCAC
CGCATGACCG AGGAGGATTA CCTCCACGGC CGCGCCATCG AGATGATCAT GTGCGACTTC
TTCCTCGACC TGCCCGCGCT GCGCGCGCGC TTCGGCGAGC CGGCCGAGAC CATGGTTCCG
CGCATCGCCG AGGCGGCCGA GAAGTTCACG CCCTTCGTCA CGGTGGACGC GGACGGCTCG
ATGTCGATTG CGAAGGAAGG CCGGGCGCTG GCGCGGATGA TCGCGCGGCT GTTCGACGCC
TACGAGACGC CGGAAGCCCG CTACTCGCAG GCCTCGTGA
 
Protein sequence
MTNIALLQSL GLFDARVPRY TSYPAAPVFS GAVGADFQAQ AIEALDPAVP ISVYVHVPFC 
ERLCWFCACR TQGTQTLAPV EAYVGTLLQE LELVKQHLPA GVKAGRLHWG GGTPTILSPE
LIHKLAQAIK AVIPFAEDYE FSVEIDPMMV DEPKIRALSE EGMNRASIGI QDFTDIVQNA
IGREQPFENT KACVETLRRY GVHSLNTDLV YGLPHQNRES LAATIDKVLS LRPDRVAIFG
YAHVPWMAKR QKLIDETVLP PDIERHELAN LAARLFTEGG FERIGIDHFA LPDDSMAVAA
RSGKLRRNFQ GYTDDTCPTL LGIGASSISK FEQGYLQNTA ATAAYIKSIE EGRLPGYRGH
RMTEEDYLHG RAIEMIMCDF FLDLPALRAR FGEPAETMVP RIAEAAEKFT PFVTVDADGS
MSIAKEGRAL ARMIARLFDA YETPEARYSQ AS