Gene Rsph17029_1058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1058 
Symbol 
ID4896048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1089337 
End bp1090380 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content69% 
IMG OID640111645 
Productcytochrome-c peroxidase 
Protein accessionYP_001042941 
Protein GI126461827 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0622508 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.212255 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTGA CCCTCACCGT CCTGATCGCG ACGACGGCCC TCGCCGGCGC GGCTCAGGCC 
GACGCCCTCC GGGACAAGGC TCTGGGATAT TTTGCCCCGC TGCCCTCGAC GGTTCCGGCC
GTGAAGGACA ACCGCATCAC CCCGGAGAAG ATCGAGCTCG GCAAGGCGCT CTTCTTCGAT
CCGCGTCTGT CGGCCTCGGG CGTCTTCTCC TGCTATTCCT GCCACAACCT CACGACGGGC
GGCGGCGACA ACCTCGAGAC CTCGATCGGC CACGGCTGGC AGAAGGGGCC GCGGAACGCG
CCCACCGTGC TCAATGCGGT GCTGAACGAG GCGCAGTTCT GGGACGGGCG GGCCGACGAC
CTGAAGGCGC AGGCCAAGGG GCCGGTGCAG GCGGGCGTCG AGATGGCGAA CACGCCCGGG
CAGGTCGAGG TGACGCTGAA ATCCCTGCCG CAATATGTCG ACTGGTTCGC CGCCGCCTTC
CCGGGCGAGC CGGAGCCCAC CAGCTTCGAC AACATGGCCA AGGCCATCGA GGCCTTCGAG
GCGACGCTCA TCACGCCTGC GCCCTTCGAC GCCTTCCTGA ACGGAGACGA TGCGGCCCTG
ACCGAGGATC AGCGGGCGGG CCTCGATCTC TTCATCGACA AGGGCTGCTC GACCTGCCAC
TCGGGCGTGA ACGTGGGCGG GCACGGCTAC TATCCGTTCG GCCTGATCGA GAAGCCCGGC
GCGGACATCC TGCCCGAGGG CGACAAGGGC CGTTTCGCGG TGACGGCCAC GGTGGACGAC
GAATATGTCT TCCGGGCGGC GCCGCTGCGC AACGTGGCGG TCACGGCGCC CTATTTCCAC
TCGGGCAAGG TGTGGGACCT GAAGACCGCC GTCACGATCA TGGCCGAGAG CCAGCTCGGC
GAGACGATGA GCGATCAGGA GGTGGGGCAG GTCGTGGCCT TCCTCGAGAG CCTCACGGGG
ACCATGCCGC CGGTCACGCT GCCGGTGCTG CCTGCCGAGA CGGCAGGCAC GCCGCGCCCC
ACGGCCGAGA TCCGGGTCGA GTGA
 
Protein sequence
MRLTLTVLIA TTALAGAAQA DALRDKALGY FAPLPSTVPA VKDNRITPEK IELGKALFFD 
PRLSASGVFS CYSCHNLTTG GGDNLETSIG HGWQKGPRNA PTVLNAVLNE AQFWDGRADD
LKAQAKGPVQ AGVEMANTPG QVEVTLKSLP QYVDWFAAAF PGEPEPTSFD NMAKAIEAFE
ATLITPAPFD AFLNGDDAAL TEDQRAGLDL FIDKGCSTCH SGVNVGGHGY YPFGLIEKPG
ADILPEGDKG RFAVTATVDD EYVFRAAPLR NVAVTAPYFH SGKVWDLKTA VTIMAESQLG
ETMSDQEVGQ VVAFLESLTG TMPPVTLPVL PAETAGTPRP TAEIRVE