Gene Rsph17029_3626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3626 
Symbol 
ID4898605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp718797 
End bp720449 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content66% 
IMG OID640114234 
Productheme peroxidase 
Protein accessionYP_001045488 
Protein GI126464375 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.897978 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.572751 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCGCCG CGCTGCGCAG CAAGCGCCGG GACAAGTTCG TCCCGCGCCT GCGGGTCTTC 
CTGCTGCAGA ACTTCGCGCC GGTCTGGCGC ACGGTGGAGC GGGTTCCCCC GCTCGCCCGG
CTGGTCAATC GCCTCATCAT CAACACGGCT TGCAACACCG CCGCCTTCCG TCCCCATGTC
CTGAGCACGC TCGGGGACTA CACGTCCTGG AGCTCCCTCA CCGACCGGAC CTATCTGGGC
CGCATGGTGC CGCCCCTGCC TGCAACCGCG CAAGGCGGCC CCTCGCTCGC AGAGACCACC
GCGCTCTTCG CGCCGCTGGG ACCGCAGCAG CAGTGCCCCA AGTCGACGCT GCTGTTCCCG
ATCTTCGCGC AATATCTCAC CGATGGCTTC CTGCGCACCA GCAGCACGGA TCGCGCAAGG
ACCACCTCCA ACCATGACAT CGACCTGTCA CCGCTCTACG GCCGCACCCC TGCCCAGACC
CTGGCGCTCC GGGCGACACC GGATCAAGGC CATGGAAAAG GCCGCCTGAA GTCGCAGTGG
ATCGGGGACG AGGAGTTTCC GCCCGATCTC TACCGGACCG GCACCAGCGA CATTGCCGCC
GATTTCGCGG ATGCCCACGG CACGTCCCTG CTCGACCTGC CGCTCGGCAT GAACGCCGAT
CCGCCCTGGG CCGCGGGCGA GCATTCGCGG CGCAGGCTGT TCGCGGTGGG CGGCGACCGC
GTGAACTCGA CCGCGCTCGT CGCGATGCTG AACACGCTGT TCCTGCGCGA GCACAACCGC
CTTGCGCGCG AACTGGAGAG GCGCAACCCC GGCTGGGACG ATACGCGCGT GTTCGAGACG
GCCCGCAACA TCGTGATCGT TCTCTTCATC AAGATCGTGA TCGAGGAATA TATCAATCAC
ATCTCCTCGG CCTGTTTCCG GCTCCGTGCC GATCCGCGCG TGGCGTGGAA GGCACCGTGG
AACAAGCCGA ACTGGATGAC GGTCGAATTC TCGCTGCTCT ATCGCTGGCA TTCGCTGGTG
CCGGAGACGA TGCTGTGGGA CGGGACGCGG ATGGATACCG CAGCCATCCT GCTCGACAAC
ACGAAGCTCA TCGAGGCGGG GCTCGCGAAG GCCTTCAAAT GGGCCGGCCA GACGCCCGCC
GCCCGCCTCG GGCTGCACAA TACCGCGATC TATCTGGAGA ACCAGCTCAC GGTCGAGTCC
CGGGCCATCG AACAGAACCG CGCGCGCCGG CTGCCGGGCT ACAATGCCTA TCGGAAGGCG
ATGGGCATGA ATCCCGTCGA CGATTTCGAC TGCATGACCG GCGACCGGGC GCGGCAGGAA
GAGCTCCGGG CCCTCTACCG GACGCCCGAG GCGGTCGACT TCTATGTCGG ACTGTTTGCC
GAGGATGCCG GCCTGAACAC GCCGATGCCG CCGCTCCTCG GCGCCATGGT GGCGCTCGAT
GCCTTTTCGC AGGCGCTGAA CAATCCGCTC CTGTCGAAGC AGGTCTATGG CAAGGAAACC
TTCACCGGCT ACGGGCTCGA CGTCATCGAG GCGACGGGAA CCCTCTGGGA CATTCTCGTG
CGCAACCTCG GCCCCACCGC GCCCTCCGAC ATCAGGGCGG AGGATGTCCG CATGACGCGG
CCCGATTGGC GCCGCAGGTT CTCGGCGTTC TAG
 
Protein sequence
MSAALRSKRR DKFVPRLRVF LLQNFAPVWR TVERVPPLAR LVNRLIINTA CNTAAFRPHV 
LSTLGDYTSW SSLTDRTYLG RMVPPLPATA QGGPSLAETT ALFAPLGPQQ QCPKSTLLFP
IFAQYLTDGF LRTSSTDRAR TTSNHDIDLS PLYGRTPAQT LALRATPDQG HGKGRLKSQW
IGDEEFPPDL YRTGTSDIAA DFADAHGTSL LDLPLGMNAD PPWAAGEHSR RRLFAVGGDR
VNSTALVAML NTLFLREHNR LARELERRNP GWDDTRVFET ARNIVIVLFI KIVIEEYINH
ISSACFRLRA DPRVAWKAPW NKPNWMTVEF SLLYRWHSLV PETMLWDGTR MDTAAILLDN
TKLIEAGLAK AFKWAGQTPA ARLGLHNTAI YLENQLTVES RAIEQNRARR LPGYNAYRKA
MGMNPVDDFD CMTGDRARQE ELRALYRTPE AVDFYVGLFA EDAGLNTPMP PLLGAMVALD
AFSQALNNPL LSKQVYGKET FTGYGLDVIE ATGTLWDILV RNLGPTAPSD IRAEDVRMTR
PDWRRRFSAF