Gene RPB_0794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0794 
Symbol 
ID3909608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp889904 
End bp891883 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content66% 
IMG OID637882686 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_484416 
Protein GI86747920 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.410839 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCATC ACCTTCCGTC ACCTGCGCCG TCGAGCGGGC TGCTGTCCAC TCGCCGGGCG 
TTTCTGCAAT CATCCGCGGC CTTCATCGGC GCGCTGTCCC TCGGATCGTC GCTCGGCGCA
CCGGCATGGG GCCGCGACCT TCGACGCTAC CCGATCGCGA CGCCGGCCGA AACGACCGTG
ACGCAGATGC TGGCCTTCCC GGCGACGATC GAGCCCGGCC TGGCGAAGAC CGCGCTGCAT
CAGGTCGCGC GCTACAAGGA TTTCGGCTAC GGCGAATGGA CGCTCGGCTC CGGCCTGCCG
ATCGTGACGC GCACCGACCT GATGCGAGCC GGCTACGAAA AGCCGGTGGG CGGCGAGAGC
AAGCGACTGA TCCGGTTCTT CGCGTTCACC GACGTGCACA TCACCGACAA GGAAGCGCCC
AACCAGCTGA TCGGATTCCA GCAGACCGAG CCCGCGGCGG TGAACAACAC CTCGATCTAT
TCGCCGGTGA TGCCGTACAC GACGCAGGTG CTGGACGCAG CGGTGCAGAC GGTCAACGAC
CTGCACAGCC GCGACCCGTT CGATTTCGGC ATCGCGCTCG GCGACGCCTG CAACAGCACG
TCCTACAATG AAGTGCGCTG GTACATCGAC GTGCTCGACG GCCAGCCGAT CACGCCGAGC
TCCGGCGACC ATCGCGGCCG CGACAGCGTC GACTTCCAGA TGCCGTTTCA GGCCGCCGGT
CTCGCCGCGG ATCTGCCCTG GTATCAGGTC CTCGGCAACC ACGATCATTT CATGATCGGC
TCGTTTCCGG TCGATGCCGA TCCGACGATC GGGCTACGGC AGTCCTACAC GGCCGACAGG
ATCTGGGCGG TCGGCGACGT GCTGAAGCCC AATCGCGAAG GCTTCCCGGC GCTGTTCGAC
TATCGCGGCC TCAAGGCCAC GCCGGCGTAC TATCCGGGAG TGATCGACGG CGCGAGCCCG
TATGGCGCGA TCATTCATAC CGGCCGCGCC GACGATCCGG CCTTCGCCGG CAAGCCGCCG
CAGATCGCCG CCGATCCCGG CCGCCGTCCG CTGGCGCGCG CCGAATGGCT CGCCGAATTC
CGCAACACCA CGACGCGTCC GAAGGGGCAC GGCTTCGATC TGATCGACGG CGCCGGCGAC
GGCTTCGCCT GCTACAGCTT CGTACCGAAA TCCAACCTGC CGCTGAAGGT GATCGTGCTC
GACGTCACCC AGTCCGAACA GGACGGCTCG CGCGACATCC ACGGCCACGG CTTTCTCGAT
GCCAGGCGCT GGGACTGGCT GAAGGCCGAG CTGGCGCGCG GCCAGGCCGA CGATCAGCTG
ATGATCATCG CCAACCACAT TCCGATCGGG GTGTCGCCGA TCGGCTCCGA AATGGAATGG
TGGCTGGGCG ACGCCAATGC GGCGCCTGAT TTTGCCAACG CCGTCGACCT CGCCGGCCTG
GTGACGACGC TGCAGGCTGC GCCGAACCTG CTGATGTGGA TCGCCGGGCA TCGCCATTTG
AATGTGGTGA AGGCGTTCCC CTCCGCCGAC CCGGACCGGC CGGAGCAGGG CTTCTGGCAG
GTCGAGACCT GCTCGCTGCG CGACTTTCCG CAGCAATTCC GGACTTTCGA GATCAGGCTC
AACGCGGACG ACACGGTGTC GATCGAGGCG ATGAATGTCG ATATCGCCGT CGCCGACGGC
ACGCCGGCGG CGCAATCGCG CAAATACGCC ATCGCCACCC AGCAGATCAT CCAGAACGAC
CTGCGGCCCA ACAGCCCGAA CTACGCGACC GCCGGCGGCA AGATTCCGGT GCCGAGCATG
GACCCGACGC GGCCGCAGAG CGACGATCCC AAAGCCACCG ATCCGTCGAT CCGGTTCGTC
GATCTGAGAA GCGCGGACAA GCCGGTCCAG TATCATGCGT CGAGCAATGT CGCACTGCTG
AAGCAGCTCA GCCCGCGGAT GGTCGAGGTG CTGGAGCGGA GGGTGGCGAT GCGGAAGTAG
 
Protein sequence
MTHHLPSPAP SSGLLSTRRA FLQSSAAFIG ALSLGSSLGA PAWGRDLRRY PIATPAETTV 
TQMLAFPATI EPGLAKTALH QVARYKDFGY GEWTLGSGLP IVTRTDLMRA GYEKPVGGES
KRLIRFFAFT DVHITDKEAP NQLIGFQQTE PAAVNNTSIY SPVMPYTTQV LDAAVQTVND
LHSRDPFDFG IALGDACNST SYNEVRWYID VLDGQPITPS SGDHRGRDSV DFQMPFQAAG
LAADLPWYQV LGNHDHFMIG SFPVDADPTI GLRQSYTADR IWAVGDVLKP NREGFPALFD
YRGLKATPAY YPGVIDGASP YGAIIHTGRA DDPAFAGKPP QIAADPGRRP LARAEWLAEF
RNTTTRPKGH GFDLIDGAGD GFACYSFVPK SNLPLKVIVL DVTQSEQDGS RDIHGHGFLD
ARRWDWLKAE LARGQADDQL MIIANHIPIG VSPIGSEMEW WLGDANAAPD FANAVDLAGL
VTTLQAAPNL LMWIAGHRHL NVVKAFPSAD PDRPEQGFWQ VETCSLRDFP QQFRTFEIRL
NADDTVSIEA MNVDIAVADG TPAAQSRKYA IATQQIIQND LRPNSPNYAT AGGKIPVPSM
DPTRPQSDDP KATDPSIRFV DLRSADKPVQ YHASSNVALL KQLSPRMVEV LERRVAMRK