Gene RPB_2219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2219 
Symbol 
ID3907961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2531968 
End bp2533695 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content64% 
IMG OID637884114 
Producthemolysin activation/secretion protein 
Protein accessionYP_485835 
Protein GI86749339 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2831] Hemolysin activation/secretion protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0458384 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.400148 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGATTCG AAATTAGCTA TCAGCGTCGC GGCTCCACGG CCGCGCGCGC GGCGGTGTTT 
TGCCTGAAAT CGGCGACGGT CGGACTGGCC GCAGTCGGCC TGGTGAGGCC GGCTGACGCC
CGCGAGGTTG CCGTCTCGGC CGCAGCGCAG CCGGCCGCGG CGCAGCCGTC CAATCCGGCC
GCTCAACCGC AGCCGGCGCA ACGCTTCGAC ATCGATGATT TCGCAATCCA GGGCGCCGAC
AAGTTGCCGC AGATCGAGAT CGAAGAGGCG GTCTATCCGT TTCTCGGACC GAACAAGACG
GCGGATGACG TCGAGAAGGC GCGGGCCGCG CTGGAGAAGG CGTATCACGA CAAGGGATTT
CAGACCGTCA GTGTGGCGGT GCCTCCACAG AATGTCGGGC GCAAGGTGGT GGTCCTGAAG
GTCACCGAGA TGAAGGTCGG CCGGCTGCGG GTGAAGAATT CGCGTTATTT CGACGTCGAC
AGGATCAAGC AGACGGCGCC GTCGCTGAAG GAGGGCACGG TTCCTAACTT CAAGGACGTC
ACCAAGGACA TCGTCGCGCT CAACCAGTGG CCCGACCGCC GGGTCACACC CGCGCTGCGA
GCCGGCGTCG CACCGGGCAC GGTCGACGTC GACCTCAACG TCGAAGACAA GGCGCCGATC
CACGCTTCCG TCGAAGTGAA CAATCGCCAG TCGCCGAGCA CCACGGCGAC GCGTCTGAAC
GCCACGGTCC ACTACGACAA TCTCTGGCAG CTCGGGCATT CGGCGAGCTT CACCTATCAG
GTCGCGCCGG AGCGGCGGCA GGATGCCGAG GTGTTCTCCG GCTCGTACCT CGCGCGGCTG
CCCAATCTCG ATTGGATCAA CCTGCTGATC TACGGCGTGT CGTCGAGTAG CAGCGTCGCC
AGCGTCGGCG GTACCAACAT CGTCGGTCCC GGCCAGATCA TCGGCACGCG TGCGATAATG
ACGTTGCCGG GGCGCGACGG CTTCTTTCAC ACGCTGTCGG CGGGCCTCGA CTACAAGCAT
TTCGACCAGA CGGTGGCGCT CGGCGCAGAC GCCTTCTCGT CGCCGGTCAC CTACTATCCG
GCAGTCGCCA GCTACGGCGC GACCTTCCAG GGCGAAAACT ACACAACGCA GTTCAACGCC
TCGGTCACCT ACAATCTGCG AACCCTGTCG AGCAGCGCCG CCGATTTCGA CGCCAAGAGA
TATTTCGCAT CGCCGAGCTT CACCCATTTC AACGCGGATG TGTCGCATAC CCAGGAATTG
CCTGAGGGAT TGCAGCTCTG GGGCAAGGTC GCGTCGCAGG TCGCGGACGG ACCGTTGGTG
TCGAGCGAGC AGATCAGCGT GGGCGGCATG GACACCGTGC GCGGCTATCT CGAATCCGAA
ACGCTCGGCG ACGACGGCGT CGTCGGCAAT TTCGAACTCC GTAGTCCGGA CATCGGTGCC
TGGCTGCAGA AGGAGATGAA GGACGAGACC GGGCAGGGAA CACCCCGCTT CACCACGTTC
AACGAGTGGC GGGTCTTCGG CTTCGCGGAC GCGGGCCACG CCAACGTTCA GCGGCCGTTG
CCCAACCAGA TCTCGTCGTT CGACCTGTGG AGCTACGGCG TCGGCTCCCG GTTCAAAGTG
TTCAGCACGA TTAACGGGGT CGTCGTGCTG TCGGTGCCGA TGAAGGACCA AGCCTACACC
CGCGCGGGTG ATCCGCGCTT CAATTTCCGT GTTTGGGGCG AATTCTGA
 
Protein sequence
MRFEISYQRR GSTAARAAVF CLKSATVGLA AVGLVRPADA REVAVSAAAQ PAAAQPSNPA 
AQPQPAQRFD IDDFAIQGAD KLPQIEIEEA VYPFLGPNKT ADDVEKARAA LEKAYHDKGF
QTVSVAVPPQ NVGRKVVVLK VTEMKVGRLR VKNSRYFDVD RIKQTAPSLK EGTVPNFKDV
TKDIVALNQW PDRRVTPALR AGVAPGTVDV DLNVEDKAPI HASVEVNNRQ SPSTTATRLN
ATVHYDNLWQ LGHSASFTYQ VAPERRQDAE VFSGSYLARL PNLDWINLLI YGVSSSSSVA
SVGGTNIVGP GQIIGTRAIM TLPGRDGFFH TLSAGLDYKH FDQTVALGAD AFSSPVTYYP
AVASYGATFQ GENYTTQFNA SVTYNLRTLS SSAADFDAKR YFASPSFTHF NADVSHTQEL
PEGLQLWGKV ASQVADGPLV SSEQISVGGM DTVRGYLESE TLGDDGVVGN FELRSPDIGA
WLQKEMKDET GQGTPRFTTF NEWRVFGFAD AGHANVQRPL PNQISSFDLW SYGVGSRFKV
FSTINGVVVL SVPMKDQAYT RAGDPRFNFR VWGEF