Gene RPD_4034 details


Gene Information

Locus tag: RPD_4034
Symbol:
ID: 4024551
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4483233
End bp: 4484621
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 67%
IMG OID: 637964237
Product: peptidase M16-like
Protein accession: YP_571154
Protein GI: 91978495
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
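
The coordinate and length fields above agree with one another, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so, but this convention reproduces the listed gene length) and assuming the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(4483233, 4484621)
print(length_bp)                  # 1389
print(protein_length(length_bp))  # 462
```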


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.450059
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCATCG CACGCCAATC CATGAAGCGT TTCGCGCTCG CCGCCACCAT CGCCGTCGCT 
GCCGCATGGT CGGCGACCCC AAGTCTTGCG GCCGCGAAAA TCCAGAAGCT GGTGACGCCG
GGCGGCATCC AGGCGTGGTT CGTGCAGGAC GCGACCGTGC CGCTGATCTC GATGGAATAC
TCTTTCGGCG GCGGCGCCAC CCAGGACCCC GCCGACAAGC CCGGCGTCGG CCACATGGTG
GCGAATCTGA TCGACGAAGG CTCGGGCGAC ATGGACTCGG CGACGTTCCA CGAGCGCATG
GATCGCCGCG CCATCCAGCT GTCGTTCAAC GTGACCCGCG ATTATTTCCG CGGCTCGCTG
CGAATGCTGA AGGAAAACCG CGACGAGGCG TTTGGTCTGG TGCGCACCGC GCTGACCGCG
CCGCGGTTCG AGGGCAAGGA CGTCGAGCGA ATCCGCGCGC AATTGACTTC GACGCTGCGC
CGCCAGTCGC TCGATCCGAA CACGATGGCG ACCCGCAAAT TTCTCGAAGT CGCGTTCGGC
GATCATCCTT ATGGCCGACC CTCGACCGGC ACGCTGGAAA GCCTGCCGAA GGTCACCGTC
GACGACATGA AGGCTTATGT CGGCCGCGTG CTGGCGAAGG ATACGCTCAA CATCGCGGTG
GTCGGCGACG TCGACGCCGC GACCCTCGCC AAGCTGCTGG ACGACACCTT CGGCAGCCTG
CCGGCCAAGG CGCAGCTTGC GCCGGTGGCC GACATCGTCG CCGCCAAGCC GCCGCAGCGC
AGCTTTGTGC CGCTCGACGT GCCGCAGACC GTGGTGATGT TCGGCGGACC GGGCCTGAAG
CGCCACGATC CGGATTTCAT GGCCGCCTAT GTGGTCAACC ACATCCTCGG CGGCGGCTCG
CTGTCGTCGC GGCTGTATCG CGAGGTCCGC GAGAAGCGCG GGCTGGCTTA TTCGATCTAT
GAGTCTCTGC TGTGGATGGA GCGTTCGGCG CTGTTCACCG GCGCCACCGG CACCCGCGCC
GACCGCGCGA CGCAAACGAT CGACGCCATC GATGCCGAGG TCAAGCGGAT CGCCGACGAG
GGCCCGACCC AGCAGGAGCT CGACGAGGCA AAATCGTATC TCAAGGGCTC GCAGATGTTG
TCGCTCGACA CTTCGGCCAA GCTCGCCCAG GCGCTGCTGC AGTATCAGAA TGACGGTCTG
CCGATCGACT ATATCGACAA GCGCAACGCC ATCGTCGACG CCGTGACGCT CGACGACGCC
AGGCGAGCCG CCAAGCGGCT GTGGTCCGAT GGGCTGCTCA CCGTCGTCGT CGGCCGCACT
CCGCAGGCCG CCGCCGAACC CGGCGCCGTC AAACCCGCCG GCACGCCGCC GCCGCCAAGG
GCGAACTGA
 
Protein sequence
MSIARQSMKR FALAATIAVA AAWSATPSLA AAKIQKLVTP GGIQAWFVQD ATVPLISMEY 
SFGGGATQDP ADKPGVGHMV ANLIDEGSGD MDSATFHERM DRRAIQLSFN VTRDYFRGSL
RMLKENRDEA FGLVRTALTA PRFEGKDVER IRAQLTSTLR RQSLDPNTMA TRKFLEVAFG
DHPYGRPSTG TLESLPKVTV DDMKAYVGRV LAKDTLNIAV VGDVDAATLA KLLDDTFGSL
PAKAQLAPVA DIVAAKPPQR SFVPLDVPQT VVMFGGPGLK RHDPDFMAAY VVNHILGGGS
LSSRLYREVR EKRGLAYSIY ESLLWMERSA LFTGATGTRA DRATQTIDAI DAEVKRIADE
GPTQQELDEA KSYLKGSQML SLDTSAKLAQ ALLQYQNDGL PIDYIDKRNA IVDAVTLDDA
RRAAKRLWSD GLLTVVVGRT PQAAAEPGAV KPAGTPPPPR AN
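
The protein sequence above can be related back to the gene sequence by translating it codon by codon. The sketch below is a hand-written Python illustration, not the tool that produced this record. It assumes the standard amino acid assignments (which translation table 11 shares with the standard code) and treats a leading ATG, GTG or TTG as a start codon that is read as M; that start-codon set is a simplified subset chosen for this example. The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: simplified subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"  # an initiating GTG or TTG is read as methionine
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("GTGAGCATCG CACGCCAATC CATGAAGCGT"))  # MSIARQSMKR
print(translate("GCGAACTGA"))                         # AN
print(gc_content("GTGAGCATCG"))                       # 0.6
```

The two translated fragments match the first ten and last two residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` would give the value to compare against the GC content field.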