Gene RPD_2447 details


Gene Information

Locus tag: RPD_2447
Symbol:
ID: 4022938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2730459
End bp: 2731838
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 69%
IMG OID: 637962640
Product: peptidase M23B
Protein accession: YP_569578
Protein GI: 91976919
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0739] Membrane proteins related to metalloendopeptidases
TIGRFAM ID:
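
The coordinate and length fields in this record can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are inclusive and that the unspliced CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2730459
end_bp = 2731838
gene_length_bp = 1380
protein_length_aa = 459

# Assumption: coordinates are inclusive, so both endpoints count.
span = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminal stop codon.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # prints: 1380 460 459
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.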


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCTACC GTTCCGGTCA CCCTTCCGAA CACCATCCCA ACCACGCCCC GAACTACGGC 
CGGCCGCAGC CGCACCGGGC GCAGCGCCCG CATCCGGGGC AAAGGCCTTC TCCTGAAGTC
CCCGCCGAGG CCTCCAGCTA CACCATCGCC CATGCCGGCA AGCAGGTCCG GATCGGGCCG
GTGGTGTTCT GGATCGTGGT CGGCACCATC GTTGCGCTGG GCTGCTGGTC GGCGGCCACC
GCCACTTACT TCGCATTCCG CGACGATGTG CTGACGAGGC TGATCGCCCG CCAGGCGGAG
ATGCAATACG CCTATGAGGA TCGCATCGCC GAGCTGCGCG CCAAGGTCGA TCGCACTACC
AGCCGGCAGT TGCTCGATCA GGAGCAATTC GACCAGAAGC TCGATCAGGT GATGCGCCGC
CAGACCATGC TGGAGTCGCG CGCCAGTGCG CTGAACACCC TGCCCGACGT CGTGGTCACC
GGCAGCATCA AGAGCTCGCG GACGCCGTCG ACCGACACTG CGCCGGCCGG GCCGCTGAAG
CCTTCGCCGA TCAACGACAC CGTGATCTTC GTCGCGCCGC CGGACCGCGA GGCGCGGCTG
GAGTCGCGTT CTCCCGCAGC CGCGCCGGCG CTGCCGACCA CGCAATACGC CAAGGCGCAG
GGCCTCGACA CCGCGCTCTC CAAGCTCGAG CAGTCGCTCG ATCAGGTCGA GAAGCGGCAG
ATCGCGGCGC TCGGCTCCGT CGAGGAATCC TACGAAACCC GCGCCCGCCG GATGCGCGGC
GTCTTCACCG ATCTCGGCCT CGACACCCGC GGGCTGGAAG CCGCCGCACC GCGCGCCGGC
ATCGGCGGTC CGTTCGTGCC GTTGAAGGCG CCGTCGACCA ATGCCAGCTC GTTCGACCGC
CAGCTCTATC GGATCAACCT CGGCCGCGCC CAGCTCGACC GCCTCAACCG GGCCCTGTCG
CTGGTGCCGT ATCGCAAGCC GGTGATCGGC AACGTCGAAT TCTCGTCCGG CTTCGGCGTC
CGCAGCGATC CGTTTCTCGG CCGCCCGGCG ATGCACACCG GCCTCGATTT CCGCGCCTCA
TCCGGCGACC CGGTCCGCGC CACCGCGATC GGCAAGGTAG TGAATGCCGG CTGGCAGGGC
GGCTACGGCC AGATGGTCGA GATCGACCAC GGCAACGGCC TGTCGACCCG CTACGGCCAC
CTGTCGAAGA TCATCGCCAA GGTCGGCCAG AGCATCCAGA TCGGCCAGGT GATCGGCGAA
GTCGGCTCGA CCGGCCGCTC CACCGGCCCG CATCTACACT ACGAAACCCG CATCGAAGGC
GAAGCCGTCG ACCCGCAGAA GTTTTTGCGT GCGGGGGTGC GGCTGGCGGG GGCGGGTTAG
 
Protein sequence
MSYRSGHPSE HHPNHAPNYG RPQPHRAQRP HPGQRPSPEV PAEASSYTIA HAGKQVRIGP 
VVFWIVVGTI VALGCWSAAT ATYFAFRDDV LTRLIARQAE MQYAYEDRIA ELRAKVDRTT
SRQLLDQEQF DQKLDQVMRR QTMLESRASA LNTLPDVVVT GSIKSSRTPS TDTAPAGPLK
PSPINDTVIF VAPPDREARL ESRSPAAAPA LPTTQYAKAQ GLDTALSKLE QSLDQVEKRQ
IAALGSVEES YETRARRMRG VFTDLGLDTR GLEAAAPRAG IGGPFVPLKA PSTNASSFDR
QLYRINLGRA QLDRLNRALS LVPYRKPVIG NVEFSSGFGV RSDPFLGRPA MHTGLDFRAS
SGDPVRATAI GKVVNAGWQG GYGQMVEIDH GNGLSTRYGH LSKIIAKVGQ SIQIGQVIGE
VGSTGRSTGP HLHYETRIEG EAVDPQKFLR AGVRLAGAG
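
The sequences above are printed in space-separated blocks of ten characters, so the spaces and line breaks need to be removed before any computation. The following is a minimal sketch of that cleanup together with a GC-fraction calculation; the helper names `clean` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean(seq: str) -> str:
    """Remove display whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


# First two ten-base blocks of the gene sequence, as displayed in the record.
first_two_blocks = "ATGTCCTACC GTTCCGGTCA"

print(len(clean(first_two_blocks)))   # prints: 20
print(gc_fraction(first_two_blocks))
```

Passing the complete gene sequence to `gc_fraction` would give a value to compare with the GC content field of the record; how that field was rounded is not stated here.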