Gene RPD_4092 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4092 
Symbol 
ID4024614 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4553328 
End bp4555373 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content67% 
IMG OID637964300 
Productpeptidase M23B 
Protein accessionYP_571212 
Protein GI91978553 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCCAAA GGGTTTCACG GGGACGCAGC CACGGGCGTG AGATCGGCAT GATCGATCTC 
GGTCACGAGC CGCCGCTTTC TGTCGACGGC TCCGAAGCGG CGGTGATCGA TCGCCGCCGC
GTGTCGGTAC AATGGTTCAG CGGCACGATT CTGACCGGCC TGTGCGGCGC AGCCTTGATC
GGCGGCGCCG TTTTCGCGTC GCTCGACGGC GAAACCACCT TCGCCAAAGC GCCGGAGCGG
GTCGAGGCCG CGCTGCGCGG GGCGTTCGGC GCCGACAAGA ACGCCGCCCT GCATAAGAGC
GACCGCCTGC CCCCGCCGAG CGAATCGGCC GCCGCGCGCA ACGTGATTCG GGTCTCGACG
GTGTCCCGCG TCGGCAATCG CGACGTCGTC CGGGTCCGCC CCTTCATCAA GATTGCCGGC
AACCTGTCGC TGACCACCAG CGATCTGTCC GCCAAGATCC CGGCCTTCAA TGCGCAACGG
ATGCTGAGCG ACGTCGGCGC CGCCGCGCCC GCGGCAGAGG ATGCGCAGAA CCCCGACGCT
GTCGAGCCAG ACGCCGAAGT CTCGTTCGTC ACCCGCGATC TCGGCCAGGT GCTGCCCAAG
GCGAAGATCG CCGGCCAGAT CCCGCCCGAT GAAATCCTGA TGCGGGTGCG CGACGCCGCG
AACTGGAAGG GCAATAGCGG CGTGCGCTAC GCCAGCGCGA CCTCCGGCGA GATCGGCGGC
GACATGAGGC TCGCTTACGC GCCCGAAGGC GCCTCCGCCG ACCCTTATGC CGGCTTCGAA
ACCCGGATCG TGCCTGAAAA CGTCACGCTG CTGCCGAAGA CCAAGGAACA GGCGACCGGC
GGCAATCCCA CCGGCGAGCG CGTCCATCTC GTCAAGAAGG GCGACACCGT GGTGTCGGTG
CTGCGCGACC AGGGCGCCAG CGAGGAGGAC GCCAATGCGG TCGCGGCGGC GCTGGGCGCA
CGCGGACGCA GCGGCGGGGT GAAAGAAGGC CAGAAACTGC GAATCCTGAT GGAGCCCGCC
GGCGCCGGCA AGCCGTCGCA GCCGTTCCGG GTGATCATCG CCAACGAATC CACCGTCGAG
GCCGTCGCGG CGCTGTCCGA TCTCGGCCGA TACGTCGCCG TCGACGTCCA GAGCCTCAAC
ACCGTCAGCG ACACCGCCGA CAACAGCGAC GAGGACGAGG ACGACGGCAC CGGCGTTCGC
CTCTACCAGT CGATCTACGA GACCGCGCTG CGCAACAAGG TGCCGCAGGC GATCATCGAC
GACATGATCA AGATCTATTC CTACGACGTC GACTTCCAGC GCAAGGTCCA GGCCGGCGAC
TCGTTCGAGG TGTTCTACGC CGGCGACGAC GAGACCACCG CCACCATCGA GAAGAACGAC
GTGCTGTTCG CCTCGCTCAC CGTGGGCGGC GAGACCAAGA AATACTACCG CTTCCAGACC
CCCGACGACG CCGTGGTCGA TTTCTACGAC GAGAGCGGCA AGAGCGCGAA GAAGTTCCTG
GTGCGCAAGC CGGTCAACAA CGCGATCATG CGCTCGGGCT TCGGCGGCCG CCGCCATCCG
ATTCTCGGCT ACGTCAAAAT GCACACCGGC GTCGACTGGT CGACGCCCTA CGGCACGCCG
ATCTTCGCCT CCGGCAACGG CGTGATCGAA AAGGCCGGCT GGGAAGGCGG CTACGGCAAA
TATATCCGCG TCAAACACAA CAACGGCTAC GAGACCGCCT ACGGCCACAT GTCGGCGTTC
GCCAAGGGCA TGGAGCCCGG CAAGCGGGTG CGCCAGGGCC AGGTGATCGG CTTCGTCGGC
TCCACCGGCC TGTCGACCGG CGCACATGTC CACTACGAAA TCCTCGTCAA CGGCCGCTTC
GTCGACCCGA TGCGCGTCAA GCTACCGCGC GGTCGCTCGC TCGACGGCCC GCTGATGGCG
AGCTTCGAAA AAGAGCGCGA CCGGCTCGAC GGCATGTTGA CCAGCCGCGG CGGCAGCAGA
ATGTCGGAAG ACCTGACCGC AGCGGCCCCG GTGCGCAGCG TGAACGCGGC GGCGGCGCGG
AAGTAG
 
Protein sequence
MSQRVSRGRS HGREIGMIDL GHEPPLSVDG SEAAVIDRRR VSVQWFSGTI LTGLCGAALI 
GGAVFASLDG ETTFAKAPER VEAALRGAFG ADKNAALHKS DRLPPPSESA AARNVIRVST
VSRVGNRDVV RVRPFIKIAG NLSLTTSDLS AKIPAFNAQR MLSDVGAAAP AAEDAQNPDA
VEPDAEVSFV TRDLGQVLPK AKIAGQIPPD EILMRVRDAA NWKGNSGVRY ASATSGEIGG
DMRLAYAPEG ASADPYAGFE TRIVPENVTL LPKTKEQATG GNPTGERVHL VKKGDTVVSV
LRDQGASEED ANAVAAALGA RGRSGGVKEG QKLRILMEPA GAGKPSQPFR VIIANESTVE
AVAALSDLGR YVAVDVQSLN TVSDTADNSD EDEDDGTGVR LYQSIYETAL RNKVPQAIID
DMIKIYSYDV DFQRKVQAGD SFEVFYAGDD ETTATIEKND VLFASLTVGG ETKKYYRFQT
PDDAVVDFYD ESGKSAKKFL VRKPVNNAIM RSGFGGRRHP ILGYVKMHTG VDWSTPYGTP
IFASGNGVIE KAGWEGGYGK YIRVKHNNGY ETAYGHMSAF AKGMEPGKRV RQGQVIGFVG
STGLSTGAHV HYEILVNGRF VDPMRVKLPR GRSLDGPLMA SFEKERDRLD GMLTSRGGSR
MSEDLTAAAP VRSVNAAAAR K