Gene RPB_3003 details


Gene Information

Locus tag: RPB_3003
Symbol:
ID: 3910802
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3420507
End bp: 3421880
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 69%
IMG OID: 637884909
Product: peptidase M23B
Protein accession: YP_486616
Protein GI: 86750120
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0739] Membrane proteins related to metalloendopeptidases
TIGRFAM ID:
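
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3420507, 3421880)
print(length)                     # 1374
print(protein_length_aa(length))  # 457
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.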


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.070332
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTACC GTTCTGGCCA CCATTCCGAC CATCCGCAGC ACCACGCACC GCATCACGGC 
CGCCAAGCGC CGTACCGGGC TCGCCGCCCG CAGCCGGGGC CTCAGCCTCA GATCGCAGCG
GAGGCCTCGA GCTACACCTT CGCTCACGCC GGCCGGCAGG TGCGGATCGG GCCCGTGCTG
TTCTGGATCG TCGTCGGCAC CATCGTCGCG CTCGGCTGCT GGTCTGCCGC CACCGCCACC
TATTTCGCGT TCCGCGACGA CGTCCTGACC CGGCTGATCG CCCGCCAGGC CGAGATGCAA
TACGCCTACG AGGACCGCAT CGCCGAGCTG CGCGCCAAGG TCGACCGCAC CACCAGCCGG
CAGTTACTCG ATCAGGAGCA GTTCGACCAG AAGCTCGACC AGGTGATGCG CCGCCAGACC
ATGCTGGAAT CGCGCGCCAG CGCGATGAGC ACCCTGCCCG ACGTCGCCGT CACCGGCAGC
ATCAAGAGTT CACGGACGCC CGCACCGGAC AGCACGCCCG CCGGCCCGCT GAAGCCGTCG
CCGATCAACG ACACGGTGAT CTTCGTCGCT CCGCCGGATC GTGAAGCGCG GCTGGAATCA
CGCTCCCCCG CTGCAGCGCC GTCGCTGCCG ACGACGCAAT ACGCCAAGGC CCAGGGCCTC
GACACCGCGC TGACGAAACT CGAGCAGTCG CTCGACCAGG TCGAGAAGCG GCAGATCGCA
ACGCTCGGCT CGGTCGAGGA AAGTTTCGAG TCCCGCGCGC GCCGGATGCG CGGCGTGCTG
GCCGATCTCG GCCTCGCCGC CCGCGGTCTG GAAGCCGCGG CGCCCCGGGC CGGCGTCGGC
GGTCCGTTCG TGCCGCTGAA AGCGCCGTCT GCCAATGCCA GCGCGTTCGA CCGCCAGCTC
TATCGGATCA ATCTCAGCCG ATCGCAGCTC GACCGCCTCA ACCGCGCGCT GACGCTGGTG
CCGTATCGCA AGCCGGTGGT CGGCGAGGTC GAATTCTCCT CGGGCTTCGG CGTCCGCACC
GATCCGTTTC TCGGCCGTCC GGCCATGCAC ACCGGCCTCG ATTTCCGCGG CAACAGCGGC
GACCCGGTCC GCGCCACGGC GATCGGCAAG GTGGTCAACG CGGGCTGGCA GGGCGGCTAC
GGCCAGATGG TCGAGATCGA CCACGGCAAC GGCCTGTCGA CGCGCTACGG CCATCTGTCG
AAGATCATCG CCAAGGTCGG CCAGAGCGTC CAGATCGGCC AGATGATCGG CGAGATCGGC
TCCACCGGCC GCTCCACCGG CCCGCATCTG CACTACGAAA CGCGCATCGA CGGTGAAGCG
GTCGACCCGC AGAAGTTTCT GCGCGCGGGG GTGCGGCTGG CGGGGGCGGG TTAG
 
Protein sequence
MPYRSGHHSD HPQHHAPHHG RQAPYRARRP QPGPQPQIAA EASSYTFAHA GRQVRIGPVL 
FWIVVGTIVA LGCWSAATAT YFAFRDDVLT RLIARQAEMQ YAYEDRIAEL RAKVDRTTSR
QLLDQEQFDQ KLDQVMRRQT MLESRASAMS TLPDVAVTGS IKSSRTPAPD STPAGPLKPS
PINDTVIFVA PPDREARLES RSPAAAPSLP TTQYAKAQGL DTALTKLEQS LDQVEKRQIA
TLGSVEESFE SRARRMRGVL ADLGLAARGL EAAAPRAGVG GPFVPLKAPS ANASAFDRQL
YRINLSRSQL DRLNRALTLV PYRKPVVGEV EFSSGFGVRT DPFLGRPAMH TGLDFRGNSG
DPVRATAIGK VVNAGWQGGY GQMVEIDHGN GLSTRYGHLS KIIAKVGQSV QIGQMIGEIG
STGRSTGPHL HYETRIDGEA VDPQKFLRAG VRLAGAG
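
The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons directly. The following is a minimal sketch, not the tool used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and the helper name `translate` is hypothetical. Whitespace in the 10-base groups is stripped before translation.

```python
# Codon table built from the compact NCBI-style layout (base order T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop spaces and line breaks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence as listed in this record.
first_line = "ATGCCCTACC GTTCTGGCCA CCATTCCGAC CATCCGCAGC ACCACGCACC GCATCACGGC"
last_line = "GTCGACCCGC AGAAGTTTCT GCGCGCGGGG GTGCGGCTGG CGGGGGCGGG TTAG"

print(translate(first_line))  # MPYRSGHHSDHPQHHAPHHG
print(translate(last_line))   # VDPQKFLRAGVRLAGAG
```

Each full sequence line holds 60 bases, so every line starts in frame. The two translated fragments match the first 20 and last 17 residues of the protein sequence listed above.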