Gene RPD_3051 details


Gene Information

Locus tag: RPD_3051
Symbol:
ID: 4023554
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3395860
End bp: 3396903
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 68%
IMG OID: 637963250
Product: lytic transglycosylase, catalytic
Protein accession: YP_570178
Protein GI: 91977519
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0741] Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains)
TIGRFAM ID:
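
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function name `cds_lengths` is hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 3395860
END_BP = 3396903


def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends in a stop codon.
    """
    gene_len = end_bp - start_bp + 1      # inclusive range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1       # the stop codon encodes no residue
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(START_BP, END_BP)
print(gene_len, protein_len)
```

Under those assumptions the result agrees with the listed gene length of 1044 bp and protein length of 347 aa.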


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGACTG GATCAGTCGA CAATACGCTG CTCCGCGCAG CGCTCGTGGC CATGCTCGTG 
CTGATCGCGC CGGCCGCTGC GCGCGCCGAG GACGCGCCGC CCGCGAAGCA GGACGACGCC
ACCCAGAGCA AACCCGAACC GCAAGCGAAG CCGGACCAAG CGCCTGCGAG GCCGGATACC
TCCGCTCCTG CAGAGAAGCC CGCCGACAAG CCCACCGATC AGGCGGCCGA GAAGCCTGCG
GATCGGGACA CGCGCGAGTC GATTTGCCTG ATGATCGAGT CGGCGGCGAA GGCGCACGAT
TTGCCGCTCG AATTCTTCGC CCGTGTGATC TGGCAGGAGA GCCGGTTTCA ACCCGACGCG
GTCGGGCCGG TGACGCGCAG CGGCAAGCGT GCGCTGGGCA TCGCACAATT CATGCCGGGC
ACCGCGAGCG AGCGCAGCCT GCTCGATCCG TTCGACCCGG TGCAGGCGCT GCCGAAATCA
GCCGAGTTTC TCCGTGAGTT GCGCGGCCAG TTCGGCAATC TCGGGCTGGC GGCGGCGGCC
TATAATGCCG GTCCGCGCCG GGTGCAGGAA TGGATTGCCG GCACCGGTCC GATGCCGCAG
CAGACCCGCT CTTATGTGTA TGCGATCACC GGAACCTCGG TGGATGATTG GGCTGCGGCG
GGCCGTCAGG CGAAGCCGCC GGAGACCAGA GCGGATAGCG ACTGCCGAAC GCTGATGGCG
CTGCTGCGGC GCGCGCCGAA TCCGTTCGTG GCGCAGCTCG AGCAGCGGGT GAAGCTCGGC
GCCGACAAGC CCTGGGGCGT GCAACTCGCG GCCGGCTTCA ATCGCGATCG GGCTCTGGCG
ATGTATGCGC GGGCGATGTC GCGGTTAAGC GCGGTGATCG GCGACCAGGA TCCGAGCCTG
TCGAGTTCGG TGTTCCGCAG CCGTGGGACG CGGCCGTTCT ATCGCGTGCG GATCGGCGCC
GAGACGCGCC CCGAGGCGAA CGTGTTGTGC GACAAGATCC GCCGCGCCGG CAGCGCCTGT
CTGGTGTTGC GCAATCGCGG CTGA
 
Protein sequence
MTTGSVDNTL LRAALVAMLV LIAPAAARAE DAPPAKQDDA TQSKPEPQAK PDQAPARPDT 
SAPAEKPADK PTDQAAEKPA DRDTRESICL MIESAAKAHD LPLEFFARVI WQESRFQPDA
VGPVTRSGKR ALGIAQFMPG TASERSLLDP FDPVQALPKS AEFLRELRGQ FGNLGLAAAA
YNAGPRRVQE WIAGTGPMPQ QTRSYVYAIT GTSVDDWAAA GRQAKPPETR ADSDCRTLMA
LLRRAPNPFV AQLEQRVKLG ADKPWGVQLA AGFNRDRALA MYARAMSRLS AVIGDQDPSL
SSSVFRSRGT RPFYRVRIGA ETRPEANVLC DKIRRAGSAC LVLRNRG
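
The protein sequence follows from the gene sequence under translation table 11, where an alternative initiation codon such as GTG is read as methionine at the start of a CDS. The sketch below is illustrative only and was not used to produce this record. The names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical, and the codon table is built from the standard amino-acid string for table 11 with bases ordered T, C, A, G.

```python
from itertools import product

# Genetic code 11 (bacterial, archaeal and plant plastid), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons recognised by table 11; at position 1 they are read as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(cds):
    """Translate a CDS, treating the first codon as an initiation codon."""
    seq = "".join(cds.split()).upper()    # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":                     # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGACGACTG GATCAGTCGA CAATACGCTG"))
```

Applied to the first 30 bases, this yields the same ten residues that open the protein sequence, MTTGSVDNTL.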