Gene RPD_3398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3398 
Symbol 
ID4023910 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3776281 
End bp3777720 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content67% 
IMG OID637963603 
ProductUDP-N-acetylmuramoylalanyl-D-glutamyl-2, 6-diaminopimelate--D-alanyl-D-alanine ligase 
Protein accessionYP_570523 
Protein GI91977864 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0770] UDP-N-acetylmuramyl pentapeptide synthase 
TIGRFAM ID[TIGR01143] UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-alanine ligase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.146335 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.470298 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAAC CACCGCTGTG GACTTCGGAT GCGATGGCGG AGGCGATGCG CGCCGCCGCG 
AGCGGCTCGC TGCCGCGCGA CGTGTTCGGC ATTTCGATCG ACAGCCGCAC GCTGGCGCCG
GGCGATGCGT ATTTCGCGAT CAAGGGTGAC GTCCATGATG GTCACGATTT CGTCGCCGCC
GCGCTGAATG CCGGCGCGGC GCTGGCGGTG GTGGAAAGGG CGCAGCGCGC CAAGTTCGCC
GCCGACGCGC CGCTGCTGGT GGTCGACGAC GTACTCGACG GATTGCGCGA GCTCGGCCGC
GCGGCGCGGG CGCGGCTCGA CGCCAAGATC ATCGCGGTGA CCGGCTCGGT CGGCAAGACC
TCGACCAAGG AAGCGCTGCG CGGCGTGCTC GGGGCGCAAG GCGAGACCCA CGCTTCGGTG
GCGTCGTTCA ACAATCACTG GGGCGTGCCG CTGTCGTTGG CGCGCTGTCC GGCGGACGTG
CGCTACGCGG TGTTCGAAAT CGGCATGAAC CACGCGGGTG AAATCGAACC GCTGGTGAAG
ATGGTGCGAC CACATCTGGT GATCATCACC ACGGTGGAGC CGGTGCACCT CGAGTTCTTC
TCCGGCATCG AAGCGATCGC CGACGCCAAG GCGGAAATCT TCGCTGGGCT GGTGCCCGGC
GGCACGGCGG TTCTCAATCG CGACAATGCG ATGTTCAAGC GGCTCACCGA CAGCGCCCGC
AAGGCCGGCG TCGGTCGCGT CGTATCGTTC GGTGCCGATG TCGAGGCCGA TGCGCGGCTG
CTCGACGTCG CGTTGCACGC CGATTGCTCC GCGGTGCATG CGACGATTTT CGGCCGCGAC
GTCACTTACA AGCTCGGGAT TCCCGGGCGG CATATCGCGA TGAATTCTCT GGCCGTATTG
GCCGCGGCGG AAACTGTCGG CGCTGATCTC GCGCTGGCGG CGCTGGCGCT GTCGCACGTC
CAGCCCGCCG CCGGCCGTGG CGTCCGCCGC GCGCTCGAAT TCGGGCAAAG CGAGGCCACG
CTGATCGACG AGAGCTATAA TGCCAATCCG GCGTCGATGG TGGCGGCGCT GGGCGTGCTC
GGCCAGGTCC CGGTCGGTCC GCAGGGTCGG CGGATCGTTG TGCTCGGCGA CATGCTGGAA
CTCGGCCCGG CCGGGCCGGA GTTGCATCGC GACCTCGCCG AGTCGGTGCG GAATAACGCA
ATCGATCTGG TGTTCTGCTG CGGTCCGCTG ATGCGCAATT TGTGGGACGC CCTTTCCTCA
GGGAAGCGAG GGGGCTATGC AGAGACCGCG GCCGCGCTCG AATCTCAGGT GGTTGCGGCG
ATCCGTGCCG GCGACGTGCT GATGATCAAA GGCTCGCTCG GCTCGCGCAT GAAAACGATT
GTCACCGCGC TCGAGAAGCG CTTTCCCGGC AAGACCGCGC GCGATGACGC TGCGGTGTAA
 
Protein sequence
MSKPPLWTSD AMAEAMRAAA SGSLPRDVFG ISIDSRTLAP GDAYFAIKGD VHDGHDFVAA 
ALNAGAALAV VERAQRAKFA ADAPLLVVDD VLDGLRELGR AARARLDAKI IAVTGSVGKT
STKEALRGVL GAQGETHASV ASFNNHWGVP LSLARCPADV RYAVFEIGMN HAGEIEPLVK
MVRPHLVIIT TVEPVHLEFF SGIEAIADAK AEIFAGLVPG GTAVLNRDNA MFKRLTDSAR
KAGVGRVVSF GADVEADARL LDVALHADCS AVHATIFGRD VTYKLGIPGR HIAMNSLAVL
AAAETVGADL ALAALALSHV QPAAGRGVRR ALEFGQSEAT LIDESYNANP ASMVAALGVL
GQVPVGPQGR RIVVLGDMLE LGPAGPELHR DLAESVRNNA IDLVFCCGPL MRNLWDALSS
GKRGGYAETA AALESQVVAA IRAGDVLMIK GSLGSRMKTI VTALEKRFPG KTARDDAAV