Gene RPB_1992 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1992 
SymbolmurD 
ID3909498 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2263174 
End bp2264574 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content69% 
IMG OID637883886 
ProductUDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase 
Protein accessionYP_485611 
Protein GI86749115 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0771] UDP-N-acetylmuramoylalanine-D-glutamate ligase 
TIGRFAM ID[TIGR01087] UDP-N-acetylmuramoylalanine--D-glutamate ligase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00736654 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCCCG TCACCTCTTT CGCCGGGCAA TCCGTCGCGG TGTTCGGGCT CGGCGGCTCG 
GGGCTGGCGA GCTGCCACGC GCTGCGCGCC GGCGGCGCCG AAGTGATCGC CTGCGACGAC
AATCTCGACC GCATGGTCGA AGCGGCGCAG GCCAATTTCA TCACCGCCGA TCTGCGCAAT
CTGCCGTGGA TGAATTTTGC CGCGCTGGTG CTCACGCCGG GCGTGCCGCT GACGCATCCG
ACGCCGCATT GGAGCGTGCT CAAGGCGCGC GAGGCGGGCG TCGAGGTGAT CGGTGACGTC
GAGCTGTTCT GCCGCGAGCG GCGGCTGCAC GCGCCGAACG CGCCGTTCGT CGCCATCACC
GGCACCAACG GCAAGTCCAC CACCACGGCG CTGATCGCGC ATCTGATGCG GCAGGCCGGC
TACGACACCC AGATGGGCGG CAATATCGGC ACCGCGATCC TGTCGCTGGA GCCGCCGCGC
GCCGGCCGCG TCCACGTGAT CGAGATGTCG TCCTACCAGA TCGATCTGAC ACCGTCGCTC
GATCCGAGCG TCGGCATCCT GCTCAATGTC ACCGAGGACC ACATCGATCG CCACGGCACC
ATCGAGCACT ATGCCGCGGT GAAGGAGCGG CTGGTTGCCG GCGTGCAGGA CGGCGGCACC
GCGATCATCG GCGTCGACGA CGGCTTCGGC CGCGACGCCG CCGACCGGCT GGAGCGCGCC
GGCAAGCGCG TGGTGCGGAT TTCGGTGAAG CAGCCGCTCG CCTCGGGCAT CACCGCGGAT
CGCGAGACGA TCGTGCAAGC CGACGGCGGC GCATCGCATG AAGTCGCGAA GCTCGACGGC
ATCGGTTCGC TGCGCGGTTT GCACAACGCG CAGAACGCCG CGGCGGCCGC CGCCGCAGCG
CTGGCGCTCG GCGTCGGCCC GGACGTGCTG CAGAACGGCC TGCGCAGCTT CCCGGGCCTC
GCGCACCGGA TGGAGCAGGT CGGACGCCAA GGCACGACGC TGTTCGTCAA CGACTCCAAG
GGCACCAATG CCGACGCGAC CGCGAAAGCG CTGTCGTCGT TCGGCGAGAT CTTCTGGATC
GCCGGCGGCA AGCCGAAGAC CGGCGGCATC GACAGCCTCG CCGAATACTT CCCGCGCATC
CGCAAGGCCT ATCTGATCGG CGAGGCGGCG CAGGAATTCG CCGCGACGCT GGAAGGGCGT
GTGCCCTACG AGATCAGCGT GACGCTGGAC AACGCGGTGC CGGCCGCCGC ACGCGACGCC
GCATCGTCGG GGCTGCCGGA GCCGGTCGTG CTGCTGTCGC CGGCCTGCGC CTCGTTCGAC
CAGTTCAGGA ATTTCGAAAT CCGCGGGACG AAGTTCCGCG ATCTGGTGAC GGCGCTGGAT
GGGGTGAAGC CGGTGGCCTA G
 
Protein sequence
MIPVTSFAGQ SVAVFGLGGS GLASCHALRA GGAEVIACDD NLDRMVEAAQ ANFITADLRN 
LPWMNFAALV LTPGVPLTHP TPHWSVLKAR EAGVEVIGDV ELFCRERRLH APNAPFVAIT
GTNGKSTTTA LIAHLMRQAG YDTQMGGNIG TAILSLEPPR AGRVHVIEMS SYQIDLTPSL
DPSVGILLNV TEDHIDRHGT IEHYAAVKER LVAGVQDGGT AIIGVDDGFG RDAADRLERA
GKRVVRISVK QPLASGITAD RETIVQADGG ASHEVAKLDG IGSLRGLHNA QNAAAAAAAA
LALGVGPDVL QNGLRSFPGL AHRMEQVGRQ GTTLFVNDSK GTNADATAKA LSSFGEIFWI
AGGKPKTGGI DSLAEYFPRI RKAYLIGEAA QEFAATLEGR VPYEISVTLD NAVPAAARDA
ASSGLPEPVV LLSPACASFD QFRNFEIRGT KFRDLVTALD GVKPVA