Gene Rpal_4052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4052 
Symbol 
ID6411735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4348916 
End bp4350355 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content68% 
IMG OID642713934 
ProductUDP-N-acetylmuramoylalanyl-D-glutamyl-2, 6-diaminopimelate--D-alanyl-D-alanine ligase 
Protein accessionYP_001993023 
Protein GI192292418 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0770] UDP-N-acetylmuramyl pentapeptide synthase 
TIGRFAM ID[TIGR01143] UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-alanine ligase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.11633 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAAC AACCGCTTTG GACCTCCGAC GCAATGGCGG AGGCGATGGC TGCCACGCGC 
AGCGGCACGC TGCCGCGCGA TGTATATGGG ATTTCGATCG ACAGCCGCAC GTTGGCACCG
GGCGATGCTT ACTTCGCCAT CAAGGGCGAT GTTCATGACG GCCATGACTT CGTCGCCGCG
GCGCTGAACG CCGGCGCCGC GCTGGCGGTG GTGGAGAAGG CGCAGCGCGC CAAGTTCGCT
CCCGATGCGC CGCTGCTCGT CGTCGATGAC GTGCTCGAAG GACTACGCCA GCTCGGCATC
GCGGCGCGCT CGCGGCTGCC CGCCAAAGTG ATCGCGGTGA CCGGCTCGGT CGGCAAGACC
TCGACCAAGG AAGGTCTGCG CGGCGTGCTC GGCGCGCAGG GCGCGACCCA CGCCTCGGTG
GCGTCGTTCA ACAATCACTG GGGCGTGCCG CTGTCGCTGG CGCGCTGTCC GGTGGACTCG
CGGTTTGCGG TGTTCGAGAT CGGCATGAAC CACGCCGGCG AGATCGAGCC GCTGGTGAAG
ATGGTGCGGC CGCACATTGC GATCATCACC ACGGTCGAAG CCGTGCATCT CGAGTTCTTC
TCCGGCATCG AGGGCATCGC CGATGCCAAG TCGGAGATCT TCACCGGGCT CGAGCCGGGC
GGCATCGCCG TGCTGAACCG TGATACGCCG ATGTTCGACC GGCTGTGCAG CAATGCGTTG
CGCGCCAATG TCGGTCGCAT CGTCACCTTC GGTGCCGATC CCGCCGCCGA TGCGCGGCTG
CTCGATGTCG CGCTGCATGC CGACTGCTCG GCCGTGCATG CCAGCATTCT CGGCCACGAC
GTCACCTACA AGCTCGGCAT GCCGGGCCGG CACATGGCGC TGAATTCGCT GGCGGTGCTG
GCCGCTGCGG AGCTTGCCGG CGCCGACCTC GCGCTCGCCG CGCTGGCGCT GTCGCAGGTC
GCACCCGCCG CCGGCCGCGG CGTCCGCAAG CCGTTGCCTG TCGGCTCCGG CGAGGCGACG
CTGATCGACG AGAGCTACAA CGCCAATCCG GCCTCGATGG CCGCGGCGCT TGGCGTGCTC
GGCCGCGCCG AAATCAGCGG GCAGGGGCGG CGGATCGCCG TGCTGGGCGA TATGCTCGAA
CTCGGCCCGC GCGGCCCGGA GCTGCACCGG GGCCTGGAAG AGGCGGTGCG GGCCAATGGC
ATCGACCTGG TGTTCTGCTG CGGCCCGTTG ATGCGCAATT TGTGGGACGC CCTTTCCTCC
GGCAAACGAG GGGGCTATGC AGGCGACGCG GCCGCGCTCG AATCCCAAGT CGTCGCCGCA
ATCCGAGCCG GCGACGTCGT GATGGTGAAG GGGTCGCTCG GTTCGCGCAT GAAAACCATT
GTCACCGCGC TCGAGAAGCG CTTCCCCGGC ACGACCGCAC GCGACGACGC TGCGGTGTAA
 
Protein sequence
MSKQPLWTSD AMAEAMAATR SGTLPRDVYG ISIDSRTLAP GDAYFAIKGD VHDGHDFVAA 
ALNAGAALAV VEKAQRAKFA PDAPLLVVDD VLEGLRQLGI AARSRLPAKV IAVTGSVGKT
STKEGLRGVL GAQGATHASV ASFNNHWGVP LSLARCPVDS RFAVFEIGMN HAGEIEPLVK
MVRPHIAIIT TVEAVHLEFF SGIEGIADAK SEIFTGLEPG GIAVLNRDTP MFDRLCSNAL
RANVGRIVTF GADPAADARL LDVALHADCS AVHASILGHD VTYKLGMPGR HMALNSLAVL
AAAELAGADL ALAALALSQV APAAGRGVRK PLPVGSGEAT LIDESYNANP ASMAAALGVL
GRAEISGQGR RIAVLGDMLE LGPRGPELHR GLEEAVRANG IDLVFCCGPL MRNLWDALSS
GKRGGYAGDA AALESQVVAA IRAGDVVMVK GSLGSRMKTI VTALEKRFPG TTARDDAAV