Gene RPB_3008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3008 
Symbol 
ID3910807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3427721 
End bp3429031 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content65% 
IMG OID637884914 
ProductN-acetylmuramoyl-L-alanine amidase 
Protein accessionYP_486621 
Protein GI86750125 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0860] N-acetylmuramoyl-L-alanine amidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.225785 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCAGGGGC GCGCAAATCA TTTTGTTTTG TTCAGCGCCG TCTTCGCGTG CGCTGCAGCA 
TGGATGTGCG CGTCGCCGAC CCAGGGGTGG GCCGTGGAAG CCCTGCCTGC CCCCTCGACC
CCAACCCAAA GCACTGCAGC GCCGGTCACA AACGGCTTTC CGATCGCGTC CGATGCCCGT
CTGGCGGGTG ACGAGAAGCA GACCCGGTTC ATCGTCGATT TCGACACCAA GGTTCCGATC
CGCGCGTTCG CGCTGGCGGA TCCGTATCGC ATCGTCATCG ATCTGCCGCA GATCAATTTC
CGCCTGCCGT CGGCCGCCAA TGGTGCAAGT CGTGGCCTGA TCAAGGCTTT TCGGTACGGC
CTGGTGATGC CCGGCGGCTC GCGGATCGTG CTCGAACTGG CGGGGCCGGC GAAGATCGCC
AAGGCCGATA TGCTCGACGC AGCCAATGGG CAGCCGGCGC GGCTGGTGAT CGAACTCGAT
TCGGTTGACC GCACGGCCTT CGTCGCGGCG CTTAGTGCCG AGAAGGCGCC CGAACTGCGA
CCTTCGGTCA GTATGGCGGA TGCGACCTCG TCAGTGCCAG CTGCCGATGC GGCGAAGGAC
GATCCGCGGC CGGTCGTCGT GCTCGATCCG GGACATGGCG GGATCGACAA CGGCACCCAA
TCCGCCAGTG GCATCGCCGA GAAGACGCTG GTGCTGGATT TCGCGCTGGC GTTGCGCGAC
CAGATGGAAA AGGGCGGCAA GTACCGCGTG GTGCTGACCC GCGCCGACGA CACCTTCATT
CCGCTCAACG ACCGGGTGAA GATCGCGCGC GCGCAGTCCG CCGCGCTGTT CGTGTCGATT
CATGCCGACG CGCTGCCGCG CGGCGAGGGC GATGCCCAGG GCGCCACCAT CTACACGCTG
TCCGACAGGG CCTCCGATGC CGAGGCGCAG CGGCTGGCGG ATGCCGAAAA CAGGGCCGAC
GCGATCGGCG GGGTCGATCT GACCGAAGAG CCGACCGAGG TCGCCGATAT CCTGATCGAC
CTCGCGCAGC GCGAGACCAA GACGTTCTCG AACAGCTTCG CCCGGACCTT GATGCGGGAA
ATGAAGGGCG CGACCCGGCT GCACAAGAAT CCTCTCAAAT CCGCCGGCTT CCGGGTCCTG
AAGGCGCCCG ACGTGCCGTC GGTGCTGATC GAACTCGGCT ATGTCTCCAA CAAAGGCGAC
CTCAAGCAGC TGATTTCCGA ACAGTGGCGC ACCAAGACCG TCGGCGCGGT CTCCCAGGCG
ATCGATTCGT TTTTCGCCAG GCGGTTGGTC TCGGCCGGAA AGCCGAACTG A
 
Protein sequence
MQGRANHFVL FSAVFACAAA WMCASPTQGW AVEALPAPST PTQSTAAPVT NGFPIASDAR 
LAGDEKQTRF IVDFDTKVPI RAFALADPYR IVIDLPQINF RLPSAANGAS RGLIKAFRYG
LVMPGGSRIV LELAGPAKIA KADMLDAANG QPARLVIELD SVDRTAFVAA LSAEKAPELR
PSVSMADATS SVPAADAAKD DPRPVVVLDP GHGGIDNGTQ SASGIAEKTL VLDFALALRD
QMEKGGKYRV VLTRADDTFI PLNDRVKIAR AQSAALFVSI HADALPRGEG DAQGATIYTL
SDRASDAEAQ RLADAENRAD AIGGVDLTEE PTEVADILID LAQRETKTFS NSFARTLMRE
MKGATRLHKN PLKSAGFRVL KAPDVPSVLI ELGYVSNKGD LKQLISEQWR TKTVGAVSQA
IDSFFARRLV SAGKPN