Gene RPB_2001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2001 
Symbol 
ID3909507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2273447 
End bp2274436 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content69% 
IMG OID637883895 
ProductD-alanine--D-alanine ligase 
Protein accessionYP_485620 
Protein GI86749124 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes 
TIGRFAM ID[TIGR01205] D-alanine--D-alanine ligase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACAGA CAATTCTCTT CGGCGGCACC AGCAAGGAGC GCCTGGTTTC GGTCGCCTCG 
GCGCAGGCGC TGCACGCCGC GCTGCCCGAT GCCGAATTGT GGTTCTGGAA CGACGACGAC
AGCGTCCACG CGGTGACCGC GGAGGCGCTG CTGGCGCATG CGCGGCCGTT CGAGGAGGCG
TTTCGGCCGG CTGGCGAAAA CATCGGTCCG CTGGAGCGCG CGCTCGATCG CGCGGCGATC
GAACAGCGGC TGCTGGTGCT GGGTCTGCAC GGCGGTGTCG CCGAGAACGG CGAGTTGCAG
GCGATGTGCG AGATGCGCGG CGTGCCGTTC ACCGGGTCGG GCGCGGCGGC GTCGCATCTC
GCCTTCGACA AGGTGGCCGC CAAGCGGTTC GCCGCGATCG CCGGCGTGCG CGCGCCGGCC
GGCATCGCGC TGGCGGAGGC CGAGGCGGCG CTGGCCGCCC ACGGCCGGCT GATCGCCAAG
CCGGCCCGCG ACGGGTCGAG CTACGGCCTG TTCTTCATCA ATGCGAAGCA GGACCTGGTC
GCGGTGCGCG ACGCGGCGAG GTCCGAGGAC TATCTGATCG AGCCGTTCGT CTCCGGCATC
GAAGCGACCT GCGGCGTGCT GGAGCAGGCC GACGGCTCGC TGCTGGCGCT GCCGCCGATC
GAGATCGTGC CGGCCGACGG CGGCTTCGAC TACACCGCGA AATATCTCGC CAAATCGACC
CAGGAGATCT GCCCCGGCCG GTTCGCGCCG CAGATCTCGG CGAGGATCAT GGAGGATGCC
GTGAAGGCGC ATCGGGTGAT GGGCTGCCGC GGCTATTCGC GCTCCGACTT CATCGTCGTC
GCCGACGGTC CGATCTTTCT CGAGACCAAT ACGCTGCCCG GACTGACCAA GGCCTCGCTC
TATCCCAAGG CGCTGCAGGC GCAGGGGATC GCCTTCGTCG ATTTCCTCCA CGGCCAGATC
GCGCTCGCCG AACGCGGCGC CCGGCGTTAA
 
Protein sequence
MRQTILFGGT SKERLVSVAS AQALHAALPD AELWFWNDDD SVHAVTAEAL LAHARPFEEA 
FRPAGENIGP LERALDRAAI EQRLLVLGLH GGVAENGELQ AMCEMRGVPF TGSGAAASHL
AFDKVAAKRF AAIAGVRAPA GIALAEAEAA LAAHGRLIAK PARDGSSYGL FFINAKQDLV
AVRDAARSED YLIEPFVSGI EATCGVLEQA DGSLLALPPI EIVPADGGFD YTAKYLAKST
QEICPGRFAP QISARIMEDA VKAHRVMGCR GYSRSDFIVV ADGPIFLETN TLPGLTKASL
YPKALQAQGI AFVDFLHGQI ALAERGARR