Gene RPB_4525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4525 
Symbol 
ID3912342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp5113782 
End bp5114693 
Gene Length912 bp 
Protein Length303 aa 
Translation table11 
GC content69% 
IMG OID637886429 
Productdihydrodipicolinate synthase 
Protein accessionYP_488119 
Protein GI86751623 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase 
TIGRFAM ID[TIGR00674] dihydrodipicolinate synthase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGACCG AATCCGAGCC GCCCAGCGGG CTGTGGCTGC CGCTGATCAC GCCGTTTTGC 
GACGGCGCGC TCGACGCAGC CTCATTGCGC CGGCTGATCG CCCACTATGC GCGGCAGCCG
ATCGACCGGC TGATCCTCGC CGCCACCACC GGCGAAGGGC TGACGCTCGA CGACGACGAA
ATCGAGCAAT TGGTGATGCT GACGGCCGAG GCGCTGGCCG AGGCCGGCCG CGCGCTGCCG
GTGTGGCTCG GCCTGTGCGG CAGCGATACG CGCAAACTGG TGCGGACGCT GGCGCGGGTC
GCGAACTGGC CGGTCGACGG CTATCTGATC GCCTGCCCGT CCTACTCGCG GCCGTCGCAA
CAAGGGCTGG TGCTGCATTT CGAGGCGCTG TCGGACGCCG CGACGCGGCC GATCATGATC
TACAACATTC CCTATCGGAC CGGCGTCAAT CTCGGCAACG CGGCGATGCT GCATCTCGCC
GCGCGCGACA ACATCGTCGG CGTCAAGGAT TGCTGCGCCG ATACGCGGCA GACCGTCGAA
CTGCTGCACG ACAGGCCGCG CGGCTTCGCC GTGCTGAGCG GCGAGGACGC GCTGTATTTC
GGCGCGCTGG TGCGCGGCTG CGACGGCGCG GTGCTGGCCT CGGCGCATAT CGAGACCGAG
GCCTTCGCCA CGGTGCGGCA GCGCATCCTG GCCGACGACC GGCTCGGCGC CTCGATCGCA
TGGCAGCGGC TCGCCGAAGT CCCGAAGCTG CTGTTCGCCG AGCCATCGCC TGCGGCGCTG
AAATACGCGC TGTGGCGGCT GGGGCTGATC GACAGCCCGG AATTGCGGCT GCCGCTGACC
GGCATCAGCG ACGACCTCGC CAAAGCCCTC GACGCCCGGC TTTTGGCCGC TAAATCCGGT
ACTTACGCGT GA
 
Protein sequence
METESEPPSG LWLPLITPFC DGALDAASLR RLIAHYARQP IDRLILAATT GEGLTLDDDE 
IEQLVMLTAE ALAEAGRALP VWLGLCGSDT RKLVRTLARV ANWPVDGYLI ACPSYSRPSQ
QGLVLHFEAL SDAATRPIMI YNIPYRTGVN LGNAAMLHLA ARDNIVGVKD CCADTRQTVE
LLHDRPRGFA VLSGEDALYF GALVRGCDGA VLASAHIETE AFATVRQRIL ADDRLGASIA
WQRLAEVPKL LFAEPSPAAL KYALWRLGLI DSPELRLPLT GISDDLAKAL DARLLAAKSG
TYA