Gene Rpal_0966 details

Gene Information

Locus tag: Rpal_0966
Symbol:
ID: 6408620
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 1033778
End bp: 1034695
Gene Length: 918 bp
Protein Length: 305 aa
Translation table: 11
GC content: 68%
IMG OID: 642710880
Product: dihydrodipicolinate synthase
Protein accession: YP_001989999
Protein GI: 192289394
COG category: [E] Amino acid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase
TIGRFAM ID: [TIGR00674] dihydrodipicolinate synthase
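
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record, but both are consistent with the listed values. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1033778
END_BP = 1034695
GENE_LENGTH_BP = 918
PROTEIN_LENGTH_AA = 305


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, excluding one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))      # 918
print(coding_length_aa(GENE_LENGTH_BP))   # 305
```

Both results agree with the Gene Length and Protein Length fields.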


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.318668
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGACAT CCCTGGACGC GACAGCCGAA CAGCCACGCG GGCTGTGGCT GCCGCTGATC 
ACGCCGTTTC GCGACGGCGA GCTCGACGGC GCATCGCTGC GCCGGCTGAT CGCGCACTAC
GCGCGAGCTC CGCTCGACGG ACTGATCTTA GGGGCCACCA CCGGCGAAGG GCTGACGCTG
AATGAAGACG AGCTCGAACG CCTGGTGATG CTGAGCGCCG ATGCATTGGC GGCGAGCGGC
CGCAAGCTGC CGGTGTATCT CGGGCTGTCC GGCAGCGACA CGCGCAAGCT GGTGAAGACG
CTGGCGCGGA CCGCACACTG GCCGATCGAC GGCGTGCTGA TCGCCTGCCC GTACTACACC
CGCCCGTCGC AGCGGGGATT GGTGCTGCAT TTCGAAGCCG CAGCCGACGC CACCGCAAGG
CCGATCCTGA TCTACAACAT CCCGTATCGC ACCGGCGTCA ATCTACACAA CGAGGCGATG
CTGCGGCTCG CCGAGCGCGC CAACATCGTC GGCGTCAAGG ATTGCTGCGC CGATCCTGCG
CAGACAGCGG AGCTGCTGAG GTTGCGGCCG CCGGGCTTTT CGGTGCTCAC CGGCGAGGAT
GCGCTGGCAT TCGATGCGCT GAGCCGCGGC TGCGACGGCG CGATCCTGGC CTCGGCGCAT
CTGGAGACCG AGGCGTTCGC CGCGATGATG CATCGGCTGC AGGCCGGCGA CCGCCTCGGT
GGGGCGACCG AATGGCAACG GCTCGCCGAC CTGCCGAAGC TGCTGTTCGC CGAGCCATCC
CCGGCGCCGG TGAAATATGC GCTGTGGCGG CGCGGGTTGA TCGACAGCCC AGAAGTGCGG
CTGCCGATGA CACCGGTGTC GCCCACGCTC GCCGCCACCC TTGATGCGTG GATGCTGCCC
GGCCTATCCG CGGCTTGA
 
Protein sequence
METSLDATAE QPRGLWLPLI TPFRDGELDG ASLRRLIAHY ARAPLDGLIL GATTGEGLTL 
NEDELERLVM LSADALAASG RKLPVYLGLS GSDTRKLVKT LARTAHWPID GVLIACPYYT
RPSQRGLVLH FEAAADATAR PILIYNIPYR TGVNLHNEAM LRLAERANIV GVKDCCADPA
QTAELLRLRP PGFSVLTGED ALAFDALSRG CDGAILASAH LETEAFAAMM HRLQAGDRLG
GATEWQRLAD LPKLLFAEPS PAPVKYALWR RGLIDSPEVR LPMTPVSPTL AATLDAWMLP
GLSAA
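
The relationship between the gene sequence and the protein sequence can be checked with a short translation routine. This is a minimal sketch, not the database's own method: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in which start codons it permits, and that is not modeled here). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence listed in this record.
first_line = "ATGGAGACAT CCCTGGACGC GACAGCCGAA CAGCCACGCG GGCTGTGGCT GCCGCTGATC"
last_line = "GGCCTATCCG CGGCTTGA"

print(translate(first_line))  # METSLDATAEQPRGLWLPLI
print(translate(last_line))   # GLSAA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_fraction` applied to the same input can be compared against the GC content field.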