Gene RPB_2078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2078 
Symbol 
ID3909893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2361791 
End bp2363077 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content72% 
IMG OID637883970 
ProductOrn/DAP/Arg decarboxylase 2 
Protein accessionYP_485695 
Protein GI86749199 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0019] Diaminopimelate decarboxylase 
TIGRFAM ID[TIGR01048] diaminopimelate decarboxylase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.192664 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCC CGGCCGCGTT CGATCCCGAT CGGGTCGCGA CGCTCGCCGC GCGCCACGGC 
ACGCCGCTGT TCATCTACGA CGCGCCGGCG ATGCGCGCCG CCGCGGCGCG ATTGCAGGAG
GCGCTGCCGC CGGGCGCGCG ATTGTTCTTC GCGGTGAAGG CCAATCCCGC GCCCGATGTG
ATCCGGCTGT TCGCCGGCGA GGGCCTCGGC GCCGAAGTGG CGTCGGGCGG CGAATTGCGG
CTCGCGCTCG CCTGCGGCGT CGCGCCGGAT CGCATCGTGT TCTCCGGCCC GGCGAAGACC
GCGCCGGAGC TGCGCTGCGC GATCGAGGCC TGCATCTTCG CGGTGCAGGC GGAATCCGTC
GCCGAGCTCG ACACGCTGCA GGCGCTGTGC GTCGCGCGCG GCGCGACGGT GCGCGTCGCC
CTGCGCGTCA ATCTCGGGCC GGGCGGCGAA CGCCGCGGCG GCTGGGGCGG GCCTTCGCCG
TTCGGCATGG ATACCGACGC GCTGGACGAG GTCACAGCGC GCGCCGCGCG GCTCGATCGT
CTGCGCATTG TCGGCCTGCA CAATCACCAG GCGTCGCAGA CGCTCGATCC GGCGAAGCTG
ATCGCGCGGT TCGACGCCTT TGCGCGCGTG GCGGCGTCGC TCGGGTCGCG CTTCGATCTG
CAGTTCGTCA ATTTCGGCGG CGGCTTCGGT GCGCCGTTCT ACGCCGACGA CGCGCCGCTC
GATCTCGCGC CGGTCCGCGC GTGTTTCGCC GCGCTCGCCG GCGTGTTCGG CGACCGGCCG
CTGCAGTTCG CCGCCGAATC CGGGCGCTAT CTCGTCGGGC CCGCGGGCTG CTACGTCGCG
CGCGTGGTCG ATGTGAAGCG GTCGTTCGGC GTGCGCTACG CGCTGCTCGA CGGCGGCATT
CATCACGTGC TCGGCCTGTC CGGAACGATG CGGTCGCTGC GCCGGCCGGT GGCGGTGGCG
CGGGTCGGCG CGCGATCGGG GGAGCCTTGC GAGCCGACCG AAATCGCCGG GCCGCTGTGC
ACGCCGATCG ATCGCCTCGC CGGCGCCGCC GAGCTGCCGT GCGATCTCGC CGCCGGCGAC
CTGCTGGCGT TCGCCAATTG CGGAGCCTAT GCCAAGCACG CGAGCCCGCT GAACTTCCTC
GGCCACGACT GGCCGGCCGA ACTGATGATC GACGGCGCGC GCGTCATCGT CCTGTCGCCG
CAAATTGCAT TCGGGCCGGC GCTGTGGCAA TCACGCGAGG TACTCCAGCG ATTTCGATGC
GATGGCAAAA TCCCCACGAG CTCCTGA
 
Protein sequence
MSAPAAFDPD RVATLAARHG TPLFIYDAPA MRAAAARLQE ALPPGARLFF AVKANPAPDV 
IRLFAGEGLG AEVASGGELR LALACGVAPD RIVFSGPAKT APELRCAIEA CIFAVQAESV
AELDTLQALC VARGATVRVA LRVNLGPGGE RRGGWGGPSP FGMDTDALDE VTARAARLDR
LRIVGLHNHQ ASQTLDPAKL IARFDAFARV AASLGSRFDL QFVNFGGGFG APFYADDAPL
DLAPVRACFA ALAGVFGDRP LQFAAESGRY LVGPAGCYVA RVVDVKRSFG VRYALLDGGI
HHVLGLSGTM RSLRRPVAVA RVGARSGEPC EPTEIAGPLC TPIDRLAGAA ELPCDLAAGD
LLAFANCGAY AKHASPLNFL GHDWPAELMI DGARVIVLSP QIAFGPALWQ SREVLQRFRC
DGKIPTSS