Gene RPD_2981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2981 
Symbol 
ID4023484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3321148 
End bp3322542 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content64% 
IMG OID637963180 
Productethanolamine ammonia lyase large subunit 
Protein accessionYP_570108 
Protein GI91977449 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4303] Ethanolamine ammonia-lyase, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0857022 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.119389 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGCTACC GGACCGCAAT CGACCAGCAA CTATTCGCCT TCGACAGCCT GAAGCAGGTG 
ATGGCTTACG CCAGCCCCGC GCGCTCGGGT GACTATCTTG CGGGCATCGG CGCCGCCACC
GCGCAGGAGC GGATGGCGGC GCGGCATGTG CTGGCGGAGG TGCCGCTGAA GCAGTTCCTC
AACGAGGCGC TGATCCCCTA TGAAGACGAC AACATCACGC GGCTGATCAT CGACGGTCAT
GATGCAAAGG CGTTCGCGCC GGTGTCGCAC ATGACCGTCG GCGATTTCCG TAACTGGCTT
CTGTCCGAAC AGGCGACGAC GCAGGCCCTC GCCGCGCTGG CGCAGGGCCT GACTCCGGAG
ATGGTCGCGG CGGTCTCCAA GATCATGCGC AATCAGGATC TGATCGCGGT GGCGCGCAAG
GTCCGGGTCG TCACCCGCTT CCGCAACACC ATCGGGCTCG CGGGGCACCT CGCGGTCCGC
CTGCAACCCA ACCATCCCAC CGACGATCTA CGCGGCGTCG CCGCGTCGAC GCTGGACGGC
CTGTTGATGG GCTCCGGCGA CGCCGTCATC GGGCTCAATC CCGCCTCCGA CAGCCTGCCG
GTGCTCGGCG ATCTGCTGCG GATGCTGGAC GAGGTGATCC ATCGTTTCGA AATCCCGACC
CAGAGCTGTG TACTGACCCA TGTCACCAAC ACGGTGCAAC TCATCAACGA CGGCGCGCCG
GTCGATCTCG TCTTCCAGTC GATCGGCGGC ACCGAAAAAA CCAACCTGTC GTTCGGGGTG
ACGCCGGAGA TTTTGCACGA GGCGCGCGAG GCGGCGCTGT CGTTGAAACG CGGCACCGTC
GGCGACAACG TGATGTATTT CGAGACCGGG CAAGGCAGCG CGCTGTCGGC CGACGCGAAT
TTCGGCGTCG ATCAGCAGAC CTGCGAGGCG CGCGCCTACG CCTTGGCGCG GCTCTACCAG
CCGCTGCTGG TGAACACCGT GGTCGGCTTC ATCGGCCCGG AATATCTCTA TGACGGCAAA
CAGATCATCC GCGCCGGACT GGAAGATCAT TTCTGCGGCA AGCTGCTCGG CCTGCCGCTC
GGTTGCGACA TCTGCTATAC CAACCACGCC GAGGCCGATC AGGACGACAT GGATACGCTG
CTGGTGCTGC TCGGCGCCGC CGGCATCAGT TTCATCATGG GCATTCCCGG CGCCGACGAC
GTGATGCTCA ACTATCAGAG CACCTCGTTT CACGACGCGC TGTTCCTTCG CGACCTCATG
AACCTGAAAC GCGCGCCCGA ATTCGAGATG TGGCTGCAAC GTATGCAGAT CACCGACGAT
GCCGGGCGGC TGCGCCCGCC CTCGCCGAAC CCGCTGCTCG GCGGCATGGG CAAGCTGAAA
TCGCTGGTCG CATGA
 
Protein sequence
MRYRTAIDQQ LFAFDSLKQV MAYASPARSG DYLAGIGAAT AQERMAARHV LAEVPLKQFL 
NEALIPYEDD NITRLIIDGH DAKAFAPVSH MTVGDFRNWL LSEQATTQAL AALAQGLTPE
MVAAVSKIMR NQDLIAVARK VRVVTRFRNT IGLAGHLAVR LQPNHPTDDL RGVAASTLDG
LLMGSGDAVI GLNPASDSLP VLGDLLRMLD EVIHRFEIPT QSCVLTHVTN TVQLINDGAP
VDLVFQSIGG TEKTNLSFGV TPEILHEARE AALSLKRGTV GDNVMYFETG QGSALSADAN
FGVDQQTCEA RAYALARLYQ PLLVNTVVGF IGPEYLYDGK QIIRAGLEDH FCGKLLGLPL
GCDICYTNHA EADQDDMDTL LVLLGAAGIS FIMGIPGADD VMLNYQSTSF HDALFLRDLM
NLKRAPEFEM WLQRMQITDD AGRLRPPSPN PLLGGMGKLK SLVA