Gene RPD_3295 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3295 
Symbol 
ID4023805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3647834 
End bp3649009 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content66% 
IMG OID637963499 
Productpeptidase M20D, amidohydrolase 
Protein accessionYP_570420 
Protein GI91977761 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.353614 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0242454 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGACGA TCGACCGCAT CGATGCCTTC GCCGACGAAC TGACCGCGAT CCGCCGCGAT 
CTGCATATGC ATCCGGAGAT CGGCTTCGAG GAGGTCCGCA CCTCCAACAT GGTCGCCGAG
AAGCTCGACG AATGGGGCAT CGAGGTGCAT CGCGGGCTCG GCGGCACCGG CGTGGTCGGC
GTGCTCAAGG GCAAGGGCAA TAGCGGCCGC CGGATCGGCC TGCGCGCCGA CATGGACGCG
CTGCCGATGG AGGAGCACAC CAACCTGCCG TGGCGCTCGA CCATCCCCGG CCGCTTCCAC
GGCTGTGGCC ATGACGGCCA CACCACGATG TTGTTGGGAA CCGCGCGGTA TCTCGCCGAG
ACCCGGAATT TCGACGGCAC GGTGCATTTC ATCTTCCAAC CGGCCGAAGA AGGGCTCGGC
GGCGCGCGCG CGATGATCAA GGACGGGCTG TTCCAGAAAT TTCCGTGTGA CGAACTCTAC
GGCCTGCACA ACGCGCCCGA TCTGGCGCAT GGCGAGGTCG CGATCCTGCC CGGACCGGCG
ATGGCGGGCG CCGACTTTTT CGACATCACC ATCTCGGGCT ACGGCGCGCA TGGCGCGATG
CCGGAGCGCT CCAAGGACCC GGTGGTGATC GCGATGACGC TCGGCCAGGC GCTGCAGACC
ATCGTCAGCC GCAACGTCGA TCCGCTGCAC TCGGCGGTGC TGTCGATCAC CCAGATTCAT
TCCGGCTCGG CCTACAATGT GATTCCCGGC GAAGCGCGGC TCGCCGGCAC CGTCCGTGCG
TTCTCCGACG ACATCCGCAA GCTGGTGCGC GAGCGAATGC GCGCGCTCGC CGCCGGCATC
GCAGCGGCGT TCGACGTCGA GATCAGCGTC GACATCCGCG ACATCTTCAG CGTGCTGGTG
AACCAGCAGG AGCAGAGCGA CGTGGTCGCT GCGGTCGCGC GCGGCGTGGT CGGCGACGCC
AACGTCAAGC TGCGCGAGCA GCCGAAAATG GGTAGCGAGG ATTTCGCCGA CATGCTGCAG
ACGATCCCCG GCGCGTATTT CTGGATCGGC CATGACGGCG ACGTACCGGT GCACAATCCC
GGCTACGTGC TCGACGACAA AATTCTGCCG ATCGGCGCCA GCATGTTCGC GCGCATCATC
GAAACCCGGC TGCCGGCGGG CGCCTCACAT GTCTGA
 
Protein sequence
MPTIDRIDAF ADELTAIRRD LHMHPEIGFE EVRTSNMVAE KLDEWGIEVH RGLGGTGVVG 
VLKGKGNSGR RIGLRADMDA LPMEEHTNLP WRSTIPGRFH GCGHDGHTTM LLGTARYLAE
TRNFDGTVHF IFQPAEEGLG GARAMIKDGL FQKFPCDELY GLHNAPDLAH GEVAILPGPA
MAGADFFDIT ISGYGAHGAM PERSKDPVVI AMTLGQALQT IVSRNVDPLH SAVLSITQIH
SGSAYNVIPG EARLAGTVRA FSDDIRKLVR ERMRALAAGI AAAFDVEISV DIRDIFSVLV
NQQEQSDVVA AVARGVVGDA NVKLREQPKM GSEDFADMLQ TIPGAYFWIG HDGDVPVHNP
GYVLDDKILP IGASMFARII ETRLPAGASH V