Gene RPD_4035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4035 
Symbol 
ID4024552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4484618 
End bp4486003 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content67% 
IMG OID637964238 
Productpeptidase M16-like 
Protein accessionYP_571155 
Protein GI91978496 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.291122 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAATGT CCATTGCTCG CCCGCGCGCC GCGCTCGCCG TTCTCGCCGC CACCCTCTGC 
CTTGCGGGTC CCGCGGCGGC GCAAAGCGTC ACCGCCGATC CGCCCGCCAC CTTCACGCTC
GGCAACGGGC TGAACGTGGT GGTGATCCCG GATCATCGCA CCCCGGTGGT GACGCAGATG
ATCTGGTACA AGGTCGGCTC CGCTGACGAG ACGCCCGGCA AGTCCGGACT CGCGCATTTC
CTCGAGCATC TGATGTTCAA GGGCACCGCC AAGCACCCGG CCGGCGAGTT CTCGCAGACG
GTGCTGAAGA TCGGCGGCAA CGAGAACGCA TTCACTTCGG TCGACTACAC CGGCTATTTC
CAGCGCGTGC CGCGCGAACA TCTCGACCGG ATGATGGAGC TCGAGGCCGA TCGGATGACC
GATCTGGTGC TGAAGGACGA GAACGTGCTG CCGGAGCGCG ACGTCGTCCT CGAAGAATAC
AACATGCGGG TCGCCAACAA TCCCGACGCG CGGCTGACCG AGCAGATCAT GGCGGCGCTG
TATCTCAACC ACCCCTATGG CCGCCCGGTG ATCGGCTGGC ACCAGGAAAT CCAGAAGCTC
GACCGCGAGG ATGCGCTGGC GTTCTATCGC CGCTTCTACG CGCCGAACAA CGCCACCCTG
GTGATTGCCG GCGACGTCGA TGCCGCGCAG ATCCGGCCGG CGATCGAGCG CACGTACGGC
GCGATCCCGC CGCAGCCGGC GATCGCGGCG CAGCGCGTGC GCCCGCAGGA GCCGACCTCC
GCCGGGCCGC GCACGGTGAC GCTGGCCGAT CCGCGGGTCG AGCAGCCGAG CGTGCGGCGC
TATTATCTGG CGCCGTCGGC GGTCACCGCC GCCAAGGGCG ACAGCCCCGC GCTCGAAGTG
CTGGCGCAGC TGATGGGTGG TGGCAGCAAC TCCTATCTCT ACCGCGCGCT GGTGATCGAC
CGTCCGCTCG CGATCAGCGT CGGCGCCAAC TATCAAGGCA CCGCGCTCGA CGACAGCCAA
TTCGTGATCG CGGCGACGCC GAGGCCGGGC GTCGAGTTCT CCGAGATCGA GAAGGGGATC
GACAACGTGA TCGCCGAACT CGTCCGCAAT CCGGTCCGCT CCGAGGACCT CGAGCGGGTG
AAGACGCAAC TGATCGCCGA GGCGATCTAT GCGCAGGACA ATCAGGTGAC GCTGGCGCGC
TGGTACGGCG CGGCGCTGAC CTCCGGTCTC AGCGTGCAGG ACATCCAGAC CTGGCCGGAT
CGCATCCGCG CCGTCACCTC GGACCAGGTC CGCGCCGTGG CGCAGCAGTT CCTCGACCGC
AACCGCTCGG TCACCGGCTA TCTGGTCAAG GGCACGTTGC CGAAGCCCGA GGAGAAGCGC
TCGTGA
 
Protein sequence
MTMSIARPRA ALAVLAATLC LAGPAAAQSV TADPPATFTL GNGLNVVVIP DHRTPVVTQM 
IWYKVGSADE TPGKSGLAHF LEHLMFKGTA KHPAGEFSQT VLKIGGNENA FTSVDYTGYF
QRVPREHLDR MMELEADRMT DLVLKDENVL PERDVVLEEY NMRVANNPDA RLTEQIMAAL
YLNHPYGRPV IGWHQEIQKL DREDALAFYR RFYAPNNATL VIAGDVDAAQ IRPAIERTYG
AIPPQPAIAA QRVRPQEPTS AGPRTVTLAD PRVEQPSVRR YYLAPSAVTA AKGDSPALEV
LAQLMGGGSN SYLYRALVID RPLAISVGAN YQGTALDDSQ FVIAATPRPG VEFSEIEKGI
DNVIAELVRN PVRSEDLERV KTQLIAEAIY AQDNQVTLAR WYGAALTSGL SVQDIQTWPD
RIRAVTSDQV RAVAQQFLDR NRSVTGYLVK GTLPKPEEKR S