Gene RPD_0380 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0380 
Symbol 
ID4020846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp446518 
End bp448239 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content68% 
IMG OID637960565 
Productpeptidase S10, serine carboxypeptidase 
Protein accessionYP_567519 
Protein GI91974860 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATTCT CGCTCTCGCG GGATGACCGT GAGGGTGTGG CGCAAGCTCT CGATTGTCGC 
CAAACCGACG CAGTTCTGCG CTCAGCTCTC TTCCCTGAAC AATCAACCCG TCGACCGTGC
CGGCCGCATG CGCCGCCGTT CGTTCTGAAG GAGACATTGA TGGCGCTGTC GCCCTGGTCC
GTTTGCCGCG CCGCCATGGC TTTGATGCTC GTCACAACGG CGACGCTCGC ATCCGCGCGG
GCGCAGGACG CCGCGCCGGC GTCGCAGCAG CCGGCCGCGG CGCAGGGCGG CAAGTCCGAA
ACCGGAGGGT CGCGCGGCAA GGCCGCCGCA GCGTCGTCGG ACGCCGAGCA GCATCGCTTG
CCGGCCGACT CGGTCACCCG CCACACGCTG GCGCTGCCGG GCCGCAGCCT GTCCTTCGCC
GCCACCGCCG GCTCGATCCG GCTGTTCAAC GACAAAGCCG AGCCGCAGGC CGACATCGCT
TACACCGCCT ATCAGCTCGA CAATGCCGAG GCGCGGACGC GGCCGGTGAC CTTCCTGTTC
AACGGCGGCC CCGGCGCCTC CTCAGCCTGG CTGCAGCTCG GCGCGGCGGG GCCGTGGCGA
TTGCCGATCT TCGGCGAGGC CGCGGTCGCC TCGGCGACGC CGGCGCTGCA GCCCAACGCC
GAGACTTGGC TCGACTTCAC CGACCTCGTC TTCATCGATC CGGTCGGCAC CGGCTACAGC
CGCCTCGTCG CCAGCGGCGA CGACGTGCGC AAGCAGTTTT ATTCAGTCGA CGGCGACGTC
GACGCGATCG CGCTGACGAT CCGGCGTTGG CTCGAGAAGC ACGACCGGCT GCTGTCGCCG
AAATACGTCG GCGGCGAGAG CTATGGCGGC ATTCGCGGCC CGCGCGTGGT CCGCAATCTG
CAGACCCGCC AGGGCGTCGG CGTCAAAGGC CTGATCCTGG TGTCGCCGCT GCTCGACTTC
CGCGAATATT CCGGCTCGAG CCTTCTGCAA TATGTCGCGC GGCTGCCAAG CATGGCGGCT
GCAGCGCGGC AACAGAAGGG ACCTGTCACC CGCACCGATC TGACCGACGT CGAAGCCTAT
GCGCGCGGCG AATTCCTCGC CGATCTGATC AAGGGCGAAG CCGACCAGGC GGCGACCAAT
CGCCTCGCCG ACCGCGTCGC TACGCTGACC GGGATCGACC CCGCGGTGAG CCGCCGCCTC
GCCGGCCGGC TTGATACCAG CGAGTTCCAG CGCGAGTTCG ATCGTGCCAA TGGCAAGGTG
ACCGGTCGCT TCGACGCCTC GGTGCTCGGC TTCGATCCGT TTCCGGACTC CAGCGACGCG
CAGTTCAGCG ACCCGTCGGC GGACTCGCTG ATCGCGCCGC TGACCAGCGC CGCCGCCGAG
CTCACGCGCA ATCCGCTGCA ATGGCGTCCG GACGGCTCGT ATCACCTGCT CAACAGTTCG
GTCGCGCAGC AATGGGATTT CGGCCGCGGC CGCAACCCGG TGGAATCGCT GACCCAGCTC
CGCGAAATCC TCGCGGTCGA TCCGAAACTG CAGGTGCTGG TGACGCATGG GCTGTTCGAT
CTCGCCACGC CGTATTTCGC CAGCCAGATC GCGATCGATC AGCTGCCGCC ATTCGCATCG
AAGCGGATCA AGCTCGTCAC CTGGCCCGGC GGCCACATGA CCTACGCCCG CGACGACGCA
AGAAAAGCGC TGCGCGGCGA GGTCGGCGCG ATGATGAAGT AG
 
Protein sequence
MGFSLSRDDR EGVAQALDCR QTDAVLRSAL FPEQSTRRPC RPHAPPFVLK ETLMALSPWS 
VCRAAMALML VTTATLASAR AQDAAPASQQ PAAAQGGKSE TGGSRGKAAA ASSDAEQHRL
PADSVTRHTL ALPGRSLSFA ATAGSIRLFN DKAEPQADIA YTAYQLDNAE ARTRPVTFLF
NGGPGASSAW LQLGAAGPWR LPIFGEAAVA SATPALQPNA ETWLDFTDLV FIDPVGTGYS
RLVASGDDVR KQFYSVDGDV DAIALTIRRW LEKHDRLLSP KYVGGESYGG IRGPRVVRNL
QTRQGVGVKG LILVSPLLDF REYSGSSLLQ YVARLPSMAA AARQQKGPVT RTDLTDVEAY
ARGEFLADLI KGEADQAATN RLADRVATLT GIDPAVSRRL AGRLDTSEFQ REFDRANGKV
TGRFDASVLG FDPFPDSSDA QFSDPSADSL IAPLTSAAAE LTRNPLQWRP DGSYHLLNSS
VAQQWDFGRG RNPVESLTQL REILAVDPKL QVLVTHGLFD LATPYFASQI AIDQLPPFAS
KRIKLVTWPG GHMTYARDDA RKALRGEVGA MMK