Gene RPC_3118 details


Gene Information

Locus tag: RPC_3118
Symbol:
ID: 3972974
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 3459392
End bp: 3460960
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 70%
IMG OID: 637926227
Product: peptidase S10, serine carboxypeptidase
Protein accession: YP_532979
Protein GI: 90424609
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2939] Carboxypeptidase C (cathepsin A)
TIGRFAM ID:
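
The coordinate and length fields in this record are mutually consistent, which can be checked with a minimal sketch. It assumes 1-based inclusive coordinates and a CDS that ends in a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(3459392, 3460960)
print(length_bp)                  # 1569
print(protein_length(length_bp))  # 522
```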


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.226388
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGATGC GGCTGTTGGC GTGGCGTCAT GTTCGGATCG CCCTGGCGCT GTCGATGTTG 
TCGTGGCCGC TCGCGGCCGC GGCGCAGGAG GCAACGTCGG CGGCTCCGCC CGGCGCGCAG
AAGACCGGCG CGCCGTCGCA GAGCGGCTCA TCGCCGGCAT CCGCCGCCGA TCAGCACCGC
CTGCCGCCGG ACTCCATCAC GCAGCACAAG CTGAGCCTCG CCGGCCGCAC GCTCAGCTTC
AGCGCCACCG CGGGATCGAT CCGGCTGTTC GACGACAAGG GCGAGCCGCA GGCCGATATC
GCTTACACTT CCTATCAACG CGACGGCGGC GAGCCGGGTA GCCGCCCGGT GACCTTCCTG
TTCAACGGCG GCCCGGGGGC GTCGTCGGCC TGGCTGCAAT TCGGCGCCGC CGGGCCGTGG
CGGCTGACGT TCGACGCCGA GGGCCCGAGC GCGTCGGCGA CACCGGAGCT GCAGCCCAAC
GCCGAGACCT GGCTCGACTT CACCGATTTG GTGTTCATCG ATCCGGTCGG CACCGGCTAC
AGCCGCTTCG TCGCCAGCGG CGAGGCGGTG CGCAAACGGT TCTATTCGGT CGACGGCGAC
GTCGCTTCGA TCGCGCTGGT GATCCGCCGC TGGCTGGAGA AATCCGACCG GCTGCTGTCG
CCGAAATTCG TCGCCGGCGA AAGCTATGGC GGGATTCGCG GGCCGAAGAT CGTGCACGAT
CTGCAGACCG AGCAGGGCAT CGGCGTGAAG GGCCTGATCC TGGTGTCGCC GCTGTTCGAC
TTCCGCGACT ATTCCGGCTC CAGCCTTCTG CAATACGTCG CCAGGCTGCC CAGCATGGCG
GCGACCGCGC GGCAACTCAA AGCGCCGGTC GGCCGCGCCG ACGTCGCCGA CGTCGAGGCC
TATGCGGGCG GCGATTTCCT GCGCGATCTG CTCAAAGGCC AGGCCGACGC CGAGGCGACC
AGCCGGTTGG CCGACCGGGT CGCGGCGCTG ACCGGGATCG ATCCCTCGGT CAGCCGGCGG
TTGGCCGGGC GGTTCGACGT CTCCGAGTTT CGCCGCGAGT TCGATCGCCG CAACGGTCGG
GTGACCGGCC GCTACGATGC GTCGGTGACG GGCTTCGATC CCTATCCGGA CTCCAACGCG
GCGCGTTTCG ACGATCCGTC GCTGGAGCCG TTGCTGGCGC CGCTCACCAG CGCGGCGATC
GATCACACCG CGCGGCGGCT GAACTGGCGG CCGGACGGCT CCTATCGGCT GCTCAATGGC
GCGGTGGCGG GGGCGTGGGA TTTCGGCCGC GGTCGCCACC CGCCGGAGTC GGTGTCGCAG
CTGCGCCAAG TGCTGGCGCT CGATCCGACG TTCAAGCTGT TGGTGGCGCA CGGCCTGTTC
GATCTGGCGA CGCCGTATTT CGCATCGAAG ATCATCCTCG ATCAGCTACC GGCCTATGCC
TCGACGGATC GCGTCCAGCT CGCGGTCTAT CCCGGCGGCC ACATGTTCTA CTGGCGCGAT
GCGTCGCGCC AGGCGCTGCG CGCCGAAGTC GCGGCGATGA TCCAGGACGG CCGCGCGGTC
AGCCGTTGA
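
The GC content reported for this gene can be recomputed from the sequence. The following is a minimal sketch, assuming the sequence is pasted as text with the 10-base blocks and line breaks shown here; `gc_content` is a hypothetical helper name, and the example call uses only the first two blocks of the gene sequence.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First two 10-base blocks of the gene sequence: 12 of 20 bases are G or C.
print(gc_content("ATGGTGATGC GGCTGTTGGC"))  # 0.6
```

Passing the full gene sequence to the same function gives the fraction for the whole CDS, which can be compared with the rounded percentage in the record.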
 
Protein sequence
MVMRLLAWRH VRIALALSML SWPLAAAAQE ATSAAPPGAQ KTGAPSQSGS SPASAADQHR 
LPPDSITQHK LSLAGRTLSF SATAGSIRLF DDKGEPQADI AYTSYQRDGG EPGSRPVTFL
FNGGPGASSA WLQFGAAGPW RLTFDAEGPS ASATPELQPN AETWLDFTDL VFIDPVGTGY
SRFVASGEAV RKRFYSVDGD VASIALVIRR WLEKSDRLLS PKFVAGESYG GIRGPKIVHD
LQTEQGIGVK GLILVSPLFD FRDYSGSSLL QYVARLPSMA ATARQLKAPV GRADVADVEA
YAGGDFLRDL LKGQADAEAT SRLADRVAAL TGIDPSVSRR LAGRFDVSEF RREFDRRNGR
VTGRYDASVT GFDPYPDSNA ARFDDPSLEP LLAPLTSAAI DHTARRLNWR PDGSYRLLNG
AVAGAWDFGR GRHPPESVSQ LRQVLALDPT FKLLVAHGLF DLATPYFASK IILDQLPAYA
STDRVQLAVY PGGHMFYWRD ASRQALRAEV AAMIQDGRAV SR
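
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a plain-Python illustration, not the database's own method. It assumes the gene sequence is given in the coding (5' to 3') orientation and starts in frame; it uses the standard codon assignments, which translation table 11 shares for elongation codons. Table 11 also allows alternative start codons that are read as M at the first position, which this sketch does not handle; that does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cleaned = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGGTGATGC GGCTGTTGGC GTGGCGTCAT"))  # MVMRLLAWRH
```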