Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3118 |
Symbol | |
ID | 3972974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 3459392 |
End bp | 3460960 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637926227 |
Product | peptidase S10, serine carboxypeptidase |
Protein accession | YP_532979 |
Protein GI | 90424609 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.226388 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGATGC GGCTGTTGGC GTGGCGTCAT GTTCGGATCG CCCTGGCGCT GTCGATGTTG TCGTGGCCGC TCGCGGCCGC GGCGCAGGAG GCAACGTCGG CGGCTCCGCC CGGCGCGCAG AAGACCGGCG CGCCGTCGCA GAGCGGCTCA TCGCCGGCAT CCGCCGCCGA TCAGCACCGC CTGCCGCCGG ACTCCATCAC GCAGCACAAG CTGAGCCTCG CCGGCCGCAC GCTCAGCTTC AGCGCCACCG CGGGATCGAT CCGGCTGTTC GACGACAAGG GCGAGCCGCA GGCCGATATC GCTTACACTT CCTATCAACG CGACGGCGGC GAGCCGGGTA GCCGCCCGGT GACCTTCCTG TTCAACGGCG GCCCGGGGGC GTCGTCGGCC TGGCTGCAAT TCGGCGCCGC CGGGCCGTGG CGGCTGACGT TCGACGCCGA GGGCCCGAGC GCGTCGGCGA CACCGGAGCT GCAGCCCAAC GCCGAGACCT GGCTCGACTT CACCGATTTG GTGTTCATCG ATCCGGTCGG CACCGGCTAC AGCCGCTTCG TCGCCAGCGG CGAGGCGGTG CGCAAACGGT TCTATTCGGT CGACGGCGAC GTCGCTTCGA TCGCGCTGGT GATCCGCCGC TGGCTGGAGA AATCCGACCG GCTGCTGTCG CCGAAATTCG TCGCCGGCGA AAGCTATGGC GGGATTCGCG GGCCGAAGAT CGTGCACGAT CTGCAGACCG AGCAGGGCAT CGGCGTGAAG GGCCTGATCC TGGTGTCGCC GCTGTTCGAC TTCCGCGACT ATTCCGGCTC CAGCCTTCTG CAATACGTCG CCAGGCTGCC CAGCATGGCG GCGACCGCGC GGCAACTCAA AGCGCCGGTC GGCCGCGCCG ACGTCGCCGA CGTCGAGGCC TATGCGGGCG GCGATTTCCT GCGCGATCTG CTCAAAGGCC AGGCCGACGC CGAGGCGACC AGCCGGTTGG CCGACCGGGT CGCGGCGCTG ACCGGGATCG ATCCCTCGGT CAGCCGGCGG TTGGCCGGGC GGTTCGACGT CTCCGAGTTT CGCCGCGAGT TCGATCGCCG CAACGGTCGG GTGACCGGCC GCTACGATGC GTCGGTGACG GGCTTCGATC CCTATCCGGA CTCCAACGCG GCGCGTTTCG ACGATCCGTC GCTGGAGCCG TTGCTGGCGC CGCTCACCAG CGCGGCGATC GATCACACCG CGCGGCGGCT GAACTGGCGG CCGGACGGCT CCTATCGGCT GCTCAATGGC GCGGTGGCGG GGGCGTGGGA TTTCGGCCGC GGTCGCCACC CGCCGGAGTC GGTGTCGCAG CTGCGCCAAG TGCTGGCGCT CGATCCGACG TTCAAGCTGT TGGTGGCGCA CGGCCTGTTC GATCTGGCGA CGCCGTATTT CGCATCGAAG ATCATCCTCG ATCAGCTACC GGCCTATGCC TCGACGGATC GCGTCCAGCT CGCGGTCTAT CCCGGCGGCC ACATGTTCTA CTGGCGCGAT GCGTCGCGCC AGGCGCTGCG CGCCGAAGTC GCGGCGATGA TCCAGGACGG CCGCGCGGTC AGCCGTTGA
|
Protein sequence | MVMRLLAWRH VRIALALSML SWPLAAAAQE ATSAAPPGAQ KTGAPSQSGS SPASAADQHR LPPDSITQHK LSLAGRTLSF SATAGSIRLF DDKGEPQADI AYTSYQRDGG EPGSRPVTFL FNGGPGASSA WLQFGAAGPW RLTFDAEGPS ASATPELQPN AETWLDFTDL VFIDPVGTGY SRFVASGEAV RKRFYSVDGD VASIALVIRR WLEKSDRLLS PKFVAGESYG GIRGPKIVHD LQTEQGIGVK GLILVSPLFD FRDYSGSSLL QYVARLPSMA ATARQLKAPV GRADVADVEA YAGGDFLRDL LKGQADAEAT SRLADRVAAL TGIDPSVSRR LAGRFDVSEF RREFDRRNGR VTGRYDASVT GFDPYPDSNA ARFDDPSLEP LLAPLTSAAI DHTARRLNWR PDGSYRLLNG AVAGAWDFGR GRHPPESVSQ LRQVLALDPT FKLLVAHGLF DLATPYFASK IILDQLPAYA STDRVQLAVY PGGHMFYWRD ASRQALRAEV AAMIQDGRAV SR
|
| |