Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_0380 |
Symbol | |
ID | 4020846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 446518 |
End bp | 448239 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637960565 |
Product | peptidase S10, serine carboxypeptidase |
Protein accession | YP_567519 |
Protein GI | 91974860 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATTCT CGCTCTCGCG GGATGACCGT GAGGGTGTGG CGCAAGCTCT CGATTGTCGC CAAACCGACG CAGTTCTGCG CTCAGCTCTC TTCCCTGAAC AATCAACCCG TCGACCGTGC CGGCCGCATG CGCCGCCGTT CGTTCTGAAG GAGACATTGA TGGCGCTGTC GCCCTGGTCC GTTTGCCGCG CCGCCATGGC TTTGATGCTC GTCACAACGG CGACGCTCGC ATCCGCGCGG GCGCAGGACG CCGCGCCGGC GTCGCAGCAG CCGGCCGCGG CGCAGGGCGG CAAGTCCGAA ACCGGAGGGT CGCGCGGCAA GGCCGCCGCA GCGTCGTCGG ACGCCGAGCA GCATCGCTTG CCGGCCGACT CGGTCACCCG CCACACGCTG GCGCTGCCGG GCCGCAGCCT GTCCTTCGCC GCCACCGCCG GCTCGATCCG GCTGTTCAAC GACAAAGCCG AGCCGCAGGC CGACATCGCT TACACCGCCT ATCAGCTCGA CAATGCCGAG GCGCGGACGC GGCCGGTGAC CTTCCTGTTC AACGGCGGCC CCGGCGCCTC CTCAGCCTGG CTGCAGCTCG GCGCGGCGGG GCCGTGGCGA TTGCCGATCT TCGGCGAGGC CGCGGTCGCC TCGGCGACGC CGGCGCTGCA GCCCAACGCC GAGACTTGGC TCGACTTCAC CGACCTCGTC TTCATCGATC CGGTCGGCAC CGGCTACAGC CGCCTCGTCG CCAGCGGCGA CGACGTGCGC AAGCAGTTTT ATTCAGTCGA CGGCGACGTC GACGCGATCG CGCTGACGAT CCGGCGTTGG CTCGAGAAGC ACGACCGGCT GCTGTCGCCG AAATACGTCG GCGGCGAGAG CTATGGCGGC ATTCGCGGCC CGCGCGTGGT CCGCAATCTG CAGACCCGCC AGGGCGTCGG CGTCAAAGGC CTGATCCTGG TGTCGCCGCT GCTCGACTTC CGCGAATATT CCGGCTCGAG CCTTCTGCAA TATGTCGCGC GGCTGCCAAG CATGGCGGCT GCAGCGCGGC AACAGAAGGG ACCTGTCACC CGCACCGATC TGACCGACGT CGAAGCCTAT GCGCGCGGCG AATTCCTCGC CGATCTGATC AAGGGCGAAG CCGACCAGGC GGCGACCAAT CGCCTCGCCG ACCGCGTCGC TACGCTGACC GGGATCGACC CCGCGGTGAG CCGCCGCCTC GCCGGCCGGC TTGATACCAG CGAGTTCCAG CGCGAGTTCG ATCGTGCCAA TGGCAAGGTG ACCGGTCGCT TCGACGCCTC GGTGCTCGGC TTCGATCCGT TTCCGGACTC CAGCGACGCG CAGTTCAGCG ACCCGTCGGC GGACTCGCTG ATCGCGCCGC TGACCAGCGC CGCCGCCGAG CTCACGCGCA ATCCGCTGCA ATGGCGTCCG GACGGCTCGT ATCACCTGCT CAACAGTTCG GTCGCGCAGC AATGGGATTT CGGCCGCGGC CGCAACCCGG TGGAATCGCT GACCCAGCTC CGCGAAATCC TCGCGGTCGA TCCGAAACTG CAGGTGCTGG TGACGCATGG GCTGTTCGAT CTCGCCACGC CGTATTTCGC CAGCCAGATC GCGATCGATC AGCTGCCGCC ATTCGCATCG AAGCGGATCA AGCTCGTCAC CTGGCCCGGC GGCCACATGA CCTACGCCCG CGACGACGCA AGAAAAGCGC TGCGCGGCGA GGTCGGCGCG ATGATGAAGT AG
|
Protein sequence | MGFSLSRDDR EGVAQALDCR QTDAVLRSAL FPEQSTRRPC RPHAPPFVLK ETLMALSPWS VCRAAMALML VTTATLASAR AQDAAPASQQ PAAAQGGKSE TGGSRGKAAA ASSDAEQHRL PADSVTRHTL ALPGRSLSFA ATAGSIRLFN DKAEPQADIA YTAYQLDNAE ARTRPVTFLF NGGPGASSAW LQLGAAGPWR LPIFGEAAVA SATPALQPNA ETWLDFTDLV FIDPVGTGYS RLVASGDDVR KQFYSVDGDV DAIALTIRRW LEKHDRLLSP KYVGGESYGG IRGPRVVRNL QTRQGVGVKG LILVSPLLDF REYSGSSLLQ YVARLPSMAA AARQQKGPVT RTDLTDVEAY ARGEFLADLI KGEADQAATN RLADRVATLT GIDPAVSRRL AGRLDTSEFQ REFDRANGKV TGRFDASVLG FDPFPDSSDA QFSDPSADSL IAPLTSAAAE LTRNPLQWRP DGSYHLLNSS VAQQWDFGRG RNPVESLTQL REILAVDPKL QVLVTHGLFD LATPYFASQI AIDQLPPFAS KRIKLVTWPG GHMTYARDDA RKALRGEVGA MMK
|
| |