Gene Information |
Locus tag | RPC_3414 |
Symbol | |
ID | 3970458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 3801926 |
End bp | 3803017 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637926525 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_533273 |
Protein GI | 90424903 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3591] V8-like Glu-specific endopeptidase |
TIGRFAM ID | |
|
|
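The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are inferred from the reported numbers, not stated in the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3801926
end_bp = 3803017
reported_gene_length_bp = 1092
reported_protein_length_aa = 363

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
computed_gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS length counts the terminal stop codon, which is not
# part of the protein, so one codon is subtracted.
computed_protein_length_aa = computed_gene_length_bp // 3 - 1

print(computed_gene_length_bp == reported_gene_length_bp)        # True
print(computed_gene_length_bp % 3 == 0)                          # True
print(computed_protein_length_aa == reported_protein_length_aa)  # True
```

Under these assumptions the reported gene length (1092 bp) and protein length (363 aa) are consistent with the coordinates.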
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.204379 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0483754 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCAGG CACCCAGGAA GCCCGCGACG CCTCGGCAGC GGCGATCGCC GCGCGCCGCG ACCCAGAGCG AGCCAGCCAG CAGCATGGCA ACAATCGAGC AGCATAAGGC GGTCTCCGAT CCGCAGCACG ACCCGTGGGT GCTGCCGAGT TTCATGTCTG AGACCAACGG CGGCGGCTCC GCCGAGCAGG TGCCGGGCTT CAACATCTTC CGCGCCATCA CTTCGGGATT CGAGAGCAAG CGCGTCGCCG GTCCGACCAC CGTGCTGGCA CGGCCGGACG GTGGGTTCTT TTCCGAAGGC ACGCCGCCGC TGTCGCCGTC CCCGCAGGCC CGCGCCTTCG CCGCCAAGCC GGAGAACGTC ATGGGCGTCG ACAACCGCGC GATGGTGCCG GACACCTCCA CCACCCCCTG GCGCTGCATC TGCCATCTGG AGGTGGAGTA CGAATCCGGA CCGGTGGGTT TCGGCACCGG ATTCATCATC GGGCCGAAGG CCGTGCTGAC CGCCGCCCAC GTGATCTACA ACAACACCGG CGGCAACAAC CGCAGGGCGC GCAACATCCG CGTCATCCCC GGCCGCAACG GAACCACCGC GCCCTATGGT TACTTCGTGA CCAGTCTCGA CCGCTGCATC ATTCCGGAGC AGTGGCGGCA GGCCAGCGAC ACCATCGGAG ACACCGCCGC CGCGGCGGAT TACGCCGTCA TCCAGTTTCC CGAGCAGTCG GAGTGCGACG GGCTGACCTC GGCCGACCGG CTGGGCTATT TCGGCCTGAA ATGCTTCGCC GACGAGGACA GCGAAAACAA GGCGCAGATG CTGTTCGTCA ACAACGCCGG CTATCCCTAC GAGGCCGACA AGCCGTATGG AACGCTGTGG TACAATGCCG GACGCATCCG CAAAATGGGA CCGAGCTTCG TGGAATACAT GGTCGATACC GAAGGCGGAC AGAGCGGCAG CCCGGTGTAC TTCTACGACG ACCAAACGAA GCAACGCTAT GTGATCGCCG TGCATACCAC CGGCGATTTC GTCAACCGCG GCCTGCGCAT CACCCCCGAG ATTTTCACCA ACCTCAAACA GTGGACCGGG CGCTCGATCT GA
|
Protein sequence | MAQAPRKPAT PRQRRSPRAA TQSEPASSMA TIEQHKAVSD PQHDPWVLPS FMSETNGGGS AEQVPGFNIF RAITSGFESK RVAGPTTVLA RPDGGFFSEG TPPLSPSPQA RAFAAKPENV MGVDNRAMVP DTSTTPWRCI CHLEVEYESG PVGFGTGFII GPKAVLTAAH VIYNNTGGNN RRARNIRVIP GRNGTTAPYG YFVTSLDRCI IPEQWRQASD TIGDTAAAAD YAVIQFPEQS ECDGLTSADR LGYFGLKCFA DEDSENKAQM LFVNNAGYPY EADKPYGTLW YNAGRIRKMG PSFVEYMVDT EGGQSGSPVY FYDDQTKQRY VIAVHTTGDF VNRGLRITPE IFTNLKQWTG RSI
|
| |
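The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch under stated assumptions, not the method used to generate the record: the helper names `clean`, `translate_cds`, and `gc_fraction` are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares. Table 11 differs in its permitted alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
# Codon table built from the conventional TCAG ordering of the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into blocks of 10 and uppercase it."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 12 bases of the gene sequence in this record.
gene_start = "ATGGCCCAGG CACCCAGGAA GCCCGCGACG"
gene_end = "CGCTCGATCT GA"

print(translate_cds(gene_start))  # MAQAPRKPAT
print(translate_cds(gene_end))    # RSI
```

The two translated fragments match the first ten and last three residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.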