Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0440 |
Symbol | |
ID | 3909996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 482994 |
End bp | 484511 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637882326 |
Product | peptidase S10, serine carboxypeptidase |
Protein accession | YP_484062 |
Protein GI | 86747566 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.480832 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATGT CGCCCGGATT CGCCCGCCGC GTCGCGCTGG CGCTGATACT TCTCGCAACC ACCGCAGCGC TCGCTCCGGC CCGTGCCCAG GACGCCTCGC CCCCGGCCAC CGCGCAAAGC AAACCCGCGC CCGCCGACGC CGAGCAGCAC AAGCTGCCCG CCGATTCGAC GACCAGACAC ACGCTGGCGC TGCCCGGCCG CAGCCTGTCC TTCACCGCCA CCGCCGGATC GATCCGGCTG TTCAACGACA AGGGCGAGCC GCAGGCCGAT ATCGCCACCA CGGCCTATCA GCTCGACAAT ACCGAGGCGC GGACGCGGCC GGTGACCTTC GTGTTCAACG GCGGGCCGGG CGCTTCCTCG GCCTGGCTGC AGCTCGGCGC GGCGGGGCCG TGGCGGCTGC CGATGGCCGG CGACGCCGCA GTCGCTTCGG CGACGCCGGC GCTGCAGCCG AATGCCGAGA CCTGGCTCGA CGTCACCGAC CTCGTCTTCA TCGATCCGGT CGGCACCGGC TACAGCCGTT TCGTCGCCAG CGGCGACGAC GTGCGCAAGC AGTTCTATTC GGTCGACGGC GATGTCGCCG CGATCGCGCT GGTGATCCGG CGCTGGCTGG AGAAGCACGA CCGGCTGCTG TCGCCCAAAT ACGTCGCCGG CGAAAGCTAT GGCGGCATTC GCGGCCCGCG CGTGGTGCGC AATCTGCAGA CCCGCCAAGG CATCGGCGTC AAGGGGCTGA TCCTGGTGTC GCCCTTGCTC GACTTCCGCG AATACTCCGG ATCGAGCCTG CTGCAATACG TCGCCAGGCT GCCGACCATG GCGGCGGCGG CGCGGCAGCG AAAGGGTCCG GTCACCCGCG CCGATCTCGC CGACGTCGAG AGCTACGCGC GCGGCGAATT CATCACCGAT CTGCTCAAGG GGCAGGCCGA CCAGGCCGCC ACCCAGCGCC TCGCCGATCG CGTCGCGGCC TCGAGCGGGA TCGATCCCGC CGTCAGCCGC AGGCTCGCCG GCCGGCTCGA TACCAGCGAA TTCCAGCGCG AGTTCGATCG GGCGAACGGC AAGGTCACCG GCCGGTTCGA CGCCTCGGTG CTGGGCTTCG ATCCGTTTCC GGATTCCAGC GACGCGCAGT TCAGCGACCC GTCGTCGGAA TCGCTGATCG CGCCGCTCAC CAGCGCCGCG ATGGAGCTGA CCCGCAACAT GCTGCAATGG CGCCCGAGCG GCTCGTATCA TCTGCTCAAC GGCGCGGTGT CGCAGCAATG GGATTTCGGC CGCGGCCGCA GCCCGGTCGA ATCGGTTACG CAGCTGCGCG AAATTCTCGC GGTCGACCCG AAGCTGCAGG TGCTGGTGAC GCACGGCCTG TTCGACCTCG CCACGCCGTA TTTCGGCAGC GTCGTCGCCA TCGATCAGTT GCCGCCATTC GCATCGAAGC GGATCAGGCT CGTCACCTGG CCCGGCGGCC ACATGACCTA CGCCCGCGAC GACGCCCGCA AAGCGCTGCG CGGCGAGGTC GGGGCGATGA TGAAATAA
|
Protein sequence | MSMSPGFARR VALALILLAT TAALAPARAQ DASPPATAQS KPAPADAEQH KLPADSTTRH TLALPGRSLS FTATAGSIRL FNDKGEPQAD IATTAYQLDN TEARTRPVTF VFNGGPGASS AWLQLGAAGP WRLPMAGDAA VASATPALQP NAETWLDVTD LVFIDPVGTG YSRFVASGDD VRKQFYSVDG DVAAIALVIR RWLEKHDRLL SPKYVAGESY GGIRGPRVVR NLQTRQGIGV KGLILVSPLL DFREYSGSSL LQYVARLPTM AAAARQRKGP VTRADLADVE SYARGEFITD LLKGQADQAA TQRLADRVAA SSGIDPAVSR RLAGRLDTSE FQREFDRANG KVTGRFDASV LGFDPFPDSS DAQFSDPSSE SLIAPLTSAA MELTRNMLQW RPSGSYHLLN GAVSQQWDFG RGRSPVESVT QLREILAVDP KLQVLVTHGL FDLATPYFGS VVAIDQLPPF ASKRIRLVTW PGGHMTYARD DARKALRGEV GAMMK
|
| |