Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_37419 |
Symbol | CTPA |
ID | 5001647 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | - |
Start bp | 874359 |
End bp | 875777 |
Gene Length | 1419 bp |
Protein Length | 446 aa |
Translation table | |
GC content | 60% |
IMG OID | 640417068 |
Product | D1 proceesing peptidase |
Protein accession | XP_001417628 |
Protein GI | 145346296 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 0.299887 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGCCGA CGGCGCGCGC GCGCGCGGCG GCGGCGGCGG CGGCGGCGGC GGCGGCGGCG GTGGCGGCGG CGACGACGTT CGGGACGCCC GCGGCGTTCG CGGATGACGT CGCGCGCGGT CGAGGGGACG CGCGGACGAA ATCTGTGTCG TCGGCGGTGG AACTGGTGAG CGAGATCGCG GCCGAAGCGG CGGAGGCGGA GGCGGAATCG GAGGGCGCGA CGGACGCGAC CATCTTGGAC GAAGCGTGGG GGTTGGTTTT CGACAACTTT TTACCGGCGA GAAAATCTGA GTCGGACGGA TTCGATCGCG CGGCGTGGGA GGCGATCAAG GCCGAACACG AGGCGAATCC GCCTCAAAGT CGCGAGGAGG CGTACGAGAT GATTAAGTCG ATGCTGGGGA CGCTCGGGGA TAAGTTTACG CGCTTCATCG AGCCGGATCG GTTCACTTCG ATGTTGAAAT ACGACATCAC CGGCGTCGGT TTGAACATCG CGGAAGATGC GGACGACCCT GAACGCGTGC GCGTGCTGGG AATGGTGCTC GACTCGAGCG CGATGAAGGC TGGAGTGGCG CAGGATGATG AAATCGTCGC CGTCAACGGC GAACTCGTGC GCGGCTTGAG CGCGTTTCAG GTGTCTTCGC TCATTCAAGA GGCTGACGGG AAGAGCGTGG ATCTAACAAT CTCGCGCACA GGCGAAGACG TCCCGCGCGT CGTTTCTCTG ACGCGAGACA GTCAATTCGA AGCGCCGAAA AGTCCAGTGA GCATGCGTCT GGAGGGCGGA CACGTCGGTT ACATTCGGCT TCGCGAGTTC AACTCGCTCG CCGAGCGCGA TATCGCGAGA GCGATCACGG ATTTAAGGAC GCAAGGAGCA GACGCGTATA TTCTAGACTT ACGCGACAAT CCTGGGGGAT TAGTGCAAGC TGGTGTGGAG ATTGCTCGAT TATTTTTACC TGCGGATTCG ACCATCGCGT ACACCGAAGG TCGAGTCGTC GCCGGAGGCG TCAAACGCGA TACCGACGTC TCGGCGACAA AAACCGCGAG AAACGGATCT GATTCTCAAC TACCGACTAA GCTGAAGGCG ATCACGACGT CGAAAAATGA CCCTGTCGTC GCCGCTGACG TTCCGCTGGT TGTTCTTGTC AACGGCAGAA GCGCTTCTGC GAGCGAAATT TTAACCGGCG CTTTGAAGGA CAACTGTCGA GCGACTGTGG TCGGGAGTAA GACGTACGGC AAGGGTTTGA TTCAGAGCGT GTACGAACTC AGTGATTTGA GTGGGATGGT ACTCACCGTG GGTAAGTACG TCACCCCAGG TCTCGTCGAC ATCGATCAGA CAGGGATTTC GCCAAACTTT ATGATGTTCC CGGGCTTTGA CGCCGCGGCG AGAGAAATCG ACGCGTGCAA AGTGCCACCA AAATATTGA
|
Protein sequence | MGPTARARAA AAAAAAAAAA VAAATTFGTP AAFADDVARG RGDARTKSVS SAVELVSEIA AEAAEAEAES EGATDATILD EAWGLVFDNF LPARKSESDG FDRAAWEAIK AEHEANPPQS REEAYEMIKS MLGTLGDKFT RFIEPDRFTS MLKYDITGVG LNIAEDADDP ERVRVLGMVL DSSAMKAGVA QDDEIVAVNG ELVRGLSAFQ VSSLIQEADG KSVDLTISRT GEDVPRVVSL TRDSQFEAPK SPVSMRLEGG HVGYIRLREF NSLAERDIAR AITDLRTQGA DAYILDLRDN PGGLVQAGVE IARLFLPADS TIAYTEGRVV AGGAITTSKN DPVVAADVPL VVLVNGRSAS ASEILTGALK DNCRATVVGS KTYGKGLIQS VYELSDLSGM VLTVGKYVTP GLVDIDQTGI SPNFMMFPGF DAAAREIDAC KVPPKY
|
| |