Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31659 |
Symbol | |
ID | 5001915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 301730 |
End bp | 303438 |
Gene Length | 1709 bp |
Protein Length | 550 aa |
Translation table | |
GC content | 60% |
IMG OID | 640417336 |
Product | predicted protein |
Protein accession | XP_001417982 |
Protein GI | 145347029 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0143795 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0189648 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGCGATGACG CGCGCGACGA AGCGCGCGGG AAAGACGCGC GCGACGAAGG CGCCGGCGCG CGCGTCGAAG CGCGCGCGAG GGGAGACGCC GGTGAAGGAG GAGACGACGC GACGAAGGCG AGGCGGCGCG CGAGACGCGG ACGAGGACGA CGAGACGGAC GCGACGGAGG CGCCGGCGAG CGGACGGTCG TCGGAGGACG CGGTGGATTA CTTTGGGGAC GTCGATGGAT GGGCGAGCGA CGACGAGGAC GCGCCGCAGT ACCCGCTGGA CGCGGTGGTG AAGGTGTTCG CGACGCACAC CGAGCCGAAC TGGAGCCTGC CGTGGCAGCG AAAGCGACAG AGCTCGAGCA CGAGCACGGG GTTCGTCATA GAGGGGAATA TGGTGCTGAC GAACGCGCAC AGCGTGGAAC ATCACACGCA GGTGAAGCTG AAGAAGCGCG GGAGCGATAA AAAGTACGTG GCCAAGGTGT TGACCATCGG CGTGGAGTGC GACTTGGCGC TGTTGACGGT GGAGGAGAAG GAGTTTTTCG AGGGCGTGGC GCCGGTGAAG TTCGGCGTGT TGCCTCGCTT GCAAGACAGC GTCACTGTGG TTGGGTATCC GGTGGGTGGG ATCGCGATTA GCGTCACGAG TGGGGTGGTT AGTCGGATAG AGGTGACGTC GTATTCGCAC GGCGCGACGG AGCTTTTGGG GGTGCAAATC GACGCGGCTA TCAATTCTGG CAACAGCGGG GGTCCGGCGT TCGGGCGCGA AGGGCAGTGC GTGGGTGTGG CGTTTCAATC GCTCAAGGAC TCGGACACCG AAGGAATCGG CTACATCATA CCGACGCCCG TCGTCGACCA TTTTATTAGC GACTTCAAGA GGACGGGCGT GTACAACGGT TTCCCGGCGC TGCAGTGCGA GTTCCAGCGC TTGGAGAACC CATCGCTTCG GAAGAGTCTC GGGATGAAAC CCGCGCACAA CGGCGTCTTG CTTCGACGTC TATCGCCTCT CGCGCCGGCT GCCAAGGTGT TGAAACGCGG CGATGTGTTG ATGAAATTCG ACGGCGTCGA CGTCGCTTCC GACGGCACCG TGGTTTTTAG AACGGGCGAG CGCATTAACT TTTCTTATTT AGTCTCTCGC AAGTACGTCG GCGATAGTGC TGCGGTCACC GTCCTACGCG ATGGCAAGAT GATGAATTTC GACATTTCGT TGACGCCGCA CGACCGCCTC GTTCCGGTGC ACATCGAGGG CAAGCCTCCG TCGTATTACA TTTGCGCGGG CATCGTCTTC ACCGTGGTGT GTGTGCCGTA CTTGCGATCG GAATACGGTA AAGATTACGA TTACGACGCT CCGCTGCGTT TACTGACGAA GATGATGCAC GGGCACAAGG AGAAGCCAGA CGATCAAGTC GTCGTCGTGA GTCAAGTGCT CAACTCGGAC ATCAACATTG GCTATGAAGA CATCGTCAAC GTCGTCGTGT GCGGCGTGAA CGGTAAATCC GTGAGAAACT TGCGCGAACT CGTGAAAATC GTCGAAGGCT GCAAGCACGA GTACTTGAAA ATCGAGCTCG ATCAATCGAT ACAAATCGTG CTCGAGACCA AGGCGGCGAA AAAATCCACG AAGGAAATTT TACACACCCA CTGCATCCCG AACGCGTCGA GCGTGGACTT GCGCTGAGCG CGAGGCGAGC GCTTCAAAGA CGACGATGAT GATTAGTATC GACTAGCAC
|
Protein sequence | MTRATKRAGK TRATKAPARA SKRARGETPV KEETTRRRRG GARDADEDDE TDATEAPASG RSSEDAVDYF GDVDGWASDD EDAPQYPLDA VVKVFATHTE PNWSLPWQRK RQSSSTSTGF VIEGNMVLTN AHSVEHHTQV KLKKRGSDKK YVAKVLTIGV ECDLALLTVE EKEFFEGVAP VKFGVLPRLQ DSVTVVGYPV GGIAISVTSG VVSRIEVTSY SHGATELLGV QIDAAINSGN SGGPAFGREG QCVGVAFQSL KDSDTEGIGY IIPTPVVDHF ISDFKRTGVY NGFPALQCEF QRLENPSLRK SLGMKPAHNG VLLRRLSPLA PAAKVLKRGD VLMKFDGVDV ASDGTVVFRT GERINFSYLV SRKYVGDSAA VTVLRDGKMM NFDISLTPHD RLVPVHIEGK PPSYYICAGI VFTVVCVPYL RSEYGKDYDY DAPLRLLTKM MHGHKEKPDD QVVVVSQVLN SDINIGYEDI VNVVVCGVNG KSVRNLRELV KIVEGCKHEY LKIELDQSIQ IVLETKAAKK STKEILHTHC IPNASSVDLR
|
| |