Gene Information |
Locus tag | OSTLU_44226 |
Symbol | |
ID | 5004424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | + |
Start bp | 503692 |
End bp | 504678 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | |
GC content | 55% |
IMG OID | 640419845 |
Product | predicted protein |
Protein accession | XP_001420363 |
Protein GI | 145352032 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
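The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon (the record lists the gene as not spliced, and the gene sequence ends in TAA). The function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(503692, 504678)
print(length)                     # 987
print(protein_length_aa(length))  # 328
```

Both results agree with the Gene Length (987 bp) and Protein Length (328 aa) fields.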
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0107542 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATTCG CTCGTTTATT TGCCGTGTTT GCCGACGACA AGGAAGTCAA CGCACTTCGC ACGTGCGTTC CCGCGCTAGA TTTCGAAGAA GACGTTGAAG TGTTTTCACA AGGCGTGACA ACCTGGTCCA GAGACCGTCT GAACGGTAGA GACGGGCTTG ATGGTAAGTT GCTTCCTCGC GATCCCCCTG AAGATGGCTA TGGGAGCATC GTGCACGTTT ATCTGCTTGA CACGGGTGTG AGAAGGACGC ACGTGGAATT CAAAGATCAA GCGTTTGGGC GCGGCGTCGA TTTAGTCGAC GATGACGCCG AACCTGATGA CTGCGATGGT CACGGTAGTC ACGTGGCTTC GACCATCAAT CAAATCGCGT ATAGCGGCAA GACAGTTCTC CACTCTGTGA GAGTTCTCGA CTGTAACGGA AATGGTGAGC TCTCGGGGCT GATTGAGGGA CTTGAATGGG TGCTCGGTGT GGCGACGCCG TCCGAGCCAG CCGTGGTCAG TTTAGCGCTC GGAGTGCGAA ATGGAATTTG GTCACGCGCA CTCGAACGCG TCGTCCAAAC GCTCACTGGA CGTGGAGTAT TCATCGTTTG CGCCGCTGGT AACCAAAAAG GAGACGCATG CACGATTTCG CCTGGAAACG TCGCGGAGAC GCTGACTGTC GCAGCCAGCG ATCAAGCAGA CGCGCCGTAC GCCTATGGAA ACTCTGGAAG GTGCGTTGAT TTATTCGCGC CCGGCGTGCA AATTCTCGGT GCATGCGGTG GAAGCACTGC GTGTGAGCAT CCAAGCGACA CGGCATATGC ATTTCAAAGC GGTACGAGTA TGGCGGTCGC GCACGCCGTC GGCGCAGCGA CGCGCCTTCT TATGTTTTCC CCACGCATGA GCCCTGAAAA TTTGAAGAAG CATCTCACAT CAACTGCGTC GCGCGACAAA ATTCGAGGTG GGTCATTACT CCCAGGAACT CCGAATCTAC TGTTGTACGT GAAATAA
|
Protein sequence | MRFARLFAVF ADDKEVNALR TCVPALDFEE DVEVFSQGVT TWSRDRLNGR DGLDGKLLPR DPPEDGYGSI VHVYLLDTGV RRTHVEFKDQ AFGRGVDLVD DDAEPDDCDG HGSHVASTIN QIAYSGKTVL HSVRVLDCNG NGELSGLIEG LEWVLGVATP SEPAVVSLAL GVRNGIWSRA LERVVQTLTG RGVFIVCAAG NQKGDACTIS PGNVAETLTV AASDQADAPY AYGNSGRCVD LFAPGVQILG ACGGSTACEH PSDTAYAFQS GTSMAVAHAV GAATRLLMFS PRMSPENLKK HLTSTASRDK IRGGSLLPGT PNLLLYVK
|
| |
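The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-content calculation; the helper names are hypothetical, and how the database computes or rounds its own GC content field is not stated in the record, so this is an assumption about the usual definition (fraction of G and C bases).

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C (the usual definition, assumed here)."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence shown above, used only as a short demo.
demo = clean_sequence("ATGCGATTCG CTCGTTTATT")
print(len(demo))         # 20
print(gc_percent(demo))  # GC percentage of this 20-base fragment only
```

Applied to the full gene sequence, `len(clean_sequence(...))` should equal the Gene Length field, and `gc_percent(...)` can be compared with the GC content field.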