Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_17568 |
Symbol | |
ID | 5004650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | + |
Start bp | 215234 |
End bp | 217681 |
Gene Length | 2448 bp |
Protein Length | 815 aa |
Translation table | |
GC content | 57% |
IMG OID | 640420071 |
Product | predicted protein |
Protein accession | XP_001420622 |
Protein GI | 145352587 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5160] Protease, Ulp1 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.180274 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00230003 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGACGCGC GGATGCGGCG CGCGAGGCGC GAGCACGAGA AAATCGCGCG AACGTACGCG GAGGCGCACG AGAACGTGGG GACGAGGGAA AACGGGACGA CGCGCGACGC CGACGCGCTG CGAAGGGAAC GCGTCAAGAC GAGTCATCTG ATAGCCAAGG TGAACGAAAC GAAAGTCGGC GCGAACGCGA CGCTGGTGAG AAGGAAGGAG TATTTAGACC GGATGATCGC GAACGCGGAG GAACGGGAGG CGTTGATGGA GGACGGCGGA TCGGAGGCGC GAGGCGGAGG CGCGGCGACG ACGACGACGC GCCGGGACGC GCCGCAGGAA TGGACGACGA TGGGGCAGAT GGCGGAGAGA AAGTCGCCGG CGAAGTCGCT GAGGGAGCGC GATGCGAGAG ATTTCCAGCC GGAAAAGTAT AGATTTGACG ACGAAAAGGC GTCGCCGAGC GCGGCGGAGG CTGTCAACGG GTTGAGCAGA GCGTTGGACG ACATAGACGG AGATTTAGGG TCGATGTCGT CTCTGTCGCC CGATAGATAT CCAGGACGAA AAGAAACGGC GAGTTGGGAC ACGTTAGCGC GACAACCGAC GTCGCACGGC AGTAGACAGT TGGACGCCGC GCTCGGGCGC GCGGGGGCTC CCGAGGCGAA GTCGACGCCG ACGACGATTT CGACGTACTT TAAGAAGAGC GCGGGTGCGA CGCAGCCGAC GACGACGACC GCCGCTGTGG ACGATTTCGC CGCTGCTGCG ACCGACAAGA GTCGCGAGCG TCTGCCGATG TCGCCGCCGC CGCCCAAGCG TCGAGCGAAG TCGACGCGCG GCGACCCGGC GTCTCCCATG GTGGTGGATC TTCTGGAGAC GAGCGACGAG GAGAACCCGT TGGAGAACGA GTCAAGAGGG GTGCAAAGCT TGGATCCAGT GAGTCTTACG AGCGAGAATG GTGAAATTCG CCGATCGACG CGCTCGAGTA GGTATCGCTC GCTGCAGAAC ACCGTCGGCA ACTTGGCGAC GGATTACCCA GACTCGAAGA CGAAGGGGTC GGTGCAAATC ACTCTCGGCG ACTTGGAGCA CCTCAGAGAC GGCGAGATGC TGAACGATCA GTGCGTGGAC TTTTATTTGA AGTACATTCA AGTGGAAATG TTGGGCGCGA ACGCGTTTGA GATATTAGAC AAGGTGCACA TTTTTAACAG CTTTTTCTAT CAAAAGCTCG CGCAAAAGCA CGATAGAGAT CGAAGCAACG TCGACGCCGC GACGGCATCG CACGCACGCG TGAAGAATTG GACCAAGGGT GTGGATATCT TTACGAAAAG CTTTCTCATG ATTCCCGTGC ACAGCAATTT GCATTGGTCT TTAGTCATCG TTTGCTATCC AAACGGTACT GATGAGCGCC AACCGATGAT GCTCCATCTC GACAGCATGA CGCAACATGG CGGGCACAAC TCAGAAGTCG TGTCGAAGAC GGTCCGGCGT TACCTGAGCA AAGAATGGAA AACGCAAAAG GGCGACGACA CTGAATCAAA ATTCGACGCT CGATACATGC CAACGTATCG CGTCAATGTT CCGAGACAAA ACAATGGTTG CGACTGCGGC GTTTTCATTT TGGCCTTTCT CGAAAAGTTT TTAACCGAAC AACCCGAGAT TTTGAAGAAG AGCGATGTTC AACGCGCGGC GCAAAAGCGC TCGTTTGGAA TGGACGATGC GGGCAAGTTT CTACGAAAGA ACTGGTTTCC CAACGAGTTC GTGGACGAAT TGCGCGCCAA GTTGAGTCTC CTCGTCATCC AGCGCATTCA GGCTTCGCTC GCGGAAAACG ACGCGTCAAA GCTCATTCTC AACGCGGCAA ATGAAGAGCA AATGAAAGAC TATGAGCATC GGCAGTGGCG AACGAAACAA GCGGTTTCAA AGGCTGAGCG TGACCGTAAG AAGAGAATGG AAGCCATGGA AGCGGAGCGA AGACGCAAGC TCGAGCCGCA AGACCTCTCT GACGACGACG ACGCCAAAAC CAAAGCCTCC GACGCCAAGG ATGTAGACTT TGAGATTTCT CGCGAAGTCA GGGTGAAGAA ACAGCCGACG AAGACTAGGC AGACGCAGAT ATTCTCGGGA TCCCACTGGA ACAAGGCGAG AGAGAAAGAA CCCGTGCGCG AGGAGACTCG CGACGAGAAG CCATTCGAAC CCTTCGCGGG CCCGTCGTAC AAAATCGGGC GACACACGCG CGCGCGTGCG CCTTCTCCGA GCGCTGATTG GACGAGCGGA ACGACCCGTC CCAAGTATGG TAACGCCAAC CTTTTGAAAC GAGCGGAACG ATCGCGAGCG AAGCAAAACG GGGAAGAGGA ACGGAGCGCC GCCGACGACG AATCCCCGGG CCAGGCCAAG GCGAAGCTCA ACGACGCGAA TGAAAAACTT TCCGAGAGAA TCAAGGGGAT GACGTCTCGA GTGTTCTCGA CGAAATAG
|
Protein sequence | MDARMRRARR EHEKIARTYA EAHENVGTRE NGTTRDADAL RRERVKTSHL IAKVNETKVG ANATLVRRKE YLDRMIANAE EREALMEDGG SEARGGGAAT TTTRRDAPQE WTTMGQMAER KSPAKSLRER DARDFQPEKY RFDDEKASPS AAEAVNGLSR ALDDIDGDLG SMSSLSPDRY PGRKETASWD TLARQPTSHG SRQLDAALGR AGAPEAKSTP TTISTYFKKS AGATQPTTTT AAVDDFAAAA TDKSRERLPM SPPPPKRRAK STRGDPASPM VVDLLETSDE ENPLENESRG VQSLDPVSLT SENGEIRRST RSSRYRSLQN TVGNLATDYP DSKTKGSVQI TLGDLEHLRD GEMLNDQCVD FYLKYIQVEM LGANAFEILD KVHIFNSFFY QKLAQKHDRD RSNVDAATAS HARVKNWTKG VDIFTKSFLM IPVHSNLHWS LVIVCYPNGT DERQPMMLHL DSMTQHGGHN SEVVSKTVRR YLSKEWKTQK GDDTESKFDA RYMPTYRVNV PRQNNGCDCG VFILAFLEKF LTEQPEILKK SDVQRAAQKR SFGMDDAGKF LRKNWFPNEF VDELRAKLSL LVIQRIQASL AENDASKLIL NAANEEQMKD YEHRQWRTKQ AVSKAERDRK KRMEAMEAER RRKLEPQDLS DDDDAKTKAS DAKDVDFEIS REVRVKKQPT KTRQTQIFSG SHWNKAREKE PVREETRDEK PFEPFAGPSY KIGRHTRARA PSPSADWTSG TTRPKYGNAN LLKRAERSRA KQNGEEERSA ADDESPGQAK AKLNDANEKL SERIKGMTSR VFSTK
|
| |