Gene Information |
Locus tag | OSTLU_26734 |
Symbol | |
ID | 5004832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | - |
Start bp | 119540 |
End bp | 121135 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | |
GC content | 68% |
IMG OID | 640420253 |
Product | predicted protein |
Protein accession | XP_001420763 |
Protein GI | 145352883 |
COG category | [O] Posttranslational modification, protein turnover, chaperones; [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat; [COG5243] HRD ubiquitin ligase complex, ER membrane component |
TIGRFAM ID | |
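The coordinate and length fields in this table can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(119540, 121135)
print(length_bp)                  # 1596
print(protein_length(length_bp))  # 531
```

Under those assumptions the results match the Gene Length (1596 bp) and Protein Length (531 aa) fields.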
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.191061 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.915553 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGACG ACGCGCGCGA GCGTCCTCGC GCGTCGCCGG ACGCCGCCGT CGCGTCGTCG TCGTCGCGTC CCTGGACGCT CGCCGTCGCG CTGCGCGCGC ACGCCGCCGC CGTCAAGTCC GTCGACGTCG CGCGCGCGTC CGCCGACGGC GCGACGACCG TCGTCAGCGC GTCCGCGGAC GGCGACGTGC TCGCGCGACG ATCGAACGGC GCGACGTGGG AGCCCCTGGC GCGCTTGCGC GGGCACGACG GCGGCGCGCG CGCGGCGAGG TGCGCGGGAC GCGCGCGAGA CCGCGTCTTG TCGGCGTCGT ACGATCGAAC GGCGCGCGTG TGGACGCTTC CGAGGCGCGC GAACGACGCG GGCGAGGCGA CGCGGGCGAA ATCGGCCGCG GTGGTGACGC TGCGAGGGCA CGGGGATTAC GTGGTGGATT GCGCGTGGAG CGAAGACGAA AACTTGGGCG AGCTCGCGAC GACGGCGAGC GCCGACGGCA CCGTGCGCGT GTGGCGCGCG GAGACGGGAG AGTGCTTAAA AATAGTCGAC GCGCGCGAAG AGGGCGTCGG GGCGGTGACG TGCGTGGACG CGCGCGCGAC GACGGACGAC GCGGGGACGA CGTCGGACGA CTGCGTCGTC GTCGCGGGTC TGATGGACGG TTCGGTCAGG TTCTTTCACG TGCGCACGGG GGCGTGCGCG TTGATATTGG TGGGTCATCT CGGCGCGGTG ACGTCGGTGG CGACGCCGGA TGCGCCGGAG ACGTGGATGG ATTCGGCGAA GGCGCGCACG CGCGTGTGTT ACGGTTGCCG AGACGGCGCC GTCGGGTGCT TTGACGTGTC TTCGGATGAG GGTGGTCAAA GAGCGGACGT GACGCACTTA ATGCGTCGAC GGACGCATCC AGACGCCGAT CTCGAGGCCA CGACGAGCGT TAAATTTATC CTCTCCAACG CGCAAACGCT CGCGTCGTCG TCTGAAGACG GCACGGTGAG AATATGGAAC GTCCAGCGCG GCGAGTGCGC GATGGTGTTG ACGGGACAGA CCGCCGGCGC GTCCATTGAT TGCGTCTCCA TGCTGGATGA CGTCTTGGTC AGTGGTGGTT CCGATGGGAG CGTTTCGATT TGGGAGAAGA AACCGCGAGC GTCCACTGAG GAGAGTAGAC GCGACGACGA CGACGACGAG GCGACGCTCG CGACGCGCGC CGACATCGCC CTTCGGTGGC TCTTGCGCCG GGGACCTCAA ATTCGCGTCG ACGAAGTCGA CGAAATCGAA GAACGCCTCC TCGTCGCCGC ATTCGACGCC GCGGATCGAG AAGCGCGAGC GTTGCTCGCC GTGCGCCCGC TCGCAGACGA CCGCGAATGC GGCGTCTGTC GCGATTCGCT CGCCGTCGGC GAACTCGCGC AGCTTCCTTG CTCTCACACT TTTCACGCCG ATTGTCTCCT TCCGTGGATG CGAGTGTCGC ACCAGTGTCC TCTCTGCCGT GAAGTCAACT ACGAGTCCGG CGTCTCGCAC GCCCTGGCGT GCGTGCGCGT CTTCGCTCCC GCGCGTCGGC CCGCGATTCG CGACTCCGGC GCCGTTTCGA TCGAACTCGC GACGCCGACA TCCTAG
|
Protein sequence | MSDDARERPR ASPDAAVASS SSRPWTLAVA LRAHAAAVKS VDVARASADG ATTVVSASAD GDVLARRSNG ATWEPLARLR GHDGGARAAR CAGRARDRVL SASYDRTARV WTLPRRANDA GEATRAKSAA VVTLRGHGDY VVDCAWSEDE NLGELATTAS ADGTVRVWRA ETGECLKIVD AREEGVGAVT CVDARATTDD AGTTSDDCVV VAGLMDGSVR FFHVRTGACA LILVGHLGAV TSVATPDAPE TWMDSAKART RVCYGCRDGA VGCFDVSSDE GGQRADVTHL MRRRTHPDAD LEATTSVKFI LSNAQTLASS SEDGTVRIWN VQRGECAMVL TGQTAGASID CVSMLDDVLV SGGSDGSVSI WEKKPRASTE ESRRDDDDDE ATLATRADIA LRWLLRRGPQ IRVDEVDEIE ERLLVAAFDA ADREARALLA VRPLADDREC GVCRDSLAVG ELAQLPCSHT FHADCLLPWM RVSHQCPLCR EVNYESGVSH ALACVRVFAP ARRPAIRDSG AVSIELATPT S
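The sequence fields above are displayed in space-separated blocks of ten characters. A minimal sketch of joining those blocks and computing a GC fraction follows; the helper names are hypothetical, and the example input is only the first two blocks of the gene sequence rather than the full field.

```python
def join_blocks(display_seq: str) -> str:
    """Remove the whitespace that splits the sequence into display blocks."""
    return "".join(display_seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two blocks of the gene sequence, used as a small example.
fragment = join_blocks("ATGTCCGACG ACGCGCGCGA")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.7
```

Passing the full Gene sequence field through the same two functions gives values that can be compared against the Gene Length and GC content fields of the record.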
|
| |