Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25467 |
Symbol | |
ID | 5005532 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | + |
Start bp | 87034 |
End bp | 90488 |
Gene Length | 3455 bp |
Protein Length | 1089 aa |
Translation table | |
GC content | 60% |
IMG OID | 640420953 |
Product | predicted protein |
Protein accession | XP_001421203 |
Protein GI | 145353830 |
COG category | [R] General function prediction only |
COG ID | [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0102668 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0616217 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGACTCGACG CGAACGGCGA CGCGCGCGAC GCGCGAACGC GAGGGACGCC GACGCGAACG CGCGACGCGC GACGCGCGCG GGACGAGACG CGACGCGCGA GCGACGCGGA CGCGCGATCG ACGACCGAAC GCGTCAACAC CCTTCGACGA CCGCGGCGAG AAGATGGCGC GCGGTGAGCG CGCGCCGCTG CTGGCGCGAG GCGAAGACGT CGAGCGCGGA CTTCGCGATG ACGACGCGGC GATCGGCGCG GCGGACGCGT CGAAAGGCGC GAGAGAGGGA CGCGAGGGAC GGGGAAGGAC GCGGACGACG GCGCTGGCGC TGGGCGGATG CGCGGCGCTC GCGAGCGTCG CGGCGTACGC GATGACGGCG GGTTCGGGGT CGAAGGCGAC GATGACGGCG AGCGGTGCGA GCTTGGGGGA GCGATTCGCG GCGAATCGGG CGCAGATGGT GGCGAGACGA CGACAGGCGC GCGCGCGGGC GAATCGTAAA CGCGCGGCGA ACGAGCGAAG GATGAAGGCG GAACGCGAAG AAAAGGTGAA TGAGGTTTTG ATGACGCTCG CGAAGGAGAA GCCGGAAGAC GCGAAGACGA TGGTGGAGAA TGCGATCGAG CGATGGGAGG CGAAGCAAGC GGCGCGAGCC AAGGCGAGCT TGGGCGAAGA CGAACGGGCG CCGAAGCGGC GCGCGTCGCG TCGACGCGAA CGACGCGAAC ACGCGCGCAG AAGCACCACG AAAGGACGCC GTGCTCACGT GGCCGAGGCG CGTATGGGCG TGGGCGCTGA GACGTCGGTG GAGAGTCTGT CCCGCGGAGA CGCCGCCGTG CGAGCCAAGG GTGAAGAAAA AATTCGGCAA GTCGAGGACG AATCCGATGA GCTCATCACT GAAATCAGTT CTTCCGTGGT CAAGCGCGTA GGAAAGATTA AGGCTGCAGC CGAGGCCGAA GGCTCGGAAC AAGCGAGAGA CGCCGCGCGA GCCGAGGCCG AGAAGTTTGT ACAGGCGGCG AACGAACGCA TCGAACGCAT CGAAGAAAAG ACGGCGACGC GCGTATCGAT GATCAAGGAT CTGACGATGC AACACATGTC GTATGACGGT CGTAACGCGT ACTCGGCGCA AGTGGGCCAG AGTGAGGTGG CCAACATGGG TGAATCCGAA ACGAACGATC CTGACTCCGC CGACGAGCAA GAAGACGACG AGGACGAGGA CGAGGAGTTG GTCGAAGATT TGCTGGCGGA TGGCGAACAA GAACAAGAAG AACAAGAAGA CGCTGGTAGT GGCAATCAGG TTGTGCCGGG GACGGCGACG GAGACGGAGA CAGGGAGTGA CGAAGAAGAC GAGGCCAACG AAGCAGCCGA AGAAGAAGAA GAAGATCATG AGATCAACAA GCTGTCCAAC GTCGAACTCA TTCAGTTATT GGATGACAAA CTCGATAACC TGGAACAGAG TAACGAAGAC TTCAAATCGG CGATTACTGA GCAAATCGAA GGCTTGTCGA CAAGAATGTC TAACCTCGAG AATTCATCGG AGGAGCTGCA GGACGACGTC ACGGAGATGG CGCTCGACGA GCACTACGAC AACGACGAGG ACGGCGACGG CGACAGCGAC AGCGACAGCG ACGAGGACGG CGACGAGGAC GGCGACGGCG ACGGCGACGG TTATCCCAGA GACGAGACCA TCGACGGCAA CTACCCCACC GGGAACGGTA AATATCCCGC CGATAGCAAC GGCAACTATC CCACCGCCCA AACCGTCACC GTCGATGAAA ACGCCTTGAG CATCAGCGCG GTCAATGACC AAATTTCCAA TCTCACGGCG CAGCTTCAGA ACGCCACCGC CGCCGCCGCG CAAGCGCAAG CGACTCAGGA ACAATTGGAT CAGACGCGAG CTGAGCTTGC GGCGGCGCAA GCGCAGCTTA ACGCTCGTTA CTTCGATTTG GACGTTTTCC GTAGGCGTCC GGTCGGTCTC GAACCGGATT TCCAAACCAG TGACCCGACC GCGGAATCTG GCGCATTTTA CGCTTGCGAC GTGCAAAATC AATGCATGAC GGGCGCGACG CTTTTGACAC GTAATAATAA GTTGCTCGCG ATGAAGGGCG TGTGCGACAA CGGCGCGAAG TCGACGGATT TGTCTTCAGG ATCCTATGAA ATGTTCCCGA CGCACGGTAC GATTGACTTT TCGTCGACGG ACACCGACAA CGAGTGCACG ATGGAATTCG GTGAAAACTC GCGGTCGGTT TGGATTCGCA AGGAAAATGA TTTTGTCATC GCCATGTCCA AGGACGGCTC CGCACACACG CGGTGTGGTC GCGATGTCGC GAGCGCGTCT TCCATCAAGT ATCAGTGCCA AAATCCGAAA GCGTGCATCA CGGGATATCA CATCAAGAAT CAGTACGGCA ACTTCGATGG CGACGCGAGC ACACGCACCG AAGAAGACTT GGCTATATCC GTCGTTGATT TCGTGTGCTC CGACGGCAGT TTCGCCGTCG ACCCGGTTTC ACTGCGACCG GATTACGGCC AAATTAGAGT TGAACCAGAA TTTACCATTG GCTTGGCGAA CACGCCGCGA AAGGATGGTT CGGTCAGGCG CACGGTGACG CCGTATTTGC GACTCAAGAC GAATACGGTA GTGAGCGATT CGTTTCTCTT GGCTGTTCGC GTCCGCGACG TGACCGCGTT GACCGAACAA GAGAAACACG GTTGGTGTAC GCAACAACAC GTGCAAGCGC ACCTAAACGC TCCGCACACG TGCAAAGATG GAAGCAAGCT GTGCGCGAAG CCGGCGTTTG TGTTCAACCC AGGCGGATCA GAGTATGAGT TGAAATTTGG CGAACAGACG TTCGATCTTC TCGACGCCCA AGCGCACGAA TCCGCGCGCA TCGTCGTTGA GAACGAAGAC CAAGCAAACG ACGCACGCGT CAAAAACTTT CCGATGCCGG CGAATGAGGA ATTCTTCATG TTTGGTCATC ACTACGAAGT GTGCGTCGCC GCCATCGAGC GTTCGAGCTT CAAAACTAAA ATCATCGACC CAAGCACGGG CGCGGTGAAG GATTCGTTCT ACAATCGCCA CCTCGACGGT TCCGCGCACC TCGTCGGCTT GGCCACGACG GGCGAGCTCT CCGTCACCAT GGAAGGGGGA CCGAGACACT TCAACATCGT CGCTCCAGCG GCGCCGACGA CCGCTGATTT ACAAAATCTT CAGTGTTACG ACAACGATTG CGCCATTTCG GGCGGTGTCG CCCCGGTGGT GAGCGCGCAG AGCCAAGTGT CGCCGCAACC GACGCCGCAA CCGACGCCGG CGCCGCAACC GACGCCGCGA CCGAACGACG GCGTCGCGAC GCTCGCCAAG GAATCCGCGC AGCAAAAGAC GCACGTCACC ACCGTCCTCG ACGCGCGCAA AGCGCTCGAA GCGCGTCTCC AAGACGCGCA GCGGCGCCAT TAAAGTAGCC CCATTAAAGT ACCCT
|
Protein sequence | MARGERAPLL ARGEDVERGL RDDDAAIGAA DASKGAREGR EGRGRTRTTA LALGGCAALA SVAAYAMTAG SGSKATMTAS GASLGERFAA NRAQMVARRR QARARANRKR AANERRMKAE REEKVNEVLM TLAKEKPEDA KTMVENAIER WEAKQAARAK ASLGEDERAP KRRASRRRER REHARRSTTK GRRAHVAEAR MGVGAETSVE SLSRGDAAVR AKGEEKIRQV EDESDELITE ISSSVVKRVG KIKAAAEAEG SEQARDAARA EAEKFVQAAN ERIERIEEKT ATRVSMIKDL TMQHMSYDGR NAYSAQVGQS EVANMGESET NDPDSADEQE DDEDEDEELV EDLLADGEQE QEEQEDAGSG NQVVPGTATE TETGSDEEDE ANEAAEEEEE DHEINKLSNV ELIQLLDDKL DNLEQSNEDF KSAITEQIEG LSTRMSNLEN SSEELQDDVT EMALDEHYDN DEDGDGDSDS DSDEDGDEDG DGDGDGYPRD ETIDGNYPTG NGKYPADSNG NYPTAQTVTV DENALSISAV NDQISNLTAQ LQNATAAAAQ AQATQEQLDQ TRAELAAAQA QLNARYFDLD VFRRRPVGLE PDFQTSDPTA ESGAFYACDV QNQCMTGATL LTRNNKLLAM KGVCDNGAKS TDLSSGSYEM FPTHGTIDFS STDTDNECTM EFGENSRSVW IRKENDFVIA MSKDGSAHTR CGRDVASASS IKYQCQNPKA CITGYHIKNQ YGNFDGDAST RTEEDLAISV VDFVCSDGSF AVDPVSLRPD YGQIRVEPEF TIGLANTPRK DGSVRRTVTP YLRLKTNTVV SDSFLLAVRV RDVTALTEQE KHGWCTQQHV QAHLNAPHTC KDGSKLCAKP AFVFNPGGSE YELKFGEQTF DLLDAQAHES ARIVVENEDQ ANDARVKNFP MPANEEFFMF GHHYEVCVAA IERSSFKTKI IDPSTGAVKD SFYNRHLDGS AHLVGLATTG ELSVTMEGGP RHFNIVAPAA PTTADLQNLQ CYDNDCAISG GVAPVVSAQS QVSPQPTPQP TPAPQPTPRP NDGVATLAKE SAQQKTHVTT VLDARKALEA RLQDAQRRH
|
| |