Gene Information |
Locus tag | OSTLU_31752 |
Symbol | |
ID | 5002097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 476504 |
End bp | 479707 |
Gene Length | 3204 bp |
Protein Length | 1048 aa |
Translation table | |
GC content | 60% |
IMG OID | 640417518 |
Product | predicted protein |
Protein accession | XP_001417784 |
Protein GI | 145346620 |
COG category | [R] General function prediction only |
COG ID | [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
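The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene span holds the coding region plus one stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 476504
end_bp = 479707
gene_length_bp = 3204
protein_length_aa = 1048

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1
coords_match_length = span_bp == gene_length_bp

# Assumption: an unspliced gene must be at least long enough to hold
# one codon per residue plus one stop codon.
min_cds_bp = 3 * (protein_length_aa + 1)
protein_fits_in_gene = min_cds_bp <= gene_length_bp

print(span_bp, coords_match_length)      # 3204 True
print(min_cds_bp, protein_fits_in_gene)  # 3147 True
```

Under these assumptions the listed Gene Length agrees with the coordinates, and the listed Protein Length fits within the gene span.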
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0703442 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAGA AACTCGGCAG TCGCAAAGAT CGCGTCGTCA CGTTCGTCGT CACCGTGCGC ACGCTCGAAC CGTGGCCCAC GCGCGCGTCG GGCCGTCCCG GACAGTTCGC CATCGGTTGG CAGCGCGGCG CGAATAAACG CGGAACGACG CCCGTGCGCG CGGGCGAACG CTCCGATGAC GGCGCGCGCG CGACGTACGC GTTCGATCAC ACGTTTGAGG TCGAGGCGAC GGTGCGCAGG GCGGGAAAGA CGGGACACAA GGAGAAAACA CTGACGCTGT ACGTGTTGGC GCTGCCCGAA GACGCGACGC GCGGACGCGA GGTGACGGCG GCGAAGGTCG GGGCGTGCGA CGTCGATCTG GCGAAATACG TCGATCGCAC GGAGGATGAG ACGATAATGA TCGACGTGGA GTGCGGCGAA GGGGTGCGAC GGGCGGTGGG GACGCCGAAG CTTTCGATTT CCGTGCGCGC GAAGGAGGGC GGCGCGAACG CGGAGGGACG CGAGGGCGAG GCGAACGCGG CGAACGCGTC GCCGTTGAAA TCACCGACGG CGAGCGAACG CGGGGGCGAG TACCAGTGGG CGTCTTCGAG GTTTCAGTCG GAGCAGGCGA GCTCTTCGCA GGCGGATCAA GTGGAGGCGT TGACGAGCAT GGCGAGCATG TTCAAAAAGC GCGCCGCGCC CGTCGAGGAA ACCTCGGTCG TCGACGACGT CGTCGACGAG GGTTCCGAAG CCACGGCCAA TGACGAAGCC GACGTTGAAG CGAATACGGC TCGAGCGGAA CTAATCGTCG CGCGGGCGGC GACGCCGAAA TCGGACGAAT TTTCGTCGAC GCCCGACGGC ACTCCCGCGT TGGAGCAAGA AGATCCAGAG TTGACGCGAG CGCGCGACGA ATTGTTTGGT GCGCCACCAA AAGATGCCGC GACGTCACCA CGCGGTGACG TCGACTCCGA TGGTTTCTTG CTCGATTCGG ATCTGGACAC CGAGGGCGAA GCGGACGAAG AGACCCCAGC GGAGTTCACT CGTCCAGTCG AAGAGTCGCC GGCGCGAGAC GACTCTGAAA ACGCCGCGGA GCAAGCGAGA TTGGCCGAAG AAGAGGCGAG AATACGCGCT GAAGAAGACG CTGCTGTAGC TCGCATCGAA GCCGAACGCA AGGCATTTGA AGAAGAGGAG CGCCAGCTCG AAGAGCAAGC GCGACTCGAA GCTCAGCGAG CAGAGGAAGA AAGAGTTCGC GTCGACGAGG AAGCGAGGTA CGCACGCATG GAAGCCGAAC GCGCGCAAGC GGAAGAAGAA GCTCGAAGGT TGGCGGAAGA GGACGCGCTC TTCGCTGAAA ATGCCGAGTA TCAAAGACGC GCAGAGGAAG AGCAGCGCCT GCGCGCAGAG GAAGAGCAGC GCCTGCGCGC AGAGGAAGAG CAGCGCCTGC GCGCAGAGGA AGAGCAGCGC CTGCGCGCAG AGGAAGAGCA GCGCCTGCGC GCAGAGGAAG AGCAGCGCCT GCGCGCAGAG GAAGAGCGAC GATGGGCTAT GGAAGCCGAG GCTGAGCGCG CGCGCATCGA AGAAATTGAA AAGGCCCGAG CTCATGAAGC CGAAGCTGCG CGTAGAGCCG TGGAAGATGA AGACGTGGCG GCGGCGCAAA TCGCTTCGAT TGCACAGCAA GAACGTCAAC AGCTGGCAGA AGAGGAAGCG ATTAGAGCCG CTCAGGAAGA AGAGGAGCGA CAGCGATTAG AAGATGAAAA TCTTCGCACA TCTGAACACG AGGCTCGATT GCAAGAGGAA CGAGAACTTC AAGCCGAAGA GAACGCGAAG 
GCGGCTGCAC GCGGCGACAT TGATGTGTAC GCTAAAGCAG TCATCACCGA GGGCGCGTCA TCTGTCTTAT TTTCGGACTC TGCTGACGAC GTGACTGCGT TTGGTACACC GTCATCGCAC AGCGACGACG CGTTCTACAC TCCAGCCACT CGCGGGACGC GTTTCGCGGA TTCGGTGCTT AAATCATCTC GCAATCGCGA TTTGGAACAC GAGATCGTGA GTATGTCCAT TTGTGATATT CTCATTCACA GCACTGCGGA GGATTCGAGC TTCACTACCG CTCTTGGTCT TCAAGAACGA ATTGCGAGCG TGCGCGCCAC GCTCGGAGAG CGCGAGTCTC AACTCGAGTT CAACAGAATC ACGGACGCTT TCGGTGTCGC GATTAAAGGG GCGATGCATA ATCCAGCACG ACTCGTGTTC TTATGTGCTC AGCTTATCGC GCTGCGAATT TGCGTGGCCA CGATGGATGA CTTGGACACG CGCGACGTGA TCGAACTCGA AGTCTTGGCT CGAAACGCCG CGTTTGAGTC GCTCTGGAAG CACACTTGCA GTGCATTAGT AAACCCTGGC GAAGTGACGG AAACGCTCGC ACACTTCATG AAATCCTTCT GTGGGCCTTC GCCGAATGGA GATGGTGAGA AGATAGGTCG CGCCTGGTCT GCAATGTTTC AGCTCGCCAA GACGCGACTC GACATCATCG GTGGCGACGC CGACGACGCC GGCTGCTCTT CGCAACTACT ATTGACGCAG CTCCGACAAG GCATCTTGAA GGAGATTATT CTTGCACTTG ACAAGTCTGT ATTAGATGCA TTGATTCATC CGTCTGGCGA TGCCCTGGCA AACCCGATGA TACCCGGTGG CGGCGCGTTG ACGTTTTCAG CGGGTGCCGA ATTGAAGCGA GCAATTTCTG TACTCGCCAG CGTCGCCAAA GATCTCAACG TTGGCACGAG TACTGAATCG ATCATCCCGA AACTCAGAGC CGTCGCAGAT GTGTGCATGA TTCCGAAGGA CGCATTGATT GACGTCAAGC TTCGCACGGA TATCGTGTGC GGCAAACTCA CGGACGAGGA ACTTGCCAGC GTCGTCTCGC GATTCCGCCC TGACGATTTC GCGCCCCAAC CCGTGGACCC GGACGTCATC TCCGCCGTCG TCGACGCGGC GACGAATGGA AAGGGTGACA CGCCCCCCGC GATCGGACCT TACACCCCAA TGAGCACGGA AGGCGCGCCG TGGATCGCCA ACTTGGCCCG AGCGCTCGCC GCTTTCGACG GCGTTTTGCA GTCGCGAGCG CCCGGTCCCA GCGCGCACGC CACGCGTTGG TCCCTAGTCG CCGACGCTCT GCCTTAAGAA AAGCGATGAA GCGTTGAACA AAGGCGCGTC ATTCACCATA GCGACGCGTT ATTC
Protein sequence | MLKKLGSRKD RVVTFVVTVR TLEPWPTRAS GRPGQFAIGW QRGANKRGTT PVRAGERSDD GARATYAFDH TFEVEATVRR AGKTGHKEKT LTLYVLALPE DATRGREVTA AKVGACDVDL AKYVDRTEDE TIMIDVECGE GVRRAVGTPK LSISVRAKEG GANAEGREGE ANAANASPLK SPTASERGGE YQWASSRFQS EQASSSQADQ VEALTSMASM FKKRAAPVEE TSVVDDVVDE GSEATANDEA DVEANTARAE LIVARAATPK SDEFSSTPDG TPALEQEDPE LTRARDELFG APPKDAATSP RGDVDSDGFL LDSDLDTEGE ADEETPAEFT RPVEESPARD DSENAAEQAR LAEEEARIRA EEDAAVARIE AERKAFEEEE RQLEEQARLE AQRAEEERVR VDEEARYARM EAERAQAEEE ARRLAEEDAL FAENAEYQRR AEEEQRLRAE EEQRLRAEEE QRLRAEEEQR LRAEEEQRLR AEEEQRLRAE EERRWAMEAE AERARIEEIE KARAHEAEAA RRAVEDEDVA AAQIASIAQQ ERQQLAEEEA IRAAQEEEER QRLEDENLRT SEHEARLQEE RELQAEENAK AAARGDIDVY AKAVITEGAS SVLFSDSADD VTAFGTPSSH SDDAFYTPAT RGTRFADSVL KSSRNRDLEH EIVSMSICDI LIHSTAEDSS FTTALGLQER IASVRATLGE RESQLEFNRI TDAFGVAIKG AMHNPARLVF LCAQLIALRI CVATMDDLDT RDVIELEVLA RNAAFESLWK HTCSALVNPG EVTETLAHFM KSFCGPSPNG DGEKIGRAWS AMFQLAKTRL DIIGGDADDA GCSSQLLLTQ LRQGILKEII LALDKSVLDA LIHPSGDALA NPMIPGGGAL TFSAGAELKR AISVLASVAK DLNVGTSTES IIPKLRAVAD VCMIPKDALI DVKLRTDIVC GKLTDEELAS VVSRFRPDDF APQPVDPDVI SAVVDAATNG KGDTPPAIGP YTPMSTEGAP WIANLARALA AFDGVLQSRA PGPSAHATRW SLVADALP
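The sequences above are displayed in whitespace-separated blocks of 10 characters, so they need to be joined before lengths or base composition are computed. The following is a hedged Python sketch of that step. The helper names `strip_blocks` and `gc_fraction` are hypothetical, and only short prefixes of the sequences are used here for illustration.

```python
def strip_blocks(seq: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(seq.split())


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    cleaned = strip_blocks(dna).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First three blocks of the gene sequence and first two blocks of the
# protein sequence, copied from the record for illustration only.
dna_prefix = "ATGCTGAAGA AACTCGGCAG TCGCAAAGAT"
protein_prefix = "MLKKLGSRKD RVVTFVVTVR"

print(len(strip_blocks(dna_prefix)))       # 30
print(round(gc_fraction(dna_prefix), 3))   # 0.467
print(len(strip_blocks(protein_prefix)))   # 20
```

Applied to the full gene and protein sequences, the same helpers should give values that can be compared with the Gene Length, GC content, and Protein Length fields of the record.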