Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_16944 |
Symbol | |
ID | 5004244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 21942 |
End bp | 24869 |
Gene Length | 2928 bp |
Protein Length | 975 aa |
Translation table | |
GC content | 52% |
IMG OID | 640419665 |
Product | predicted protein |
Protein accession | XP_001420056 |
Protein GI | 145351375 |
COG category | [R] General function prediction only |
COG ID | [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0730673 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAAGA AGAAGCGCCC GTCCAATCCC GGCGGCAGCG GGTCGAAGCG AAAGTTTTTC CGCGCCGATG GCGACGCGAG CGGTCCCAGC CCGAGCGGTG TGACGGTGAA GAGCGCGTTC GAGACTGTCA GCCATCGACG AAAATTCGAC ATTCTCGGTC GCAAGGTGAA GGGCGAGCGA GGTAACGTGC TGCAAGCGCG CACGGAGGCT ATTGAAAAAC GTCGGCGCAC GCTGCTCGTG GAACACGAAG CGAACGGCAA GGCGAACGCA TTCTTGGACC GACGATTCGG TGAACACGAC AGTGGCTTAA GCGTCGAAGA GAAAAATATT GGAAGATTGG CCAAGGCGCG GTTGCGTCAG TATAAGAAAT CCAAAGCCTT CGCCCTGAAT GAAGACGATG ACGACGACGG CTTTAAAACG CTCACGCATT TGGGACAGCC GCTCGGGGAG AGAGATTTAA CCGCGAAACC GTCGCACGGG GACGAGGAAG ACGATGAAAA TCTAGACGAC GAAATCACGC GTAATTTCCA TTTCGGTGGA GGCGAATTTG AACCAACTTT GAAGCGAGGC GATGGGGAGC ACGATGTGGA TGCGCCGGAA CGCCGAAAGA GCAAAAAGGA TGTGATGGAG GAACTCATAG CCAAGAGTAA GTTCTACAAA GCTGAGAAGC AGAAGCAGCG AGATGACGAT GGAGATATGT TGGATAAGTT GGACTCCGAT TTTAAGGCGA TAAGTCAAGG TGGACTATTG TCGAGCGCGT TGAGAAAAGC TGTCGGGCAC ATGAAGCCGA CCGCCGCGAA GGCTGCACAA ACCAAGGCTC CCGAGCAGAA GGATGAATAC GACACGTGGG CCCGAACTTT AGCGTTCGAA AGACGTGGTC AGGCGGGTGA TAGAGCAAAG ACGCAGGAAG AAGTTGAGGC GCAAGCAAAG CTCGCGCTAG AGCAAGCCGA ACGCAAGAGG TTGAAGCGCA TGCGCGACGT CGGCTCTGAC GATGAAAGTT CCGACGACGA CGGCCCTCAG GGAGGATACG CCGCTCGAAG GCGCAAGGTG AAAAAGGGTG ACATCGACGA AACAGACGAC GCATTACACG GCAATAATCG CAAGCAGCAT GAAGGAGGTG AAGATTTGGA CGAGAATTTT CAGCTCGAAA GTGATGACGA TGGAGAGAAC GAGGACGAGG GTTCCGAAGA GGATGTGTCG GACGACGAAT CTGAATCAGA AGATGACGAC GCGCTCGATG ACGCAGCGCG TCTGCGCAAG TCACTCAAGT CTGAGCTAAA AAATGTCGAT ACAGAATTAG ATCAAGGTAA GAATCGACTT CGAAAGTTGG GTATTCTTCA AGATGGCGTC GAGGCTGAGG ATTCGGAGGA CGATGAGGAC GACGACGATG AGGATGAAGA CGACGACGAT GAGGGCAGCG ACGATTCTCA GGAAGTAGAA GATGCACATG CAGATGTTTT GCGAGAGATT GAAGACGAGG AAGACGTTGA AAAAGAAGAT GCGTCGAACA CAGCGCGCAA ATCTTCCCAA AAGGAAGCGT CCACGGCGAA AGAGAAGAAG TTTTCTACGC CGACGAGAAC GGACATCCCC TTCACTTTCC CCATGCCAGA GAGCGCCGAA GATCTTAATT CTATATTAGG TGATCATAAC GCTGAAGATG CGTTTACGAT CATCACGCGA ATCCGGGCCT GCAACGCGCC CACGCTCGCG GCGGAGAATA GAAAGCGAAT CCAGACTTTG CTCGGTCTTC TGTTACAACG GTTTGAGATT TTAGCCGGTC AAGCGCCGTT GCCGGTAGAT CATTTGGACG TTCTATCGAA ACACATTGTA GACTTGAGTA CACAAGTTCC ATTTTTCGCT GCAACCGCAG CCAAGGCCCG CGTAGAAAAG ATGAGCACTC GGCTTCGTCA GGCGCTGCGT GCCGGGGAGA CGGGATGGCC GCCTTCTCGT ACGGTTCTGC TCCTTTCTTT ATTCGCCGCC ATCTTTCCTA CGACGGATAA GTCACACCCA GTTATGACTT CGGCGACACT TTATATTGGA AACTTACTCG CGCATTGCGC GATCAGATCA GTGAGAGATG CAGCTTTGGC GGTCATCCTT TCTACAATGG CAAGCGTTTA TTCGACTGGC GCCGAAAGGA TATTTCCTGA AGCCCTCACT CTCATGAATG CTTTGATTCA TTGCGCATCG CGATCAAAGA CCAATTGGGC GGCGGGTTTG TCAACACACC TAGTCGAACA AGTTGGCGGA CCCTGGTTAT CTTCAGCGCT TACGTCTGCG ATGGAACCCA TGACGTTGCC AGAAATGTTG GATGGCATTT ACGCCGAAAA ACTCGAAGAG AAGAAGTTAT CGGCCGCAAC ACTAAGGGCT GCTCTTTCTT GCTTAAGGCA ACTATCTAAA CCAGTGATAA AGACTGCGTC AGCGTCTGAG ATTCTCTCAC CTGTTCGTGA TTCCGTGAAG GCGCTTAGAA AATCGTTGAA AAAGTCGAAC GGTGGTTTAG CAGAGTTGTG CGACGAGCTC GTCAAGGAGT TAGACGACGC TCTCGTGGGC GCGGTCAAAA CGCCTTTGGC GTACCACACC AAGACGGCGG AGGCAATCAA AACGTTCAAT CCCATGTACG AGGAAGACGG CTATCAAAAA GGTCGGGACT ACGATCCGAA TCGCGAACGA GCCGAGGCAA GAAAGCTCAA GAAGCAAGTC AAACAAGAAA CTCGCGGCGC CATGCGCGAA TTGCGCAAGG ACAATCGATT TATGGCAGAT GCTCGTTCGA AGGAACAGTT CCAAGCCGCG GAAGAACGCG GGGCTCGCCA GAAAGACATT TTATCATTCT TGGAAAAGCA AGAAGCCGAT TTCAAATCTG GAGGACAAGG TGGCCAAATT GTCAAGAACA AACGCCGCGT GTCGAAAGGC TCGAGACGAG CCTTCTAG
|
Protein sequence | MAKKKRPSNP GGSGSKRKFF RADGDASGPS PSGVTVKSAF ETVSHRRKFD ILGRKVKGER GNVLQARTEA IEKRRRTLLV EHEANGKANA FLDRRFGEHD SGLSVEEKNI GRLAKARLRQ YKKSKAFALN EDDDDDGFKT LTHLGQPLGE RDLTAKPSHG DEEDDENLDD EITRNFHFGG GEFEPTLKRG DGEHDVDAPE RRKSKKDVME ELIAKSKFYK AEKQKQRDDD GDMLDKLDSD FKAISQGGLL SSALRKAVGH MKPTAAKAAQ TKAPEQKDEY DTWARTLAFE RRGQAGDRAK TQEEVEAQAK LALEQAERKR LKRMRDVGSD DESSDDDGPQ GGYAARRRKV KKGDIDETDD ALHGNNRKQH EGGEDLDENF QLESDDDGEN EDEGSEEDVS DDESESEDDD ALDDAARLRK SLKSELKNVD TELDQGKNRL RKLGILQDGV EAEDSEDDED DDDEDEDDDD EGSDDSQEVE DAHADVLREI EDEEDVEKED ASNTARKSSQ KEASTAKEKK FSTPTRTDIP FTFPMPESAE DLNSILGDHN AEDAFTIITR IRACNAPTLA AENRKRIQTL LGLLLQRFEI LAGQAPLPVD HLDVLSKHIV DLSTQVPFFA ATAAKARVEK MSTRLRQALR AGETGWPPSR TVLLLSLFAA IFPTTDKSHP VMTSATLYIG NLLAHCAIRS VRDAALAVIL STMASVYSTG AERIFPEALT LMNALIHCAS RSKTNWAAGL STHLVEQVGG PWLSSALTSA MEPMTLPEML DGIYAEKLEE KKLSAATLRA ALSCLRQLSK PVIKTASASE ILSPVRDSVK ALRKSLKKSN GGLAELCDEL VKELDDALVG AVKTPLAYHT KTAEAIKTFN PMYEEDGYQK GRDYDPNRER AEARKLKKQV KQETRGAMRE LRKDNRFMAD ARSKEQFQAA EERGARQKDI LSFLEKQEAD FKSGGQGGQI VKNKRRVSKG SRRAF
|
| |