Gene Information |
Locus tag | OSTLU_17449 |
Symbol | |
ID | 5004441 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | - |
Start bp | 549750 |
End bp | 552104 |
Gene Length | 2355 bp |
Protein Length | 717 aa |
Translation table | |
GC content | 64% |
IMG OID | 640419862 |
Product | predicted protein |
Protein accession | XP_001420545 |
Protein GI | 145352417 |
COG category | [R] General function prediction only |
COG ID | [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGCCA AGGCCCAGGG CGCGCGCGAC GCCGCGCCCG TGCCGAGCGG CAAGGCGCGC GGCGCGAGCG CGAGCGCGAA GCGCGCGAAG GAGGACGCGA GGAAGCGCGC GGGGAAGCGA GGGGCGGCGG CGACGGCGCG CGACGACGGC GCGACGACGG CGCGCGAGGC GAGCGGCGTG CGAGGGTACT GGCAGGCACC GGAGGAACAG GCGCTGAAGC GCGCGGTGCG GAAACACGGG ATCGGGGCGT GGGAGAAGAT GCGAAATGAT CCGGAATTCG CGGCGTTGCG GTGCGCGCGC GACGCGAGCG AGCGAACGAA CGAACGAACG ACGCGCGCGA ACGAACGCGC GAACGAACGG CGCGGGACGA GCGATGGGAT GGGACGCGAG CGCGCGCGAG GCGCGTCGAG CGGGCGAGGG AACGACGCGG ACGAGGCGAG AGGGAGACGC GGGGACTGAC GAGGGGCGAA CGACGAACGG CGCGAACGCA GGTCGCGCAC GGGGGTGCAA TTGAAGGATA AGTGGCGAAA TTTGATCAAG TTTCAACACT TGCGCGTGGA TGAGGCGGCG AAGGTGCCTT ATAAAGCCGC GGTGAAGGCG GCGAACGCGG CGAACGCGAC GGGAGGGAAA GCGGGACGCG GCGCGGACAA GGGCGGCGCG AGGGGGGCGA GCGCGGCGAA CGCGAAGGGA GGAAAAGCTG CGAGCGCGGC GAGCGCGAGG AAGACGAATG CGGCGGTGAA TGCGAAGAAG GGTAAAGGCG CGAGTGCGTC GAGCGCGAAG AAGGGCGCGG GGGGGAAGAA AACGATGGCG GTGAGTCGAG ACGCGTTGCC GAAGAAGGGC GGCGGCGGAA CGACAAACGT CGCGGGCGCG CCGACGATGA GCGCCAAGGA GAAGCTCAAA GCCAGGTTCG GGCACAGGCA CGACGATGAT GATCAAGACG AGGACGATGA GTACGAAGAA ATCGCAGGCG TCGGTCGACC ATCGACAGGG GCCAAGGCAA AATTCGACGC CGGCGTGGAC GCCGTGGTGA CCCAGCTCAA GGCGAAACGC GCGGGCATGG TGAACGAAGT CGAACACGCC GAGAGCGAAC TCGCGAGGAT GAAGACGGTC GTAGAAGAAG CGGAAGCGGT GTACTCGTCC GCGCGCGCGC GCGTGCACGA GGCGCTCGTG GGCGCGCACG GGGAGGATGA GTACTACGAG GACGGCGAGG GCGAAGACAC CGCCGACTGG GACAAGTACT TGCACTTTGG CGAGCACGAG GACGAAGACG ACGACGAAGA GGTGGCTCGT GCCGCTGAAG CCGCGGCAGA AGCGCACGAA GCGGTGAAAC GGGGCATTAA GAAGCCGAGC GAAAAGAAGC CGAGCGAAAA GAGGCCAAAG GCGGCGGTGG ACGAAGACGC CACGGATCAC GACGACGACA GCGATGAGGA AATCGACGAA GATCTCGTAG ATAGCGACGA GGACGACGAC GATTTAGATG AGGAGGAACG CGCGGTGAGG ATTGCAAACG AGGTGCTCGC CGAGGCTGGG TTACCTCCCA TGCACGAAAT TGAACACGTG TTGCTCACCG AGCTCAAAAA GCTTGAAGCC GCGAACGCCG CAGTCGTGCA CGCGCGCCTG GCGTTGGAGG CCGTAGACGC AGAGTTTGCC GAAGCACTCG TCAGCGCACA CAACAATGCG GCGGCAAACG CGAAAGCGAC GACGTCGGCA GCTTTTGACC ACGACGACGA TTTAGAGAAG AGTTTCACCA TGGATGGCGA CGAAGACGAA GACGACGAAG ACGAAGACGA CGAAGTCGAT 
GACGAAGACG ACGAAGAAAT CGACGAAGAT TTGGTCGACG ACGAGGAGAT CGAAGAGGAG ATCAGGCCGC AACCCACTTC AAAGTCGAAG GCAAAACCAA AGGCAAAGTC GGCGCCCTCT CAAAACGACG GCGCAGACGC GGCGGGTGGG CGAGTCGTTG AAGACCGGAG ATACGGTATA TACGACGCTC GGCGCTATCG TTCGGAGGCT GAAAACGACG CCGCCGAGGC TGCGGCGTCC AAGGGTGGTA AAGTGTCCGC AGCCAGTAAA CGTAAAGCGC CGACGAGCTC GAGCGCGGCG GCACCCAAAA AGAGCGCCAA GCAGACTGTC AAGGTTGAAG ACACTGAAAT ATGGTACGAA CAGCCAAATG TCGCCAAAGC TGGTGCGGCG GTGATGCCAA AGAAATCGCA ACAGTTTCGA GCGGCACCGC AGTCGGCTCC TGGATCCATT CCTTCGTGGC GACGCAAGCG AGCAGCGACT CGCGTCACCA TCGGCCGTCC CGTCGCGGGC CCCACGTCTT GGGCCGCGAT CGGGCAACAC ATTATAAAAG AGTAA
|
Protein sequence | MAAKAQGARD AAPVPSGKAR GASASAKRAK EDARKRAGKR GAAATARDDG ATTAREASGV RGYWQAPEEQ ALKRAVRKHG IGAWEKMRND PEFAALRSRT GVQLKDKWRN LIKFQHLRVD EAAKVPYKAA VKAANAANAT GGKAGRGADK GGARGASAAN AKGGKAASAA SARKTNAAVN AKKGKGASAS SAKKGAGGKK TMAVSRDALP KKGGGGTTNV AGAPTMSAKE KLKARFGHRH DDDDQDEDDE YEEIAGVGRP STGAKAKFDA GVDAVVTQLK AKRAGMVNEV EHAESELARM KTVVEEAEAV YSSARARVHE ALVGAHGEDE YYEDGEGEDT ADWDKYLHFG EHEDEDDDEE VARAAEAAAE AHEAVKRGIK KPSEKKPSEK RPKAAVDEDA TDHDDDSDEE IDEDLVDSDE DDDDLDEEER AVRIANEVLA EAGLPPMHEI EHVLLTELKK LEAANAAVVH ARLALEAVDA EFAEALVSAH NNAAANAKAT TSAAFDHDDD LEKSFTMDGD EDEDDEDEDD EVDDEDDEEI DEDLVDDEEI EEEIRPQPTS KSKAKPKAKS APSQNDGADA AGGRVVEDRR YGIYDARRYR SEAENDAAEA AASKGGKVSA ASKRKAPTSS SAAAPKKSAK QTVKVEDTEI WYEQPNVAKA GAAVMPKKSQ QFRAAPQSAP GSIPSWRRKR AATRVTIGRP VAGPTSWAAI GQHIIKE
|
| |
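The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence includes the stop codon (the listed sequence ends in TAA). The function names `gene_length` and `coding_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table of this record.
START_BP = 549750
END_BP = 552104
PROTEIN_LENGTH_AA = 717


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa, include_stop=True):
    """Coding bases needed for a protein: 3 per residue, plus the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


length = gene_length(START_BP, END_BP)       # 2355, matches "Gene Length"
cds = coding_length(PROTEIN_LENGTH_AA)       # 2154 coding bases incl. stop
non_coding = length - cds                    # 201 bp not accounted for by codons

print(length, cds, non_coding)
```

Under these assumptions the 201 bp difference is consistent with the "Is gene spliced | Yes" field, although the arithmetic alone says nothing about how many introns there are or where they lie.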