Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_24370 |
Symbol | |
ID | 5001214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 219574 |
End bp | 221846 |
Gene Length | 2273 bp |
Protein Length | 732 aa |
Translation table | |
GC content | 59% |
IMG OID | 640416635 |
Product | predicted protein |
Protein accession | XP_001417182 |
Protein GI | 145345361 |
COG category | [R] General function prediction only |
COG ID | [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.29712 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCGCGCGGC GCGCGGGCGA GCGTTGACGA CGACGATCGC AACGACGACG ATGACGACCG TGGACGCATC GATGCCCGCC GCGATGCCGA GCGGCGCCGC GACGACCACG GAAACGATCG ACGTTCCGGA CGTTGGCGCG GTCGACGACG ACGCGCGACG ACGCGCGACG ACGCGCGACG AGAGCGCGAG CGAGGACGAA GATGAGGAAT ATCTCGGACG GATCCGTCGA CGCGCGCGCA CGCGCGAGGA TGAGAGCGAG AGCGAGGAGG ACACGCGAGA GGGCGACGGG TCAGAGGACG GCGAAGGCGA GGCGCCGCGA GGGAAAGGCG AAGGACGACG ACGACGATTG AAACGCGCGA ATGGTGAACG CGTCGAACGA GACGTTGATG GAAGTGACTC GAGCGATTCG AGCGAGAGCG AATTGGATGA GGAGATGATG ATGGAACGTA AGTACGCCGA GACGGGGCGG TTGACGGATG AAGACGTCGA GGACGAGGCA GAGGAGACCG TTCGTGGAGC GTATGACGAA GTGGGTGTGG GATATTCGTC CGATTCGTCG CTTGGATCAC CCGTAGCGGG CGATGATGTG CCGGCGAGCG GTAGTCGGCC GAGGAAAATG ACGAAGAAGC AGATGAAGAA GGAACGAGAC GCGTTGGCCA AGGAACAGGA ACGCATGATG AAACGAGCGC AAAGACGGGC GAAATTCCCG GGCTGGGACG CCGAAGTGGT GCGCGTGTCT TATTTACCGT TGATTGACAA GCTTCGCGCT GCGGTGGCGC ACATCAAGCA CGACGGACTA GTTATGGGAG AAGACTCGCC GGCAAAAGAG GCCGACAAAG CCACAAAGGC GGACACGCCC ACCGTCGCCC CGCTTGCAGA CTCCGAAGAT GACGAAGCGG ACGACGGTGA ACCTAAAGCA AAGACAGCCG AAGTCGTAGA AATCGATCTC GATGACGACG AAGAAGAAGA TGACGATGCA TTACTCAAGG AAATCTTGGC AAAGAAAGCA GTCGCGGTGG CGCAAAAGCC ATCCGAAACG GTGATGACTG AGGAACCAAC GCTTGTCGAA CAAGAGGACG GAGAGGACGC GGATGACGAG TCCGAAGAAG ACAGCGAAGA TGATTTATCC GAGGAGGAAG ATATGACGGA AGAAGAGCGT CGAATGCAGC GTAAGGCGGC GAAGAGATTC ATCAAGGCGG ATAGAAGATC GCACAGAGCC GCCGCCACCA CGGGCGACGT CTTTGAAGAT GAAGCGGAGA TGTCCGAAGA TGGTGGGCAC ACCGACGACG ATGATGATGA TGATATTCAG GATGACGTTG ACGATGTCGC CGACGCTATC GATTTCCGCG AAGAACAGCC CGAAGACGAG CGCCGTGCCG CGGCTCGTGC GCGCGCCTTC GCAAAGGAGC AGCAAGCCCA AGACGACGAC GAGCTCGAAA AGATGAAGCA GATGGTTGGT AACGGTTTCA AACGCAAAAA GAATGGTCTG TTCGACTCTG AAGACGCGTG GCAGCGTAAG AGACGCAATG CTAACGGCGA AGAGGAATCT GATTCGGATG ACGTCGACTA TGGTCCCGTC ATCGAGCGTC CCGAAGAGGC CGTCGAATTA TCGGACGACG ACGACGGTGA GTGGCGCGAA CAAGCCAAGC GTCGTCGTGC CCTGCACGAA TCCGGTACAC AAGAGTCTCT TGAGCTACCA AATGCGTTCG AAGGTAACGT CAGTCAAGAA GTCTACGCCG CCATCAAGGC TCCTCGCATG AATTCGTTCC ACAGCGAGTC GCAAGACACC ACGCAAGCGG AAGGATGGGA AGCCCCGCTC CCTCGAGCGC AATCTATGCC CGCGGCTTTG GTCCGTACGT CGAGCCATCT CGGCAGCGGA AGTTCTGCAA TGCTTGCCCG TCAATCATCG AAAACGTTCT TGGGTAAGAA ACGACAAGTC ACCAAGGCGA CTGGCGCGCT GCTCGGCAAC AGCCAGGCTT CACGCTCATA CGTCTTCGGC CGAACGGATA GCCAAAGTCA ATGGGGAGGC GACGATTCGG GTCCCGCGAC CACGTTCAAG GAAATCGGTC GCGATGAAGA TGCGCGCGCT TTCGGTTCGA CGAACATGGG ACCAACGAGA CCGAACGCGT CCGAACCGAA GAAGAAGCCA TCCTTATTCG CGATGGTGAG CCAGACCGCG GGCGAGAACA ACGCCCGCCC TCGAGCGGAG GACGTTCAAA AAGCGATGAA GGCAGCGTCG GGGAAATGAT TTATGTAACG AAGATAAGCG CAC
|
Protein sequence | MTTVDASMPA AMPSGAATTT ETIDVPDVGA VDDDARRRAT TRDESASEDE DEEYLGRIRR RARTREDESE SEEDTREGDG SEDGEGEAPR GKGEGRRRRL KRANGERVER DVDGSDSSDS SESELDEEMM MERKYAETGR LTDEDVEDEA EETVRGAYDE VGVGYSSDSS LGSPVAGDDV PASGSRPRKM TKKQMKKERD ALAKEQERMM KRAQRRAKFP GWDAEVVRVS YLPLIDKLRA AVAHIKHDGL VMGEDSPAKE ADKATKADTP TVAPLADSED DEADDGEPKA KTAEVVEIDL DDDEEEDDDA LLKEILAKKA VAVAQKPSET VMTEEPTLVE QEDGEDADDE SEEDSEDDLS EEEDMTEEER RMQRKAAKRF IKADRRSHRA AATTGDVFED EAEMSEDGGH TDDDDDDDIQ DDVDDVADAI DFREEQPEDE RRAAARARAF AKEQQAQDDD ELEKMKQMVG NGFKRKKNGL FDSEDAWQRK RRNANGEEES DSDDVDYGPV IERPEEAVEL SDDDDGEWRE QAKRRRALHE SGTQESLELP NAFEGNVSQE VYAAIKAPRM NSFHSESQDT TQAEGWEAPL PRAQSMPAAL VRTSSHLGSG SSAMLARQSS KTFLGKKRQV TKATGALLGN SQASRSYVFG RTDSQSQWGG DDSGPATTFK EIGRDEDARA FGSTNMGPTR PNASEPKKKP SLFAMVSQTA GENNARPRAE DVQKAMKAAS GK
|
| |