Gene Information |
Locus tag | OSTLU_37691 |
Symbol | |
ID | 5006074 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009370 |
Strand | - |
Start bp | 310752 |
End bp | 312827 |
Gene Length | 2076 bp |
Protein Length | 691 aa |
Translation table | |
GC content | 61% |
IMG OID | 640421495 |
Product | predicted protein |
Protein accession | XP_001422034 |
Protein GI | 145355574 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0464] ATPases of the AAA+ class |
TIGRFAM ID | |
|
|
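The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the unspliced CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 310752
end_bp = 312827
protein_length_aa = 691

# Assumption: coordinates are 1-based and inclusive, so both end
# positions count toward the length.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the last codon is a stop codon and encodes no residue.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 2076
print(remainder)                # 0
print(expected_protein_length)  # 691
```

Under these assumptions the computed values agree with the reported Gene Length (2076 bp) and Protein Length (691 aa).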
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.556949 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0104163 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATCG AACGCCTGGA GAAAAGCAAG GATAAGAAGC GAGGGAAAAA AGGTAAGGGG CATCGTCGCA AGCGGAAGGG TGACCGAGAC AGCGACAGTG AGAGTTTGGA TGAGGATGAG TTTGGACGAA CGACGAGCGC GAAGGCGTTT GCGCAAATGA CGTCGCCTCG CTCGGTGAAG TTGTCGGATT TAGGCGGGAT CGAGGACTCG CTGAAGGACA TCAAGGAGCT CATACTGTGC CCTTTGATGC ATCCCGAGTT GTACGCATGG CTCGGGGTCG ATCCGCCGCG CGGGGTGTTG CTTCACGGCC CGCCGGGTTG CGGCAAGACG ACGTTGGCGC ACGCCATCGC GCAAGAGGCG AAAGTGCCCT TCTTCTCCAT AGCGGCCACG GAGATTGTGA GTGGAATGAG TGGCGAGTCT GAAGCGAAGA TACGTGAGTT GTTCCAGTCC GCCGCCGCGC ACGCGCCGTC GCTGATTTTC ATCGACGAGA TCGACGCAAT CGTCCCGAAG CGCGAGAGCG CGCAGCGAGA GATGGAACGT CGAATTGTCG CCCAGCTGTT AGCGTCCATG GACGATCTTC AATCAACAAT CGATGGCACG GACGAGGTGG ATCGACTAGC GCGGTGTCGT CGCCACGTCA CCGTCATCGG CGCCACGAAT AGACCGGACG GTATGGACGC CGCGCTTCGT CGCGCCGGAC GCTTTGATCG CGAAATCATG CTCGGCATTC CAGACGAAGC CGCGCGAGAG CGCATTTTGC GAGTGCAGGC GACCAAGCTT CGCTTGAATG GAGATTTAGA CTTGCGCGAA ATCGCAAAGA AAACGCCCGG CTATGTCGGC GCGGATTTAT CGGCGTTGGC CAAGGAAGCC GCCGCGTCGG CGGTCACGCG CATCTTTAAA AAGCTCGAGG ACGAGGAAAG GGCGAGCGCG GATGTGACGA TGGACGAGGG TGTCGCGCCC GCACTGGGGG GGGACACTCG TCTCGCGACT GGTCGCTTAG CGGATCCGCG TCCGCTCACC GAGGACGAAC TCGAGGATCT AGCAATCACC ATGGAAGATT TCTCCCTCGC GCTCACGCGC GTGCAACCGT CGGCGCAACG CGAAGGTTTC ACCACGACGC CGAACGTGAC TTGGGACGAC GTTGGCTCGC TCACAGAAAT TCGCGAAGAG TTGAAGTTCT CCATTGCTGA GCCCATCGCT CATCCCGAGC GATTCCAAGC GATGGGTTTG AACATCTCTA CGGGCGTCTT GCTCTACGGC CCACCGGGGT GCGGCAAAAC GCTCGTCGCC AAGGCGACGG CGAACGAGGC GATGGCGAAT TTCATATCCA TCAAAGGTCC AGAGTTATTA AATAAGTACG TCGGTGAGAG CGAGCGCGCG GTGCGGACGC TGTTCCAGCG CGCACGAAGT GCGAGCCCGT GCGTGTTATT CTTTGACGAG ATGGATTCTC TGGCGCCGCG TCGCGGAAGC GGCGGCGACA ACACCTCAGC CGAGCGCGTC GTGAACCAAC TTCTCACCGA GATGGACGGT CTCGAAGCGC GAAACGCGAC GTTCTTGATC GCGGCGACGA ACCGACCCGA CATGATCGAT CCAGCGATGC TGCGTCCCGG GCGCTTGGAC AAGCTCTTGT ACGTTCCGTT GCCGCCGCCG GACGGCCGAG TCGCCATCTT GAAGACGCTC ACGCGCCGAA CGCCCATCGC ACCAGACGTA CGCGTGGATC AAATCGCGCT CGGTCGATCG TGCGAAGGCT TCAGCGGCGC CGACTTGGCG GCGCTCGTGC GCGAAGCGTG CGTGGCGGCG 
TTGAAATCGA TGACGCTCGA ATCGACGCCG ACGGTGACGA CGAAGCACTT CGAAGAGGCG TTCACGAAGG TGCAACCCTC GGTGAGCAAG TCGGATCACG CGCGTTACGA TGAATTGCGT CGAAAGCTCC GTCGCGAGCG CGGGACGATC AACAGCGCGC GCCGCTCTTC CTCCGCCGAA AATCTCGCCG TCGAGCCCGC GTCCAACAAG CGCGTTCGCC CCGGCGACGA CGACGACCGC GACGACCGCG ACGCGCCCGA ATTAGCCACC TCTTAG
|
Protein sequence | MKIERLEKSK DKKRGKKGKG HRRKRKGDRD SDSESLDEDE FGRTTSAKAF AQMTSPRSVK LSDLGGIEDS LKDIKELILC PLMHPELYAW LGVDPPRGVL LHGPPGCGKT TLAHAIAQEA KVPFFSIAAT EIVSGMSGES EAKIRELFQS AAAHAPSLIF IDEIDAIVPK RESAQREMER RIVAQLLASM DDLQSTIDGT DEVDRLARCR RHVTVIGATN RPDGMDAALR RAGRFDREIM LGIPDEAARE RILRVQATKL RLNGDLDLRE IAKKTPGYVG ADLSALAKEA AASAVTRIFK KLEDEERASA DVTMDEGVAP ALGGDTRLAT GRLADPRPLT EDELEDLAIT MEDFSLALTR VQPSAQREGF TTTPNVTWDD VGSLTEIREE LKFSIAEPIA HPERFQAMGL NISTGVLLYG PPGCGKTLVA KATANEAMAN FISIKGPELL NKYVGESERA VRTLFQRARS ASPCVLFFDE MDSLAPRRGS GGDNTSAERV VNQLLTEMDG LEARNATFLI AATNRPDMID PAMLRPGRLD KLLYVPLPPP DGRVAILKTL TRRTPIAPDV RVDQIALGRS CEGFSGADLA ALVREACVAA LKSMTLESTP TVTTKHFEEA FTKVQPSVSK SDHARYDELR RKLRRERGTI NSARRSSSAE NLAVEPASNK RVRPGDDDDR DDRDAPELAT S
|
| |
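The sequences above are printed in space-separated blocks of 10 characters, so they need to be normalized before any downstream use. The sketch below, with the hypothetical helper names `clean_sequence` and `gc_fraction`, strips the whitespace and computes a GC fraction. The example input is only the first three blocks of the gene sequence, so its GC fraction is not expected to match the 61% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in ("G", "C"))
    return gc_count / len(seq)


# First three 10-base blocks of the gene sequence, as printed in the record.
raw_fragment = "ATGAAAATCG AACGCCTGGA GAAAAGCAAG"
fragment = clean_sequence(raw_fragment)

print(len(fragment))
print(round(gc_fraction(fragment), 3))
```

To reproduce the record's GC content figure, the whole gene sequence would be passed through `clean_sequence` first, including the part that continues on the following line.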