Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_41620 |
Symbol | |
ID | 5005054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | - |
Start bp | 523573 |
End bp | 526383 |
Gene Length | 2811 bp |
Protein Length | 936 aa |
Translation table | |
GC content | 58% |
IMG OID | 640420475 |
Product | predicted protein |
Protein accession | XP_001421175 |
Protein GI | 145353765 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0466] ATP-dependent Lon protease, bacterial type |
TIGRFAM ID | [TIGR00763] ATP-dependent protease La |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 136 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 78 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGCGA CGCGAGCGAT CGCGCGGAGA CTCGAACGAC ACGCGGCGCG ATGTAAAGGC GCACACGTCG CGCGCGCGGT GCGAGGGGCG AGAGCGCGAA CGACGTCCGC GCCGAGGGCG CTGTTGGACG CGCTCGGGGC GGGGAGGGGC GACGCGGACG CGTTCGGGAC GAGGACGAGA CGGACGAGGA ATGCGTTCGT GTCGAGCGTC GACGGCGATG GGTCGACGGG ATCGACGGGA TCGTCGTCGT CGTCGTCGTC GAGCGCGTTC GGTGATTCGG CGTCGTCGGG GGGGATCATG GTGAGCGCGT CGCACCCGAG TTCGCACCCG CAAGTGCTCG CGGTGCCTTT GCCGAGGCGG CCGCTCATGC CGGGGATCAT CATGCCGGTC AAAGTGACGG ACGAAAAGCT CATAGCCGAG CTTGAGGACA TGCGAAATCG TGGTCAAGCG TACGTGGGGG CGTTCTTGCA GCGAACGGAC GCCGCGTCGT CGGCGTCCAA GGGGGAGGGC GAAGACGTCT TCGACGCGCT CTCGGCGATG AAGCGCACGA CGACGTCGGT GGGTTTAGAC GGCGAGGAAA TGGTAGACGA AGACGAGGTT GATCCCGCGG ATCACATGCA CGACATAGGT ACGTTTGCGC AGGTGCATAA CATCGTGCGC CTGCCGACGG ATTCGACCAC GGGTGAAGAA TCGGCGACGC TGTTGCTCCT CGGCCACCGG CGTTTGCGAA AGCTCGGGAC GATGAAGCGG GATCCGATGG TGGTCAAGGT GGAACACCTC AAGGATGAGA AATTCGACGC CAACGATGAC ATCATTAAAG CCACGACGAA TGAGGTGGTG GCGACGATCA AAGATTTGCT CAAGACGAAT CCTTTGCACA AGGAGACCCT GCAGTATTTC GCTCAAAATT TCAACGACTT TCAAGATCCG CCAAAGCTCG CGGATTTGGG GGCGTCGATG TGCAGTGCCG ACGACGCGCA GTTGCAACAC GTGTTGGAGC TATTATCGGT GAAAGAACGC CTCGACGCGA CGCTCGAGTT GTTGAAGAAG GAGGTGGAAA TCGGCAAGCT CCAAGCCGAC ATCGGGAAAA AAGTTGAGGA GAAAATTTCA GGCGACCAGA GGCGTTACTT TTTGATGGAG CAGTTGAAAT CGATCAAGAA AGAGCTCGGT ATGGAGCGTG ACGACAAAAC CGCGCTCATC GAAAAGTTTA CGAAACGTTT CGAGCCCAAG CGCGCGAGCG TGCCGGAAGA CACCGCCAAG GTTATCGATG AAGAGCTTCA AAAGCTCGGC GGCCTCGAAC CGTCGTCGAG CGAATTCAAC GTCACTCGCA ACTATCTCGA GTGGCTCACG TCACTGCCGT GGGGCGTGTG CGGCGACGAA AAATTGGACA TATCTCACGC ACAAGAAGTG TTAGATAGCG ATCATTACGG CCTGGAGGAC GTCAAAGATC GCATCTTGGA ATTCATCGCC GTCGGGCAAC TTTTGGGGAC GACGCAAGGA AAAATCATCA CCATGGTCGG TCCGCCTGGG GTGGGGAAGA CATCCATCGG GCAATCGATC GCCAAGGCGC TCGGGCGTAA ATTCTATCGC TTTTCCGTCG GCGGTATGAG CGACGTGGCG GAGATCAAGG GCCATCGACG GACGTACGTT GGCGCGATGC CGGGCAAGCT GATTCAGTGC TTGAAATCCA CGGGTGTGTG CAATCCAGTG GTTTTGATTG ACGAAATCGA CAAGCTCGGA CGCGGTTATC AGGGCGATCC CGCGAGCGCG CTGCTCGAAC TACTCGATCC CGAGCAAAAC GGCACGTTTC TTGATCACTA CCTCGACGTC CCCGTCGACT TGAGCAAGGT TTTATTCGTG TGCACCGCCA ACGTGCTCGA CACGATTCCC GGGCCTTTGC TCGATCGCAT GGAAGTCGTG CGGTTGTCTG GATACATCAC CGACGAAAAA GTGCAAATCG CTCGAACGTA TTTGGAGAAA GCGGCGCGAG AAAAGAGTGG GCTGTCCGAC GTCGACGCGA GCATCACCGA CGCGGCGATG GGGAAACTCA TCGGCGACTA CTGCCGCGAA GCCGGCGTGC GGAACTTGCA AAAGCATCTC GAAAAGGTCT ATCGCAAGAT TGCCCTCAAG GTGGCTCGGG CGAAGAGTGC GGACGAAAAG CTCGACTCTA TCGTCGTCGA TGTCGATGAC TTGGTCGATT ACGTCGGTCA ACCACCGTTC GCGACCGACC GAATCTACGA CGTCACCCCG CCCGGAGTCG TCACCGGCTT GGCTTGGACG GCGATGGGCG GATCCACGCT TTACATCGAG TGCACGGCTA TCGATTCCGG CGACGGCAAG GGCGCGTTAA AGACGACCGG TCAACTCGGC GACGTCATGA AGGAATCGAG CACGATTGCG CACACGTTCA CGCGAGGGTT TTTGGAATTG AAGGATCCCG GCAACAAGTA TCTCGCCGAC ACGTCGCTTC ACGTTCACGT CCCCGCCGGG GCGACGCCGA AAGATGGACC GTCGGCGGGA ATCACGATCA CGACGAGCCT GTTATCGCTC GCCATGAACA AACCGGTAAA GCCTAATTTA GCCATGACGG GCGAGCTCAC GCTCACCGGT AGGGTGTTAC CGATCGGCGG CGTCAAGGAG AAGACGATCG CCGCGCGTCG AAGCGGGGTG AAAACCATCA TTTTCCCCGA AGGAAACAAG AAGGATTACG ACGAGCTTTC CGAAGACATT CGTGAAGGTT TGGACGCACA CTTTGTCTCG ACGTACGACG AAGTCTATCG CCAAGCGCTC GATTGGGAAG CGTCTTCGTG A
|
Protein sequence | MYATRAIARR LERHAARCKG AHVARAVRGA RARTTSAPRA LLDALGAGRG DADAFGTRTR RTRNAFVSSV DGDGSTGSTG SSSSSSSSAF GDSASSGGIM VSASHPSSHP QVLAVPLPRR PLMPGIIMPV KVTDEKLIAE LEDMRNRGQA YVGAFLQRTD AASSASKGEG EDVFDALSAM KRTTTSVGLD GEEMVDEDEV DPADHMHDIG TFAQVHNIVR LPTDSTTGEE SATLLLLGHR RLRKLGTMKR DPMVVKVEHL KDEKFDANDD IIKATTNEVV ATIKDLLKTN PLHKETLQYF AQNFNDFQDP PKLADLGASM CSADDAQLQH VLELLSVKER LDATLELLKK EVEIGKLQAD IGKKVEEKIS GDQRRYFLME QLKSIKKELG MERDDKTALI EKFTKRFEPK RASVPEDTAK VIDEELQKLG GLEPSSSEFN VTRNYLEWLT SLPWGVCGDE KLDISHAQEV LDSDHYGLED VKDRILEFIA VGQLLGTTQG KIITMVGPPG VGKTSIGQSI AKALGRKFYR FSVGGMSDVA EIKGHRRTYV GAMPGKLIQC LKSTGVCNPV VLIDEIDKLG RGYQGDPASA LLELLDPEQN GTFLDHYLDV PVDLSKVLFV CTANVLDTIP GPLLDRMEVV RLSGYITDEK VQIARTYLEK AAREKSGLSD VDASITDAAM GKLIGDYCRE AGVRNLQKHL EKVYRKIALK VARAKSADEK LDSIVVDVDD LVDYVGQPPF ATDRIYDVTP PGVVTGLAWT AMGGSTLYIE CTAIDSGDGK GALKTTGQLG DVMKESSTIA HTFTRGFLEL KDPGNKYLAD TSLHVHVPAG ATPKDGPSAG ITITTSLLSL AMNKPVKPNL AMTGELTLTG RVLPIGGVKE KTIAARRSGV KTIIFPEGNK KDYDELSEDI REGLDAHFVS TYDEVYRQAL DWEASS
|
| |