Gene Information |
Locus tag | OSTLU_42712 |
Symbol | |
ID | 5003212 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 605067 |
End bp | 606689 |
Gene Length | 1623 bp |
Protein Length | 513 aa |
Translation table | |
GC content | 61% |
IMG OID | 640418633 |
Product | predicted protein |
Protein accession | XP_001419434 |
Protein GI | 145350046 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3118] Thioredoxin domain-containing protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.732004 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.929724 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCGCC CGACGGCGGT GCGCGGCGGT GGCGCCTTCG GCGCCCCGCA CGGGGCCCCG TTCGGCGGCG CGCACGCCCC CACGAGGCTC GGACGCGCGT TTCGAAGCGT AGACTTTTAT CGCAAACTCC CGCGCGACAT GACGGAGGGC ACGGTGAGCG GGAGCGTGAT ATCCATCTTC GCCGCGGTGT TGATGACGTT TCTGTTGCTC AGCGAACTGC GGAGTTACTC GTCGAGCTCG TTCGACACCA AGGTGGTGGT GGATCGGAGC GTGGATGGGG AATTGCTGCG AATCAACTTT AATCTTTCGT TTCCCGCGCT GTCGTGCGAG TTCGCGAGCG TGGACGTCGG CGACGCCCTG GGATTGAATC GGTTCAATCT CACGAAGACG GTGTTTAAGC GGGCGATCGA CGCGGACATG CGAGCGATCG GGCCGCTGCA GTGGGACCGA GCGGTGGACG AGGTGCTCAA GGCGAGCGAC GAGGAGACGA CGCGCGCCGA GGAGAGGGTG GCGCGGCACA AGGAGGCGCT CAAGGTGCTG CAGGAGTCGA ATAGGCCCGC GGATGGGCAC GCGCACGTGG TGTACGAGAT CGCGGATCTG GACGAACTGC AAGCGATGGT GAAGGATCCG ACGCACGCGG TGGTGTTGGT GAATTTCTAC GCGCCGTGGT GTCCGTGGTG TCAGAGGCTC GAGCCCGTGT ACGAAGCCGC GGGGCTCGCG GTGCACGAAA AGTACCCGCC CGGGACGAAG TCGCGCGTGC TGTTCACGAA GATTGATTGC GTGGTGCACG AAAAGTTTTG CATGGCGCAA GTCGTGACGG GATACCCCAC GATTCGCATT TTCACTCACG GCACCGACAT TTTGATGCAC GAGGGCAAGC GCGAGCACGC GTTTTACAAG GGTCCGCGCA CGGTGGATGG GCTGACGCAG TTTGTGGACA CGCTCGTGCC ACCGCCGGAG CCGGTGGGTG AGTCGAGCAT AGAGGCGGCG CAGGAGGAAA ACATGAAGCT TCGGCTTCCG GCGAGCGTCG ATATGCAAAA GCGCATCATC GGCCCGGGGT GCGCCATCAC CGGTTTCGTG CTCGTGAAGA AAGTTCCCGG GCACTTGTGG ATCAGCGCGT CCTCTCCGGA TCACTCGTTC CACGGTGAAA CGATGAACAT GACGCACGTC GTCAACCACT TTTACTTTGG ACATCAACTC AGCGACGAAC GTAGACGTTA CCTGGAAAAG TTTCACGCCG GAGAAAAAGC GGGCGACTGG CACGACAGAC TCGCGAGCGA GCGCTTCGTC TCCAACGCCG CGCACGTCTC TCACGAGCAC TATTTACAAA CCGTCCTCAC GACCATCACT CCGCGCGGGC GATACACCCT TCCGTTCAGC GTGTACGAGT ACACCCAGCA CTCTCACGCC GTGCACGAAC CGCTTCCAAA GGCAAAGTTT CATTACCAAC CGAGCCCGAT GCAAATCGTC GTCTCCGAGG AAAAGATGGC GTTTTACTCA TTCATCACCA GTCTCATGGC CATCATCGGC GGCGTGTACT CCGTCATGGG CATCGCCGAC GGCGTTTTGT TCAACTCACT CGCCCTCGTG CGCCGCAAGC TCGAGCTCGG CAAGCAAGGT TAA
|
Protein sequence | MQRPTAVRGG GAFGAPHGAP FGGAHAPTRL GRAFRSVDFY RKLPRDMTEG TVSGSVISIF AAVLMTFLLL SELRSYSSSS FDTKVVVDRS VDGELLRINF NLSFPALSCE FASVDVGDAL GLNRFNLTKT VFKRAIDADM RAIGPLQWDR AVDEVLKASD EETTRAEERI ADLDELQAMV KDPTHAVVLV NFYAPWCPWC QRLEPVYEAA GLAVHEKYPP GTKSRVLFTK IDCVVHEKFC MAQVVTGYPT IRIFTHGTDI LMHEGKREHA FYKGPRTVDG LTQFVDTLVP PPEPVGESSI EAAQEENMKL RLPASVDMQK RIIGPGCAIT GFVLVKKVPG HLWISASSPD HSFHGETMNM THVVNHFYFG HQLSDERRRY LEKFHAGEKA GDWHDRLASE RFVSNAAHVS HEHYLQTVLT TITPRGRYTL PFSVYEYTQH SHAVHEPLPK AKFHYQPSPM QIVVSEEKMA FYSFITSLMA IIGGVYSVMG IADGVLFNSL ALVRRKLELG KQG
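The length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of any database API. Under those assumptions the genomic span is 81 bp longer than the coding length implied by the 513 aa protein, which is consistent with the record marking the gene as spliced. The `gc_percent` helper can be applied to the gene sequence above (the whitespace between ten-base blocks is ignored) to compare against the reported GC content.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Coding bases implied by a protein length, assuming one stop codon."""
    return (protein_aa + 1) * 3


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Values copied from the record above.
START_BP, END_BP, PROTEIN_AA = 605067, 606689, 513

span = gene_length(START_BP, END_BP)   # genomic span, introns included
coding = coding_length(PROTEIN_AA)     # bases needed to encode the protein
print(span, coding, span - coding)     # prints: 1623 1542 81
```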