Gene Information |
Locus tag | OSTLU_34816 |
Symbol | |
ID | 5003528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | - |
Start bp | 632429 |
End bp | 634366 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | |
GC content | 60% |
IMG OID | 640418949 |
Product | predicted protein |
Protein accession | XP_001419862 |
Protein GI | 145350966 |
COG category | [K] Transcription |
COG ID | [COG0557] Exoribonuclease R |
TIGRFAM ID | [TIGR00358] VacB and RNase II family 3'-5' exoribonucleases |
|
|
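The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_length = feature_length(632429, 634366)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1938 645
```

Both results agree with the Gene Length (1938 bp) and Protein Length (645 aa) fields in the record.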
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0611822 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACGCT TCGCGCGAGA CGCGGCGGAG AGCGCGAAGG CGGATGCGTT ATTGCGAGGG GCGTGGGAAC TGATCGATGA GTTTGGGTTA GAGAGTGGGG AGGTGTGCGA TGCGACGTAC GTGAGCGAGT TGATGTTTGA AGACGACGCG AACGCGCCCG CGGGCGCGAG GGAATTCGCC GCGCATCGTC TGTTGAGCTC GCCGATGGGG GGTATTTATT TCAAACTGAA GGCGAAAGGG ACGTACGAAT CGCGCGCGGC GGATCAAATA GACGCGCTTA AGCGTAAAGC AGAAGCTGAA TCTCGCGCAG CAAAGGTGGA AGAAGCGTTT TTGGATGAGA TCAAAGCCGC CGCCGCGGCG CCCGCGGGAG TAAAGCCCGA TCGCGACGCG CTATGGAGAC CGAGCGACGA GGATAACGAC GCGCGCGTTC GTCGATTAGA GGCGCTGGAA GCGTACGCGC TCGGCGAGAA ATTCCATTCC GCCGGAGAGA AGGCGATGGC GGACGATTTA CTCGGCAAAC TAGGGTTCAC GCGCTCATCC GAGGGCGCGT TGAAGACGTT GATTTCAACC GGCACGTGGA GCCGACACGA AAATTTATCC GTGCGCAAGT ATGGGGTGCA GATTGATTTC CCCGAAGGTA CCGCAAAAGC GTGCGCGGAG GTGTTGTCTA ATGAACCAGT CGACGCCGAC GCCGCGTCGC GCGTGGATTT AACTTACCTA AGAGCGTACG CCATCGACGA CGCTGAAACG GTTGAGGTGG ACGACGCGGT GAGCGCCGAG GCGCTGGGCG ACGACGGACA GATTCGAGTG TGGGTGCACA TCGCCGATCC GACGCGATGG ATCCCTCTCG GGTCGCCGCT GGACGCCATC GCGCGTCAGC GCGCGACGAC GCTGTATTAC CCCACGGAGA TCGTCCCGAT GTTTCCTCTG GAAATCGCCG CGGGTCCGAT GTCTTTGGGC TCGAGGTCAG ACGTCGCGAG CGAGGCGATG ACGGTTCGCG CCGACATCGA TCGCGAGGGT AACATCATGG ATTTTGAAAT TATGCCGAGT TTCGTAAAGC TCGATCGACG TTGGACGTAC GATGAAGTCG ACGTCGAACT CGACAGCGCG ACGTGCGATG AAGGACTTCG ATTGTTGTAC AAGGTGGCGA GCGCGCGAGA CGAGCGTCGC GCCGAAGACG GTTCGGTGAC CATCATTCTT CCCGAGAATT CAGTCAACGT GCGAGGAGCC ACCGCGCGCG GAGGCGACGG CGACGTCGCT ATAACAATGT CAAAAATCAA TGGTCACACG CCGGCGCGCA TGCTCGTCTC CGAGCTCATG GTTCTCGTCG GCGACGTCGT CGCGCGCTTT GGCGTGAGAG AAAATATCCC GCTTCCTTTC CGAGGTCAGG GCGAGCCGCG CTTGATGTCC GACGACGAGT GGGATGGAAT CCCCGAAGGG ATATGTCAGG ATATGGCTAT GCGTTCGTGC ATGACGTCAT CGACGAGTGG CGCTACGCCC CGTCCGCACT CCGGTCTCGG CTTGAGCGCG TACGTGCAGT TTACGTCGCC GATTCGCAGG TACGCCGACG TCTTGGCGCA TCATCAAATC AAGGCGTACT TACGAGGCGA ACCGCTTCCA TTCGACGAAC AATCGATGGA AAACGTCATC GAAGACGTCG GTACCACCGT TGGTGGCGCG ATTCGCTCTC AACGCGAGAC GTCAAAGTAT TGGGCTTCGG CGTACTTTGA CGCCCAGCCC GCGGACGCGC GTTGGACGGC GACGGTGGTC AAGTTTATTC GCGGCGACGA TTTAGTTTTG 
GTCATCTTCG ACGACCTTGG CTACGAAACT GTGGTGAAGC TCGATCGCGG CGCGGTGTTG GGGGAGACGC TGACGCTAAA GTTTGTCGAC GCCGACCCGC ACGCCGGTTC GACAAATTTT GCTCGCGTCG AGGCGTAG
|
Protein sequence | MERFARDAAE SAKADALLRG AWELIDEFGL ESGEVCDATY VSELMFEDDA NAPAGAREFA AHRLLSSPMG GIYFKLKAKG TYESRAADQI DALKRKAEAE SRAAKVEEAF LDEIKAAAAA PAGVKPDRDA LWRPSDEDND ARVRRLEALE AYALGEKFHS AGEKAMADDL LGKLGFTRSS EGALKTLIST GTWSRHENLS VRKYGVQIDF PEGTAKACAE VLSNEPVDAD AASRVDLTYL RAYAIDDAET VEVDDAVSAE ALGDDGQIRV WVHIADPTRW IPLGSPLDAI ARQRATTLYY PTEIVPMFPL EIAAGPMSLG SRSDVASEAM TVRADIDREG NIMDFEIMPS FVKLDRRWTY DEVDVELDSA TCDEGLRLLY KVASARDERR AEDGSVTIIL PENSVNVRGA TARGGDGDVA ITMSKINGHT PARMLVSELM VLVGDVVARF GVRENIPLPF RGQGEPRLMS DDEWDGIPEG ICQDMAMRSC MTSSTSGATP RPHSGLGLSA YVQFTSPIRR YADVLAHHQI KAYLRGEPLP FDEQSMENVI EDVGTTVGGA IRSQRETSKY WASAYFDAQP ADARWTATVV KFIRGDDLVL VIFDDLGYET VVKLDRGAVL GETLTLKFVD ADPHAGSTNF ARVEA
|
| |
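The sequences above are displayed in space-separated blocks of ten characters. The gene sequence as shown starts with ATG and ends with TAG, and its first codons match the start of the protein sequence (MERF), which suggests it is already given in coding orientation even though the gene lies on the minus strand. The sketch below shows one way to strip the block formatting and compute a GC fraction. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence from the record (illustration only).
fragment = clean_sequence("ATGGAACGCT TCGCGCGAGA")
print(len(fragment))                    # 20
print(round(gc_fraction(fragment), 2))  # 0.6
```

Applying the same two functions to the full gene sequence would give a value to compare against the GC content field in the record.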