Gene Information |
Locus tag | OSTLU_5620 |
Symbol | |
ID | 5005690 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009369 |
Strand | - |
Start bp | 238577 |
End bp | 239665 |
Gene Length | 1089 bp |
Protein Length | 333 aa |
Translation table | |
GC content | 61% |
IMG OID | 640421111 |
Product | predicted protein |
Protein accession | XP_001421753 |
Protein GI | 145354983 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 0.469951 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0101798 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCATCA AGGGCCTGCC CGCCGTCCTC GAGCCGTACT GCGAGCGCGT GCACGTCGGT GAATACGCGC CCGGCACGCG ATGCGCGGTG GACGCCTACA GCTGGCTGCA CAAGGGCGCG TTCGGGTGCG TCGACGCGCT CGCGCCCGGA GGCGATCGCG CGTGGGAGCG ACGACCGGGC GCGACGGCGC CGTACGTGAA ATACGCCGTG CACCGAGCGA ACATGCTGAG GCATCACGGG ATCGAACCGG TGATCGTGTT CGACGGCGAC AGAGCGCCGG CGAAGCGAGG CGAGGAACGC GCGAGACGCG AGCGACGGGC GGCGCTGTTG GAGCGAGGAG AGCGGGCGCG CGCGGCGGGC GATAAGGAGG GAGCGTTTCG GGCGTTTTCG GGGGCGATCG ATGTGACGCC GGAGATGGCG AGGGAGCTGA TCGTGGCGCT GAAGAGGGAG AAATTCGAGT TCGTCGTCGC GCCGTACGAG GCTGACGCGA CGATTGCGTC GCTCGCTCTC ACGGCGAAGG AACGGGGGGG GGTAGATTTA GTGTTCACGG AAGATTCCGA TCTCGTGGCG TACGGGTGTC CGCGCGTAGT GTTTAAGTTG GAAAAATCCG GCGATGCGAA GGAGCTGAGG TTGGCGAGTT TGTTTGAAGG CGCCGCGCGC GCGACGACGA CGACGACTAC GGAAACGCCG AGCGATGAAA ACGTCGACGA CAACGCGATC GGGCGAGCGA ATAAACCGAA AAGCAAAGGT CCGCCGCCGC TGGATTTCAC TGGGTGGGAC TACGAATTAT TTCTAAGCTT GTGCGTGTTG TCGGGGTGCG ATTTCTTGGA CAACATTCGC GGCTTGGGTA TCAAAAAAAT GTACAATATT TTGAACAAAC ATCGATGTGT CGACGCGGTG TTCGCCGAAT TGAGGGCGAA TGAAAAAATT AAGGATTTGA TCGCGGAAGG GTACGAAGTG GAGTGGAGAA AGGCACGAAT GATTTTCAAG CACGCGTTGG TGTGGGACCC CCACGCCGGC GCGCTTCGAC ACCTCACGCC GGTTCCCGAG CATTGCGAGT TCGCGAACGA TTTGAGTTTT CTCGGGCCG
|
Protein sequence | MGIKGLPAVL EPYCERVHVG EYAPGTRCAV DAYSWLHKGA FGCVDALAPG GDRAWERRPG ATAPYVKYAV HRANMLRHHG IEPVIVFDGD RAPAKRGEER ARRERRAALL ERGERARAAG DKEGAFRAFS GAIDVTPEMA RELIVALKRE KFEFVVAPYE ADATIASLAL TAKERGGVDL VFTEDSDLVA YGCPRVVFKL EKSGDAKELR LATNKPKSKG PPPLDFTGWD YELFLSLCVL SGCDFLDNIR GLGIKKMYNI LNKHRCVDAV FAELRANEKI KDLIAEGYEV EWRKARMIFK HALVWDPHAG ALRHLTPVPE HCEFANDLSF LGP
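The coordinate and length fields in this record can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the source database: the helper names (`gene_length`, `gc_fraction`, `coding_bases`) are hypothetical, and it assumes Start bp and End bp are 1-based inclusive coordinates, which is consistent with the reported Gene Length of 1089 bp.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G/C among A/C/G/T characters; spaces and other symbols are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    return sum(b in "GC" for b in bases) / len(bases)


def coding_bases(protein_len_aa: int) -> int:
    """Bases needed to encode the protein, not counting a stop codon."""
    return 3 * protein_len_aa


# Values copied from the record above.
length = gene_length(238577, 239665)   # 1089, matches "Gene Length"
coding = coding_bases(333)             # 999 bases for 333 aa
non_coding = length - coding           # 90 bp within the gene span

print(length, coding, non_coding)
```

The 90 bp difference between the gene span and the bases needed for 333 amino acids fits the "Is gene spliced: Yes" field, since a spliced gene's genomic span includes intron sequence. `gc_fraction` can be applied to the Gene sequence field as printed, because the spaces between the 10-base groups are skipped.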