Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_41082 |
Symbol | |
ID | 5002488 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | + |
Start bp | 594749 |
End bp | 596473 |
Gene Length | 1725 bp |
Protein Length | 509 aa |
Translation table | |
GC content | 55% |
IMG OID | 640417909 |
Product | predicted protein |
Protein accession | XP_001418279 |
Protein GI | 145347657 |
COG category | [A] RNA processing and modification [D] Cell cycle control, cell division, chromosome partitioning [L] Replication, recombination and repair |
COG ID | [COG5049] 5'-3' exonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.523535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0326326 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGTCC CGAAGTTTTT CCGATGGCTC GCCGAGCGCT ACCCGCTGCT CCAGCAAGAG ATCGCGGGAA ATCAGATCCC GGGGATCGAT AACTTGTACC TGGACATGAA TGGTGTCATT CACAACTGCT CGCACGGCGC AGGGACGGAT GTGAATACGC GAATGACTGA AGACGAGATG ATGTCCAAGG TTTTCGCGTA TTTGGATCAT CTGTTTCGAA TGACGCGACC GAATAAGATG CTGTACATGG CGATCGATGG CGTGGCACCG CGAGCGAAGA TGAACCAACA GCGGAGTCGA CGATTTCGAA GCGCGGCGGA GGCGGCGAAG GACCGCGAGG AAGCGCGTGC GAGAGGCGAA CCGGAGCCGG AGGGCGAACC GTTTGATTCC AACTGTATCA CGCCGGGGAC GGAGTTCATG GCGCGGTTGA CGGAACATTT GAAATTCTAC GTGCGTAAGA AGCAAACGGA AGATCCACTT TGGGCAAAGG TGACGGTGAT ATTGTCTGGG CACGAAGTCA GAGGTGAGGG CGAGCATAAA ATCATGGAGC ACATTCGATG GGCGCGAACG CAGCCGGACT GGGAGCCGAA TCAAACGCAC TGTTTGTACG GCCTCGACGC CGATCTTATC ATGCTTGCTC TCGTCACGCA CGAACCTCAC TTTTGTTTGC TTCGTGAGGT CGTGAAGTTT GGCGGCGGCG AGAAAGGGCA GCCGAGCCGG GAGATTTTGT CCAATCCCAC CGACGACGGC TTCATCTTGT TGCACATCGG CTTGTTGCGC GAGTACTTGG ATTTAGAATT CCGCGAAAAG AATCTTCCGT TCGGATACGA GCTCGAGCGC GTCATCGATG ACTTCATCTT ACTGTGCATG CTCGTCGGGA ACGATTTCTT GCCCGCTCTG CCGACGCTGA ACATCGCCGA GGGCGCGCTG AACACGCTCT TCAAAGTGTA CCACGACACG TTGCCGATGC TTGGAGGTTA TATCACTGGC GATGAAGGGG GGGGTACTTT TAACCCTGAA CGTTTGGAGA AGATCATGAG CATCATGGCG ACGTTCGAGC GACAGGTGTT GGAAGAGCGC GCGATGGATG TCGAGAAGGA AGAGGAGAAG AAGTCTCGAC GAAAAGGTCG CAACGGTGGT TCCGCGTCCG ATCTCACTCC CGAGGAGAAG TTCGACAAGG ATTTGAGTGA GATGAGCGAC ACCGAAGGCG TGCCACAAGT TTCTGCCGAC CCGACAATGA TGAACGCTGC GAAGCGGGCG CTGATTCTCG AAGGTGGCGA GGAGGGCTTG CAAGCGTGGA AGGATACGTA CTATCGCGAA AAGCTCGGTT TGAAGATTGG CGAAGCTGCA CCGCTGGGTG AAATTAGACA AGCTTATTTC GATGGTTTGA ACTGGGTCTT GCGTTACTAC TATCGTGGTG TTGCGTCCTG GACTTGGTAC TATCCCTACC ATTACGCGCC GATGGCGAGC GACTTGTGCG CCGGCATGGG CGGTCTCACG TCTGAGTTCG ATTACGGCGA ACCGTTCAAA CCTTTCGAGC AGCTCATGGC TGTACAACCA CCATCGAGCT CCAAGTTACT CCCCGAGCCA TTCCGCCACT TCATGGAAGA TCCGCAGTCG CCCTTGGCTG AGTTCTTCCC GGAAGACATC AAAGTTGACT TTGAAGGCAA GCGCAACGAC TGGGAAGGCG TCGTGCTGTT GCCCTTTTTG GACGCCGATC GCTTG
|
Protein sequence | MGVPKFFRWL AERYPLLQQE IAGNQIPGID NLYLDMNGVI HNCSHGAGTD VNTRMTEDEM MSKVFAYLDH LFRMTRPNKM LYMAIDGVAP RAKMNQQRSR RFRSAAEAAK DREEARARGE PEPEGEPFDS NCITPGTEFM ARLTEHLKFY VRKKQTEDPL WAKVTVILSG HEVRGEGEHK IMEHIRWART QPDWEPNQTH CLYGLDADLI MLALVTHEPH FCLLREVVKF GGGEKGQPSR EILSNPTDDG FILLHIGLLR EYLDLEFREK NLPFGYELER VIDDFILLCM LVGNDFLPAL PTLNIAEGAL NTLFKVYHDT LPMLGGYITG DEGGGTFNPE LSADPTMMNA AKRALILEGG EEGLQAWKDT YYREKLGLKI GEAAPLGEIR QAYFDGLNWV LRYYYRGVAS WTWYYPYHYA PMASDLCAGM GGLTSEFDYG EPFKPFEQLM AVQPPSSSKL LPEPFRHFME DPQSPLAEFF PEDIKVDFEG KRNDWEGVVL LPFLDADRL
|
| |