Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25132 |
Symbol | |
ID | 5004274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 220481 |
End bp | 221836 |
Gene Length | 1356 bp |
Protein Length | 441 aa |
Translation table | |
GC content | 65% |
IMG OID | 640419695 |
Product | predicted protein |
Protein accession | XP_001420115 |
Protein GI | 145351503 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0100865 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGCGCGACGG CGATGGCGAC GGCGACGATC GGTCGAGCGA CGCTCGAGGC GACGCGCGCG CGCGCGCGCG CGACGGCGAC GCGCGCGCGC GGACGATGGG ACGCGGGACG CGCGGCGAGG GCGAGGGCGA CGGGACGCGC GCGCGCGACG GGACGCGGCG ACGGCTTCGG GGACGCGGAG ACGAATCCAG CGGCGGGACT GAAAGCGCCG AGCACGCCGG AGAGCCCGCG AGGGCAGCAG CTGGCGTACA TACTGCGAAC GGCGCCGGAG ATGTTCGACG CGGCGGTGGA TTCGCAGTTG GACGGGCTGG GCGAGGAAGT CGAACGCGAG GCGAAGAGCG CGAGCGAGGA GGCGAAGACG GAGCAGTTGG TGCTGTTTAA ACGAATCGCG GACGTGCGGG CGCTGGAGCG GAGGAACGGG CTGGAGGATA TCATGTACAC GACGATCATT CAAAAGTTTT TGAGCGTCGG CGTGGATATG TTGCCGCCGT TGGACGAGAC GACGATGCTG AAGGGGATCG ATTTGAATCG GTTGACGGAC GGCGTGCACA GCAAGGAGGC GCTGGACATG GTGCGAGAAC ACCTGATGGC CGTGTTGGGA GGGGCGGGGG AGAACGCGTA CTCGTCGCAG CTCGTGCGGA TGTCCAAGCT TCAGGCGGCG CAGGTGTACG CGGCGTCGAT CATGTTTGGA TACTTTGTCA CGCGCGCGGA TAAGCGGTTC CAGCTCGATC GCATGGTGGG CACTTTGCCC ATGGACCCGA TGGAGAGCGC CATGGCGCTC GAACGCTTGT TCAACAGCGC CAGCGCGATG GATTCCATCG ACGAGGCTGA CGCGGCGCCG CAAAACTTTG GCGGCGAGGA TTTCGACTTG TTCAGCGACA GCGCCCCCTC GAGCGGGACG GGGAGTCAAC TCACTCTGAA GCAATACATT CAGAACTTTG ATCAGTCCAC GTTGGCGCAA ACGGCGCGCA TCGTTTCCAT GGAGGGCGTA CAAGTGGCCG AGCGCCAAAC CGGCGCGTTG TTCGGTTCGA TCGAAGATTT GCAACGCGAG ATGCAAGACG CGGTGGGCAT GAACGCGGTG ACCCCGGAAG AGCTCATGGA TGCGGTGAAC GACGCGGTCG CGGAGAAGAA AGTACAGACG CTCACGCTCG CATATGCGTC GCAGCGTCGC CTTGTGCTCG AAGCCGTGGC GTTCGGCGCG TTTTTGCGCC AATCCGAAAC TTACATCGAC GGTTACAACC CCAAACTGTT GACCCCCACG CCGCGCGGTC CGACGGGCCC GCCCGGGGCC TCGCTCCCGT CCGGAGACGA CGACGACGGC GGGAGTCCCG TGGCCTAGGT TAATGCTTGA TTCATT
|
Protein sequence | MATATIGRAT LEATRARARA TATRARGRWD AGRAARARAT GRARATGRGD GFGDAETNPA AGLKAPSTPE SPRGQQLAYI LRTAPEMFDA AVDSQLDGLG EEVEREAKSA SEEAKTEQLV LFKRIADVRA LERRNGLEDI MYTTIIQKFL SVGVDMLPPL DETTMLKGID LNRLTDGVHS KEALDMVREH LMAVLGGAGE NAYSSQLVRM SKLQAAQVYA ASIMFGYFVT RADKRFQLDR MVGTLPMDPM ESAMALERLF NSASAMDSID EADAAPQNFG GEDFDLFSDS APSSGTGSQL TLKQYIQNFD QSTLAQTARI VSMEGVQVAE RQTGALFGSI EDLQREMQDA VGMNAVTPEE LMDAVNDAVA EKKVQTLTLA YASQRRLVLE AVAFGAFLRQ SETYIDGYNP KLLTPTPRGP TGPPGASLPS GDDDDGGSPV A
|
| |