Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_94892 |
Symbol | |
ID | 5004093 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 436859 |
End bp | 437857 |
Gene Length | 999 bp |
Protein Length | 314 aa |
Translation table | |
GC content | 60% |
IMG OID | 640419514 |
Product | predicted protein |
Protein accession | XP_001420175 |
Protein GI | 145351636 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.00994972 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.638195 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCCAA ACGAGCTGTC GCCGGACGCG TTTGTGGACG AGGCGAAGGT CGTCGCGGGC GAGCTCGGCG CGAGCGTTAA GGTTGTCGAG ATTAAACGTT ACGCGCAACT CGTCGAGGAG GGCTACGGAT GTCTGGCGGG CGTCGGAAGC GCTTCGAGTC GCGACGGTCG TGATCCCGCG CTCGTGCACT TGCGATTTAC GCCGAAAGGT TGCGTCGACC CGGACGCGCC GTCCATCGCG TTCGTGGGCA AGGGTATCAC GTTCGACACC GGTGGGCTTT CGCTGAAATC TAAAGACGGC ATGTGCGGTA TGAAGACGGA CATGGGCGGT GCCGCGGGTA TGCTGTGCGC GTTTGAGTCC ATCGCTCGCG AGGACGCCGA ATCGAATTTC AAGACGCCAC TCGACCTCGT GCTGTGCATC GCCGAAAACG CAATTGGCTC GGGCGCGATT CGCCCGGATG ACATTCTCGT CGGCAAGAGC GGAAAGACAG TGGAAATCAA CAACACCGAC GCCGAAGGGC GCCTCGTGCT CGCGGACGGC GTCGCGTATT GCTCCGATCC CGCGAACGCC GCGTGCAAGC CGCGCATCAT CGTCGACATG GCCACGCTCA CCGGAGCGCA AATGATCGCC ACCGGGCGTA AACACGCCGG ACTTGTGACC GATAGCGAAG ACATGGAGCA CACCATCGTG CGATTGGGCA GAATCACCGG AGATCTGGCG CACGCCCTGC CCTACGCTCC GGAAATGTTC AAGAGTGAGT TCAGTTCCAA AGTCGCCGAC ATGAAAAACT CCGTCGCCGA CCGCGCCAAC GCGCAATCGA GCTGCGCCGG TCAATTCATC GCCAATCACT TACACCCAGA CTGGGTCGCT CGCGATGACA CGGCTTGGAT TCACCTCGAC ATGGCTGGTC CTGGTAACTT TAAAGACGGT TTAGGTTCTG GCTATGGAGT GGCTCTGCTC AACGCGCTCT ACAAAGAAAT CGACAGTCGA CCGCAATGA
|
Protein sequence | MAPNELSPDA FVDEAKVVAG ELGASVKVVE IKRYAQLVEE GYGCLAGVGS ASSRDGRDPA LVHLRFTPKG CVDPDAPSIA FVGKGITFDT GGLSLKSKDG MCGMKTDMGG AAGMLCAFES IAREDAESNF KTPLDLVLCI AENAIGSGAI RPDDILVGKS GKTVEINNTD AEGRLPRIIV DMATLTGAQM IATGRKHAGL VTDSEDMEHT IVRLGRITGD LAHALPYAPE MFKSEFSSKV ADMKNSVADR ANAQSSCAGQ FIANHLHPDW VARDDTAWIH LDMAGPGNFK DGLGSGYGVA LLNALYKEID SRPQ
|
| |