Gene Information |
Locus tag | OSTLU_92454 |
Symbol | |
ID | 5000556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | - |
Start bp | 540492 |
End bp | 543338 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | |
GC content | 52% |
IMG OID | 640415977 |
Product | predicted protein |
Protein accession | XP_001416978 |
Protein GI | 145344932 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
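The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; neither convention is stated in the record, but both reproduce the listed values. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(540492, 543338)
print(length_bp, protein_length(length_bp))  # prints: 2847 948
```

Both results agree with the Gene Length (2847 bp) and Protein Length (948 aa) rows, which is consistent with the gene being listed as unspliced.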
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000138993 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACGCG AGCCGCGGGG GAACGGCGCG CGCGAGGGCG CGCGCGAGGA CGACGGCGAG GACGGGCGCG GCGGGGGGGG GTGGGAAACG ATGGGGAGTG GGTACGAGGA TAAACCGTTG GTCGACGCGA GGGCGGAGAC GGCGCGGTTG ACGCCGAGAG TGCGTCGAGA GTTTCACGAC GATTACTGGA CGGTGAGCCA CCTGAAGCTG TTGTACTTGG TGTCGAAGTA TTCGCACCGC GCGCAGTCGG CGTACGAGAA GGAGGTGTGG GTGCGGAAGT TGCCGCTGTT GGTGTTGATT TATGAGGGGA TCGTGCAGAA GGTGTTCGAG TACGATTACG CGCCGGAGTC GACGGTGGTG AAGAATACGC GCTTGTACTT GAACGTGTCG CAGGAGGGGG TGGACGATTT GGACGATTTG ATCGAGGGAG AGCTCATTCG TGGGTTGAGG CTGAGCAGTG CGGAACATCA GAGCGTTTTG GCGTTTCAGA TCACCGCCGC TGGGTTGGCG CTCGTGAGTA AGCGCATGAG CGAAGAGGAT CGACAGTTGG TGGACGAGCT GTGCGTCGCC GACGGGAAAC TTTTGCAAGT GGTGTGGGAA GATGAGACCT TTTACTTGAG AAATGAAACG ATTTCGTTCG AATCGACAAT AACTGATGTA GAAGACGTGT CGTACGTCGT ATCACCGTAT CTTCCGTTGG CTTTACGGCG TGTAGGTGGT CCCGTGACGT CGTCAAATTC GCACCGAGCG TCGGAGAGCG CGGCTAAAGA CAGCACGATT CGCGATGATT TGGACGAGGT CATGACGGTA TCCAACGTGA CAATCTTAGT TGGTGAATGG ATTCCGTTCG GAGTGAATCA AATCGTCAAG TTGAACATGA AGCTGGGATC AAACGACAGA TGTCAGGGCG GATTCTTCAC CCCCACGGTG GATACGGACT CGACCAACGC AACGTGTGAG ATCCCAGCTG GTTTGACAAA TGTAAGCATT TTGGACCATG ATATGACGAC GTTCACCAAC ATTGAGGCTC AAGTACACTT CCCTGAGGAC GATGGTATCG TGCAAGTCGA GCACTTTGGC ATTCACTTGC GCAAAGACGG CACCATTGTG CACGGTTTGA TGATTGAATC AGTCATGGAT CGAATTTTGG ACAATATCTC GCTCGATAAC TTGGCGCGCT TACTCGTGGA CGTGCATATG GACTCAAGTA CGATTTTAGA TTCGCTATTG AGCGATCATC AGCGTGATTT GCTCCAAACT GTTTTCCTCG GTAACGCCGA ACGACGAGAC AAAATTAACA TCATCATCGC CGAGCGCATT GCCCCGCAAC AGGCTGCGGA CATGTACATG GATAAGGACG CGTTCGAAAA CGAGTTAAGA CAAGTATTAG GTGATACGCA CGAAGCGTAC GATTTGAGCG ACCGTGATGT CGTCATCACC GGTTCTCACG GCATCTTACT CACGGGCCCA AACGCCAAGG CGCAGGAACC AGTGATGTTG ATGTACCTGT CCATCATGTC CAGAGATGTC TTCGTGCACA ACGTATTCAA CAGAATATTT ATGACGGATG ACATTCTCAA GCTCGTCCGG CAGATGATTA AAAATCACGA GGAAGATCCG AACAGTGTTG CGAATATTCG TCACACTCTC GCTGAGACTT CGCGCGACAT CATTTTGCTC GAGGAAATCC TGGTGTACTT GAAGGAATCA ATCGAACAAG ACGAGAGTAT TCACACCATT CCAGATACTG AAAGCGGAAG AAGACTTTAT GAAATCCTTG GCCTGAAAGT TATGCTCGCT 
GATCTCACGG TTCGTGTGGC GGATATGCGG AAAATCCTCG AGGCGGCGCG CAAAGAAATC GATGGGTTAC GAATCATGGT GGACACAGTA GCCGAGTCGC AGGCGTTGCG CGTGCACGAA GACATTCGCG GTAGTAGTGG AGAGATGCTG GTGCAAATCA AGTCGAACAG CAAACAAGGG ACAAGCTTAT ACACCATGCA GTTCATTTTC GCGGGTATGC TTGCGTTTGA GTTGCTCGAC AGACTCACTG GCGAATGGAG CGTCGTGCAC ACAGAGTGGG CTCGATCGTA CATTGTTGAT CCGTTTATGA ACAAACCACT GGTCTGGTTC ATTCTTAGCA TGTGTACGTG GGTCGGGGTT GCGTTTGGCG TGTATTATTT AACGAGGGCG ATCGATGACA AGTCTGCGGG AGCAATTTCA TATCGCATCA AGGTGAACAA ACCAATGAAT CTCGACGCTC TCGACGAGTA TTTGGCGTTC AAAGCTATCA TTGGCGAAGA TGGTGTCGCG AATGGTGATA CCACAATTCA GCGCGTCAGG TGGCAAGAGG AAGATATCAC GCGTTGGGAA GGATACCTCC CCATCATTGA ACTCGTGTAC GACGTCAAGC ATGGATTTCT GCTCGATGTG CATCTCACGA TAACCAGACG AGCATCAGTA CCGGCAAAGC TTCGTCCAGA GACGTTGAAG ATTAGATTTT TCACCGAATT GCAAGAAGCG GCGGTGCTAA CGCCAGAAGA CGCGGCGACA ATCATTGATA ATACGCGCCG AATGCGTGCC AAGGTCGACA AGCAAGCCAT CGGACTTCCT ATCGACCAAG CGGTGTCTCT CAAGGTGAAG GTTCCAAATG AGCGTTACTA TCGCGAGATT CCTTTCGGTC GACAGACGTA CCTTGAACTA CGAGAAGAGA TTGCGCTCAA ATTCAGAGTT AAGCCCGTGC AAGTGCTTCA GATTTTCAAG GTACCGGACA CCTTGATTGC GGACGACGAC GACGTGGCAA GAATCGCGCC CGACTCCATC CTCGAGGTTC TACTGAAAGC GTCCTAA
|
Protein sequence | MGREPRGNGA REGAREDDGE DGRGGGGWET MGSGYEDKPL VDARAETARL TPRVRREFHD DYWTVSHLKL LYLVSKYSHR AQSAYEKEVW VRKLPLLVLI YEGIVQKVFE YDYAPESTVV KNTRLYLNVS QEGVDDLDDL IEGELIRGLR LSSAEHQSVL AFQITAAGLA LVSKRMSEED RQLVDELCVA DGKLLQVVWE DETFYLRNET ISFESTITDV EDVSYVVSPY LPLALRRVGG PVTSSNSHRA SESAAKDSTI RDDLDEVMTV SNVTILVGEW IPFGVNQIVK LNMKLGSNDR CQGGFFTPTV DTDSTNATCE IPAGLTNVSI LDHDMTTFTN IEAQVHFPED DGIVQVEHFG IHLRKDGTIV HGLMIESVMD RILDNISLDN LARLLVDVHM DSSTILDSLL SDHQRDLLQT VFLGNAERRD KINIIIAERI APQQAADMYM DKDAFENELR QVLGDTHEAY DLSDRDVVIT GSHGILLTGP NAKAQEPVML MYLSIMSRDV FVHNVFNRIF MTDDILKLVR QMIKNHEEDP NSVANIRHTL AETSRDIILL EEILVYLKES IEQDESIHTI PDTESGRRLY EILGLKVMLA DLTVRVADMR KILEAARKEI DGLRIMVDTV AESQALRVHE DIRGSSGEML VQIKSNSKQG TSLYTMQFIF AGMLAFELLD RLTGEWSVVH TEWARSYIVD PFMNKPLVWF ILSMCTWVGV AFGVYYLTRA IDDKSAGAIS YRIKVNKPMN LDALDEYLAF KAIIGEDGVA NGDTTIQRVR WQEEDITRWE GYLPIIELVY DVKHGFLLDV HLTITRRASV PAKLRPETLK IRFFTELQEA AVLTPEDAAT IIDNTRRMRA KVDKQAIGLP IDQAVSLKVK VPNERYYREI PFGRQTYLEL REEIALKFRV KPVQVLQIFK VPDTLIADDD DVARIAPDSI LEVLLKAS
|
| |
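The gene and protein sequences above can be related with a short translation routine. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1), since the Translation table field in this record is empty, and assumes the gene sequence is given in coding orientation even though the gene lies on the minus strand. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence listed above.
print(translate("ATGGGACGCG AGCCGCGGGG GAACGGCGCG"))  # prints: MGREPRGNGA
```

Applied to the first 30 bases, the routine yields the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.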