Gene Information |
Locus tag | OSTLU_19438 |
Symbol | |
ID | 5000669 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | - |
Start bp | 493569 |
End bp | 497831 |
Gene Length | 4263 bp |
Protein Length | 417 aa |
Translation table | |
GC content | 56% |
IMG OID | 640416090 |
Product | predicted protein |
Protein accession | XP_001416966 |
Protein GI | 145344908 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
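The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names `gene_length` and `coding_bases` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends with a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span of the gene on the replicon, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int) -> int:
    """Bases needed to encode the protein, assuming one trailing stop codon."""
    return (protein_length_aa + 1) * 3


# Values taken from the record above.
span = gene_length(493569, 497831)
cds = coding_bases(417)

print(span)        # genomic span in bp
print(cds)         # coding bases implied by the protein length
print(span - cds)  # bases not accounted for by the coding sequence
```

Under these assumptions the span matches the reported Gene Length of 4263 bp, while the 417 aa protein accounts for only 1254 bp. The gap is consistent with the record flagging the gene as spliced.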
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATTA GAGTTTTAGT TAGTTTAGTG GGGGATGAGA GCGAAAAAAT GCGTGAGATT GCGAAATTTT CAAGAGTAAC GCTCGCGGCG TTGAAGGCCT GGCTTGGAAA GTACGAAAAC AAGACCAATA GCAAACGCAG CAAGAATGGG ATCGAAATAA AGAGTGAAAA CTCGGACACC GACGAACTGT TGTCGTGCCT TCGTATGTGG CTTCGTCACG AGAGCGCTTC GCACGTGAAT ACGCGTCAAG ATGCTCGTCG AGAAGAGGCG ACCAAGGCGA AGAAAAAGTT GGATAAAAAC ATGATTTTCC CAGAGCAAAA TCTAGACATG GCAGCGTCTT ACGGGCTCGT TAACGTACCT TCAAACCTCA CAAATATGAC GACTACGAGC GAGCAAGCGT TAAGTGGCGG CCAAGGGATC GACGACTCAC CGTCCAGCGC GGGAACGACC GTCGTGGGTA ACGACCCCGT TCGGGATGAG CACGAAGAAT TGGGCGGTGA CAAGGATATG GATGTGCGAG AAAAGACCGG ATACGAATGC ACAACGGCAT TTGGGATCGA TGGAGAGCTG AAACATTTTG AGCAAGGCTG TACAGAAGAG ACGATGGCCA TTCACACGGA AATTTTGCTC GACAAGGCGT TCAGGCTCAA AGAGTCTACC TCACAGACTC CCAGTGGTCC CAAGCAAGGT TGCTCGGGGG GATCTCAACC ATGCGCGGTG TGCGTGAGAT TGAAAACGTT GTCGAGCAAA TGCGGTACAA GTGAAGGAAG CGTGAATTGT CTCAAGAAAA GTGGCGAGTC GAAAGATTTG CAACTACCTG GGGCGCCGTC TCGCGCCAAC CGTGTACAAG GCGGTCCATC GGCAAAGGAT GAGTTGCAAC GAATTGAAGA CGCTTGGCGT TCGAACCCGA CGCTCTTGGC GTGCGCTCCA AAAGACGAAG TAGAAGCAGA GATTCTTGCG CTTCAATATG AACTTCTTTG GCAACTACAG TCGAACAGGC AGAAACTCAA GAGGGCACAA GAGAAAATAG CTAGCGGAAT CGAGTCGGAC AACGACGAAC AAGCAAAGCG CAAGGCAAAG CTCGATGAGG CCGCGACGTA CATGTCGGGA ATCAGAGAGG TCAAACGTCA ACAGAAGAAA GAGAAGCGCG AAGCGGACCA GCTGGCAGCA CTGGAGCGAG CCAAAGCAGC AGTGGGTGAT GGGCGACGCG AATCAAAACC GAAACGCGAG ATGGGTTCAG CCGCAATAGT GGCACCCGAC CCGAAGAGCC TGATACCAAC ATACAAGCCC CCGACTACCC AAGCGGTGAT GAAAGCAATA GAAGCGGCAC CAAGTCGGGT GAAAAAAATG TTCGGTCTCG CGAACTTCGA CCCCACCACA TCGAGTCACC TCAAGTCTTT TAGCCTGTCT GGCACGCCTC GAAGCGGATC CCCTGTTCTT GGAGCGTCTC CGATTCGTTC ACCGATGTCA TCGATTCTTC TGGATCCGTT CCATGCCGTG TCCGATAACA AAGCTTGCTG TGTCTGTGCT GGGACTGAAG AAGCGGAGAA GATGGAAGAA ACAATGAAGT GTTCGCAGTG TGAACTTATC GTGCACCCGC ACTGTTACGG AATCACGGGT GACGCTTATG TCAAGGATCC CAAGTGGTTG TGTTTTGTGT GCAATGCAGC CGTGTTAGCA GGCACTGCAA TTCCAAAGAC TTCAAAGAAG TCGCGCATAA CGTTACAAGG CAAGTTGGCT CTTTACCGCG GAGTGGAGTG CACGCTTTGC CCGGTGAAAA TGGGCGCCTT CAAGAAGACG 
CTGACCGGGA ACCAGTGGTG TCACGTCGCG TGCGCGAAAT GGGTCCCCGA GGCGCACTTG TTCGATAGGA TCATGAATCA CGTAGCGGTG ATCAGACCTG TGAATATCGA GGACGTGCCT CGCGAGCGCC GCAACGCGTC TTGCACATAC TGTACGCGTT CGCATGGAAC GCTCATGCGT TGTTGCTTTG GGCACTGTCA AACCGTCTTT CATCCACTCT GCTGCCGACG AGCAGCGTGT CATATGCGCG CAGTCGACCA CTCCAAGAAA AGATTCACTG CTTTTTGCGA GAAACATTCG CGTTCTGAAC GCGAAAAGGA TCTCGCAGTC AACCTTGTGG GAGACCCTGT TCCAGCGCTC GTTGAAATGT ACAATCAGTC GCCGTTAGAG AGACAGCTAT CTGGAGGTTT GCAGCGCACG TTATCTGGTG GTACAACCGG AGTGTCGCCA ATGAACGGAG CCATGCGTCG GATCCGGGAG ACGATGCGCG CGAAAGACAC ACAAAAGCGA GCCGCCTTTG GCGGACTCCC GTCGAGCGCC AAGGCGAAGA CGAAGAAAAC TCGAAGCAAT CTTCAACAAG CCAACAGGCT TCCTGGTAGT GTCCCTGGTT CTTCGCTCGC AGGGTTCGAT TCCGAAGAAA ACTTTGACGA TCTTCTACCG GGTCAGCGCG GTCAAGCCCG AAACACCGGT GCGTTGCGTG GCGGAGCGCT CCCTCGCGAC GCCCTCCTGA CGTCCCCCGA GGCGGACAAG GTCAATCGCG ACTTGCCCAC CGGGTACCAG TACATGGAAA AATCCGACAT CGCACGATAA TCGCACATGG TTCACGATTC ACGACGCATC AGAGCGTCTT AGCCCGCGTT CTCGACGAAA ACATTGTATC CGAGACCCAC TCGGGTGCGG CGTCGCCACC CTTCAGAAAC GTTCTTCTCT TCGTCGCCCA CCGCACACCT CAGCGTTCGC TCGTCGGTGC GTCGCGAAAC CGCGGGTCGG CACGCGTTAC CGCCGATCGC CCGCCCGCCC GCGAGTCATC ATCGGCACGA GCGCGCGCAC GAGGCGCGAA CCGAGCGCGT AATCGAAGAC GTCACCCATG TCGGCGGCCG GCGTCACGCG CGCCGCGTTC GCGTCCACGT CGTCTCCGCG ACGCCGCGTC TCGCCGACGC AACGGACGGC GAGCGTTCGA CGCGCTGCGT CGTCGGCGAC GGCGGTGAAA GCGCCGACGT CGACGCTGGG GGAGGAAACA CGAGGAGACT TTCCGATTCT CGACCAAAAG ACAGAGTCAG GCGCCCAGCT GGTGTATCTG GACAACGCGG CGACGTCCCA GAAGCCGAAT CAAGTCGTCG ACGCGCACAT GTTTTACTAC AAGGATTATA ACTCCAATGT CCATCGAGGG GTGCACTATC TGAGCAGCAA GGCCACGGAT GCGTACGAGT TGTCGCGGAG AAAGGTGGCG GCGTTTGTCA ACGCGACGAC GGATAGGGAA ATCGTCTTCA CGAGAAACGC GAGCGAGGCG ATCAATCTGG TGGCCAACAC GTGGGGGACC AAGCACATCA AAGCCGGTGA CGAGATAATT TTGAGCGTTT TAGAGCACCA CTCAAATATC GTCCCGTGGC AACTGCTCGC TGAGCGGACG GGGTGCGTGT TGAAATTCGT TGAATTGACG AAGGATACGC AGGAACTGGA CATGGATCAG CTTCGCTCTT TGGTGAACGA AAAGACGAGG CTCATCACGA CGGCGCACGT GAGTAATGTC ACCGGTGGTG TGGCACCAGT GAAGGAAATT ATCGATCTCG CTCGAGGCGT CGGCGCCAAG 
GTGCTTCTCG ACGCGTGCCA AAGCTTGCCG CACATGAAGG TGGACGTGCA AGCGCTCGGT GCGGATTGGA TCGTGGGTAG CTCGCACAAA ATGTGCGGAC CGACTGGTAT CGGATTTTTG TGGGGTCGTA TGGACGTCTT AGAGACCATG CCGCCGTGGA TGGGAGGCGG CGAGATGATC GCCGACGTGT ACCTCGAAAA GAGCACGTAC GCCGAACCTC CGTCACGGTT CGAGGCTGGG ACGCCCGCGA TAGCTGAGGC CATTGCCTTG GGCGAGGCGT GCGACTACCT CACAAAAATA GGCATGGATA GAATCCACGA CTACGAGATC GAGATCGGGA CGTACCTGTA CGAAAAGCTC TCCGCCGTCC CTGGAGTGAC GATATACGGC CCTCCGCCCG CGCGCGGCCG AGCCTCGCTT TGCGCTTTCA ACGTCGACGG CATTCACGCC AACGACCTTT GCACCTTACT AGACCAAGCG GGAATCGCGT GTCGCAGCGG TCACCACTGC ACTCAGCCCC TCCACCGTTA CCTCGACGTT CCCGGGAGCG CTCGCGCCAG CCTCTACTTT TACAACCTCC CGAGCGACGT CGACGCCTTC ATCGACGCCT TGACATCCAC AATCGCGTTC TTCGCCGAGT TTGAAGAATA ATA
|
Protein sequence | MTIRVLVSLE TRGDFPILDQ KTESGAQLVY LDNAATSQKP NQVVDAHMFY YKDYNSNVHR GVHYLSSKAT DAYELSRRKV AAFVNATTDR EIVFTRNASE AINLVANTWG TKHIKAGDEI ILSVLEHHSN IVPWQLLAER TGCVLKFVEL TKDTQELDMD QLRSLVNEKT RLITTAHVSN VTGGVAPVKE IIDLARGVGA KVLLDACQSL PHMKVDVQAL GADWIVGSSH KMCGPTGIGF LWGRMDVLET MPPWMGGGEM IADVYLEKST YAEPPSRFEA GTPAIAEAIA LGEACDYLTK IGMDRIHDYE IEIGTYLYEK LSAVPGVTIY GPPPARGRAS LCAFNVDGIH ANDLCTLLDQ AGIACRSGHH CTQPLHRYLD VPGSARASLY FYNLPSDVDA FIDALTSTIA FFAEFEE