Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_35452 |
Symbol | |
ID | 5002695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | + |
Start bp | 763644 |
End bp | 766673 |
Gene Length | 3030 bp |
Protein Length | 1009 aa |
Translation table | |
GC content | 58% |
IMG OID | 640418116 |
Product | predicted protein |
Protein accession | XP_001418796 |
Protein GI | 145348729 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | [TIGR01408] ubiquitin-activating enzyme E1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGATCG ACGAGGACCT GCACTCGCGA CAGCTCGCGG TGTACGGACG AGAGAGCTTT CGAAAACTCG CGAGCGCGCG GGTGTTGGTG ATCGGGGCGC GAGGTTTAGG GTGCGAGATC GCGAAGAATG TCGTCTTGGC GGGCGTGCGC GCGGTGAGCG TGTGCGACTC GGGGGCGTGC GAGGCGGCGG ACGCGAGCGC GCAGTTTTAC GTGGACGAAG CGAGCGTGAA GGCAAACGTG ACGCGCGCGC GGGCGAGCGT GGGGAAGTTG CAGGAGCTGA ATCCGGCGGT GGAGGTGAAC TGCGTGGAGA CTTGCGACGA AGACGCCGTC AAGGCGCACT CCGTGGTGGT GTGCGCCGGG GAGACGTCCG AGGCGGAGGC GGTGGCGATT AACGCCATGT GCCGGGCGAA TAACGTGGCG TTTATTAAGA CGGACGTGCG AGGGGTGTTC GGAAACGTGT TTTGCGACTT TGGTGACGCG TTCAACGTCT TGGACGTGGA CGGCGAGGAG GCGCTGTCGT GCATCGTGGC GAGCGTGTCA AACGATTCTC CGGCGCTCGT CACGTGCATC GAAGACGAAC GCGTGGAGTT GCAAGACGGC CAGCGAGTGA CGTTTAGCGA GGTGCGAGGG ATGACTGAAC TGAACGGGCT GTCGGTCGTG GTTAAAAACG TGAAGAAGCA TTCGTTTGAG CTCGACTTGG ACACGAGCGC GTTTTCGCCC TACGTCGGCG GCGGCATCGC GACGCAAGTC AAAGAAACGA AGACGCTCAA GTTTGCCTCG TACGCGGACT CGCTCGAGTC CCCAGGGGAC TTTTTGTTGA GCGATTTCGC CAAGATGGAA CGCTCGCCCC AGTTGCACTT GGCCTTCGGC GCGCTCGACG CGTACGTCGC CAAGCACGGC GCGTCGCCGA CGCCGGGTTC CGACTCGGAT GCTGAAAAAT TCGTCGCCGA AGCGGAAGCG TTGAACGCGA CGCGTAAAGC GGTGGATGAA GTTGACAAGG ACTTGTTAAA GACGTTTTCG AAGACGTGCC GAGGTCACGT CTCGCCCATG GCGGCGATGT TTGGCGGCAT CGTCGGCCAA GAAGTCGTCA AGGCGTGCAC GGGCAAGTTC CACCCGTTGT TCCAATGGTT TTACTTTGAT TCCGTCGAAA GCTTGCCCGA GACGTTGACC GAAGAAGACC TCGCGCCGCG AGGTGATCGC TACGACGGTC AAGTCATGTG CTTCGGGACG AAGATGCAGG ACAAAATTCT CAGTCAAAAG ATTTTCCTCG TCGGCGCCGG CGCGCTCGGT TGCGAGTTTT TGAAAAACTT TGCGTGCATG GGGTTGTCGT GCGGTCCGAG CGGTGGTGTG ACGGTGACGG ACGACGACGT TATTGAGAAG TCAAACTTGT CCCGTCAATT CTTGTTCCGC GACTGGAACA TCGGTCAAGG CAAGAGCGTG TGCGCCTCGA ACGCGGCCAA GGTGATCAAT CCGAACCTCA ACGTCACCGC GCTCGAGAAC CGCGTGAGCC CGGACACGGA GGACGTTTTC GACGATGGAT TCTGGGAAGG CCTGGATGTG GTCGTGAACG CTCTGGATAA CGTAAACGCG CGGTTGTACG TAGACAGTCG ATGCGTGTAC TTCCAAAAGC CGCTGCTCGA GAGCGGGACT CTCGGCACGA AGTGCAACAC TCAAATGGTC ATTCCGAACA TGACAGAGAA CTATGGTGCT TCTCGTGACC CTCCGGAGAA GAGCGCGCCG ATGTGCACGC TGCACTCGTT CCCGCACAAC ATCGATCACT GCTTGACGTG GGCGCGAAGC GAATTCGAAG GTGCATTCGA GAAGGCTCCC GCCGAGGCCA ACTCTTATTT GTCCAAGCCA GAGGAATACG CCGCGGCGGC GCTGTCGAAC CCCGATGCTT CCGCGCGAGA GAATGTCGAA AAGGTTGCGC AAGTGTTGTT GAAGACGGCA TGCTCCACGT ATGACGAATG CATCGCTTGG GCGCGCACGC AGTTCCAAGA GCAATTCCAC GACAAGATTT TACAGCTCAC GTTTACGTTT CCCGAAGACG CCGTCACGTC GACGGGTTCA CCTTTCTGGA GCGCACCGAA GCGTTTCCCG CGACCAGTCA TATTTTCCAC CTCGGACGCT TCGCACATGA CGCTCATCCG CGCCATGGCG AACCTCAAAG CGGAGCTCTC TGGGATCGCG CGACCGGCGG CGGGAGTCAA CGACGACGCC GCGCTCGTGC AGCTCGTCGA CAAGGTGGCC GTCGCTCCTT TCGAACCGAA GAAGGGCATC AAGATCGAGA CCGACCCCAA GGCGAACACC GCCGCTTCGA GCATTCCTGA AGGTATCGAC GACGAGGCTG TGATCAAGGA CGTGTTGGCC AAGCTCGAAA CGAAGCGAGC GGGCTTGGGA GGAGATTACA GACTCAACGT CATCGAGTTT GAGAAGGACG ACGACACAAA CTTTCACATG GACGCCATCG CTGGTCTTTC CAACATGCGT GCGCGCAACT ATGACATCGG TGAGGTCGAT AAACTCAAAG CAAAGTTTAT CGCGGGGAGA ATTATTCCAG CCATCGCGAC GACGACGGCG ATGGCGACGG GTTTGGTGTG CCTCGAATTG TACAAGGTGT TCAAAGGCGC GAAGATTGAG GCGTATCGCA ACACGTTCGC CAACCTCGCG CTCCCGCTGT TCGCCATGGC GGAGCCCATC GCGGCCAAGC AAGACAAATT CAAAGACTTG TCGTGGAGCA TGTGGGACCG ATGGATCTTG GAGGGCGATT TCACGGTTCA ACAAGTCTTG GACCACTTCG AGGCCAAGGG CCTGATCGCG TACTCCATGT CCGTCGGCGC GAGTTTGGTT TATAACAATA TTTTCCCCAA ACACAAGGAG CGTTTGAACC AAAAACTCAG CGAGTTGGTG CAAACCGTGG CGAAGATGGA AATTCCCGCC AAGCGTCGAC ACTTCGACAT CGTCGTCGCG TGCGAAGACG ACGAAGGCGA AGACGTCGAC ATCCCGATGG TGTCCATTCG CTTTAGATGA
|
Protein sequence | MEIDEDLHSR QLAVYGRESF RKLASARVLV IGARGLGCEI AKNVVLAGVR AVSVCDSGAC EAADASAQFY VDEASVKANV TRARASVGKL QELNPAVEVN CVETCDEDAV KAHSVVVCAG ETSEAEAVAI NAMCRANNVA FIKTDVRGVF GNVFCDFGDA FNVLDVDGEE ALSCIVASVS NDSPALVTCI EDERVELQDG QRVTFSEVRG MTELNGLSVV VKNVKKHSFE LDLDTSAFSP YVGGGIATQV KETKTLKFAS YADSLESPGD FLLSDFAKME RSPQLHLAFG ALDAYVAKHG ASPTPGSDSD AEKFVAEAEA LNATRKAVDE VDKDLLKTFS KTCRGHVSPM AAMFGGIVGQ EVVKACTGKF HPLFQWFYFD SVESLPETLT EEDLAPRGDR YDGQVMCFGT KMQDKILSQK IFLVGAGALG CEFLKNFACM GLSCGPSGGV TVTDDDVIEK SNLSRQFLFR DWNIGQGKSV CASNAAKVIN PNLNVTALEN RVSPDTEDVF DDGFWEGLDV VVNALDNVNA RLYVDSRCVY FQKPLLESGT LGTKCNTQMV IPNMTENYGA SRDPPEKSAP MCTLHSFPHN IDHCLTWARS EFEGAFEKAP AEANSYLSKP EEYAAAALSN PDASARENVE KVAQVLLKTA CSTYDECIAW ARTQFQEQFH DKILQLTFTF PEDAVTSTGS PFWSAPKRFP RPVIFSTSDA SHMTLIRAMA NLKAELSGIA RPAAGVNDDA ALVQLVDKVA VAPFEPKKGI KIETDPKANT AASSIPEGID DEAVIKDVLA KLETKRAGLG GDYRLNVIEF EKDDDTNFHM DAIAGLSNMR ARNYDIGEVD KLKAKFIAGR IIPAIATTTA MATGLVCLEL YKVFKGAKIE AYRNTFANLA LPLFAMAEPI AAKQDKFKDL SWSMWDRWIL EGDFTVQQVL DHFEAKGLIA YSMSVGASLV YNNIFPKHKE RLNQKLSELV QTVAKMEIPA KRRHFDIVVA CEDDEGEDVD IPMVSIRFR
|
| |