Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_12268 |
Symbol | |
ID | 5000086 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | + |
Start bp | 822573 |
End bp | 825343 |
Gene Length | 2771 bp |
Protein Length | 311 aa |
Translation table | |
GC content | 61% |
IMG OID | 640415507 |
Product | predicted protein |
Protein accession | XP_001416263 |
Protein GI | 145342772 |
COG category | [R] General function prediction only |
COG ID | [COG1234] Metal-dependent hydrolases of the beta-lactamase superfamily III |
TIGRFAM ID | [TIGR02651] ribonuclease Z |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.812865 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTCGA ATTTTAGCGG CGCGGAGTTG ACGTTTCTCG GCACGTCGAG CGGGGCGCCG AGCTTTACGC GAAACGTCTC GTCGGTGGCG TTGAGGTTAG AGAACGAGGT TTGGTTGTTT GATTGCGGTG AGGCGACGCA GCATCAGTTG ATGCGGTCGA AGCTGAAGTA TGCGAAGATC ACGAGGATTT TTATCACGCA CATGCACGGG GACCACATCT TCGGTTTGCC CGGGTTGATT TGCGCGATTA GCGGCGCGCG CACGGCGCAC TCGAGGTCGT ACGGTGGTTC TGTCGTGCCT TTGCACATCA TAGGCCCGCC GGGGATACGG CAGTTCATTT ACTCATCCAT CACGTGGTCG AGATCGGTGC TCGGGATGCC GCTGATCGTG ACAGAGTTGA GGCAGCCTGC GCGCGCGAGC GGCGGCTCGC CCGCGGCGCA CACGTCGGTC GATCCGCGAG GGAAGATTTT CATGGGCGAA TATTGGCCCG ATAACGTCTC CGAACCGCTG CCGCACTTTA GAGACGCCGC ACGATGGAAC GAACTCGGGA TGAGCGAGCA GCTGCCGATT TGGACCGCGT ATAACGATGG AAACTTTTGC GTTCGCGCCT CGGTGCTGCG ACATCCCGTG CCGTGCTTCG GTTACGTCAT CGACGAGTGT GACGCCAGCG GTCGGCTCGA CGCCGAGAAG TGTGTGGAGA TGGGACTGCC TCCAGGGCGC GAATATGCCA TGCTTAAAGC AGGGCAGTCG GTGACGACGA AGGATGGACG CGTGATTCGT CCCGAAGACG TCATAGGCAC GCCTCGACCG GGGCGTCGAC TGGTGCATCT CGGCGACACG TGCGACAGCA GCTCGATGGT TTCCCTGGCG CAGGGTGCGG ATTCCTTAAT TCACGAATCT ACGTTTGAAG CAAAGAAAGT GTCAGAGGCG CTGTACAAAG GACACTCCAC GGCGCGCATG GCTGGGAAGT TTGCCGCGCA AGTCAACGCG CGCGCGTTGA TTTTAACGCA TTTTTCAAAT CGCTACGCCG GCGGCGTCCA TCGCGCTGAT TCAAACAGCG ATGATCTCGG TGCTAGCGGA GACGACGACG ACGACGACGA AGATGATTGT GAAGATATGT CGCCACCCGA TGTCGACTCC GACGGCGAAG CACTCGCCGA GGGCGATCGC ATGAACGTCG AGCGTCTCGT CGAGGAAGCC AAGGAGGCCA AGGGCGACTC GCGCGTCATC GCCGCGAGCG ACTTCTTCGT CTTCAACGTC GGTCGGCGCG AAGAGTTCGA CGATTTCGAT TTCTCCAAGG GTGATCGATC GGTGTTATTC GCCAGCCCCC GCAAGACGAC CCCAGAGACG TTCGTCGTCG ATCAGGACGA CGGCCGCGCG TCGTCGTCAT CGGGTGGCGA CGATCGAGAC CGTCGACCGC GGCGAGGCGG CGGCTCCGCC GACCGTGGCG GTCGCGGCCG CGGCGCCCGG GCCCCGGTCA GTGGACGCTA TCAGCGGTCG CCATCTTCGC AATAGCTTTC ACGCGCGAGC TCATGTAAAC ATACACACCG TCGAGTCGAC GCGCCGTCCC GAAGCCCCTA TCCACGACGT CCCCTTTCCA CGCGGTCATC ACTGTGCGCC GCGGACCCGA ATTCGCGCGC CATGTTCGCC GCCCGCGCGC CCGCGAAGCT CGCCGCGTCG CCCGCGCGCG CGTCCCGCGC CTCTCGCGCC GTCGTCACGT GCGGCGCGTC TCGTAAAGCG CGCCGAGCGC AAACCGGCGG CGGCGGCGCC GACGCGGCGA CGAAACCCAA ACCCGCGCTG AGCGCGGCGA ACAAAGCCGC GCGCGATAAA GCTCGCAAAG CCGCGGCGAC TAAAGCGCGC GGATTGGTCA GCGAACAAGG GAAGGCGTAC GAAAAAATCG TGAACGAGAT GGGGCGAGAG GCGTCGCGCG AGTACGTGCT GAGCGTGCGA CACGCGCCGA AGAGTGGAAA CGCAGAAGAC GTCGTCGGGG TCATGAGCGA TTGGTTACCA GTGTGTGAAG TCGTCGTCGC GGATCGGAAG GCGTACGAGG CGCAGATGTT CTTTAAAAAG CAAGGAATGA GCGTGCCGGA GACGGAGACG CAGACGCTGG AGAACGCGCA AATCGAAGGC GTGCCGGCGG TGCGGATGAT GTTGCCGAAA TACGAGCAAC ACGTGGCGGC GATGGCCTTC GTGATGAGTG GAGTCAAGGA TTTGAACATG GATGAGGTCG AGTACGGATT GGAGGATTGG GGATCGTTCG AGGTCGCCGT GGACGCGCTG GCGGCGCAGA CGAACCAGCG AGGGAAATTT GAAGCCGCGG CGAAGCTTCT CGGCGTTTCT GTGGACGATG AGCCGAGCGA TATTAAGCGC GTGTATCGTA AACTCATCGC TGAGGCGCAC CCCGATCGTA ATCCGGACAC GACGCAAGAG AAGTTTAACG CCATCAAGGA TGCGTACGAG CTCTTGTGCG GTCGCGGCGA TACCGCGGGG ACAACGTTCG AAGGACTCGG TGACCACCGT CGCGACTTCG TACCGCTCGA CAAAGGTCAC TTTGGGGTGA CGAGCGCGGA CAAAGGTCAA ACCGATCCCG CCTCGGTGGC GTTCGCCATG CGCACGCTTA CCATTTACGA CCGCGTCGGC ACGGTGTTCT CGACGAGAAA TCTCAAGCTC GCGCAAAAGC AAAAGGCGTG AGCGCCGAGC GCGCGTTGTA CCATCATCAA CTAATTATTT GTCGAGTAAT AAGTATGATT CTTTCTAAGC TCTCAGCTTA A
|
Protein sequence | MKSNFSGAEL TFLGTSSGAP SFTRNVSSVA LRLENEVWLF DCGEATQHQL MRSKLKYAKI TRIFITHMHG DHIFGLPGLI CAISGARTAH SRSYGGSVVP LHIIGPPGIR QFIYSSITWS RSVLGMPLIV TELRQPARAS GGSPAAHTEQ LPIWTAYNDG NFCVRASVLR HPVPCFGYVI DECDASGRLD AEKCVEMGLP PGREYAMLKA GQSVTTKDGR VIRPEDVIGT PRPGRRLVHL GDTCDSSSMV SLAQGADSLI HESTFEAKKV SEALYKGHST ARMAGKFAAQ VNARALILTH FSNRYAGGLS A
|
| |