Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_12762 |
Symbol | |
ID | 5002846 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | + |
Start bp | 714532 |
End bp | 718595 |
Gene Length | 4064 bp |
Protein Length | 872 aa |
Translation table | |
GC content | 59% |
IMG OID | 640418267 |
Product | predicted protein |
Protein accession | XP_001418782 |
Protein GI | 145348699 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG4942] Membrane-bound metallopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCGGT ACATGGGGAA CTTCGCGCGA CCGGAGAACG CGATCAAGCG CGCGGTGCGC GCGACGCGAC GACGACGACG CGACGCGGGA CGACGACCGA CGGGGGACGG GAGAGGGTTT GGATTTGAGA TTCGAGCGCG CGGCGACGAC GGGAGGGGCG AAGCGAAAGG ACGAGGGCGC GCGAAGACTG ACGAGACGTC GCATCGATCG GACGCGCGGT GCAGGAGGAG CTGATCAACG TGGGACAGAA GCAGGCGGCG CTGCTGTCGC TGCACGAAAT CGTCACCAGT CGGCGAAACA GGCAGTGGAC CAAGGTTCTC GAGGAGGTCA TGTTCAAGTA CGTCGAGCTG TGCGTCGAGC TGAAGAAGGG CAGGCTGTGT AAGGATGGGT TGATGCAATA TCGCAACACG TGCTTGCTGG TGAACGTGCA GTCGCTCGAG GAGGTGGTGA AGCGATTTTT GAAGCTCTCG ACGGAACGCG CGGAGACGGC GCAGGCGGAG TTCGGCGCTA CTCTCGACGC GGACGTCGAT CTCGAGGCCG AGTTCACGCC AGAGTCGCTG TTGGCGAAGG CGTATCGACT CGACCACGAG AACGAGGCGA CGGAAAAGGA AACGGTGACG CCGTGGTTCA AGTTTTTGTG GGAAACGTAT CGCAACTTGC TCGACATCTT ACGCAACAAC AACAAGCTCG AGGGTTTGTA CGCCATGGTC GTGAAGGACG TGTTCAAGTT TTGCCTCAAG CACAAGCGCA CGACCGAATT CAGGCGCGCG TGCGACTTGA TGCGAACGCA CTTGAACAAC ATGGTCAAGT ACAAGGATAT GCGCGACAGA CCGGATTTGA GCTTACCCGA GACGCAAAAT CTCTACATGG AAGTGCGTTT CGAACAGCTG AAGGCGGCTA CGACGCTTGA AATGTGGCAA GAAGCGTTCC GATCGATCGA AGACATTCAC GGGTTGATGC TCTTGCTTCG CCGTTCGCCC AAGCCGCAGA TGATGGCGTT GTACTTTGCC AAGTTGACGG AAATTTTCTG GATTGGTAAA AATTACCTCC ACGCCGCGTA CGCGTGGATG AAGCTTTACA GCGTTAGTAA AACGTATAAC CGCAGCTTGA CCCCTGAAGA CGAACGCGCG TTGGCTTCGG GTGTCGTGTT GGCGACGATG TGCATAACGC CGTACACTGA AAAGTCCGTC TTTGGCGACA TGGACTCCGA TCATCAATTC GATCGCGACT CTCGCATGGC AAGTTTGCTC GGTTACCACA TCGATCGTAG TCGCAGCATC AGCGATGTGC TCTCGCGTGA ATTGTTGGCG GCCGAGATCA AGCGCAGCGG TTTGTTGGCC AAGGTGGACG ACGACGTCAA GCGTTTGTAC GCCCTCATGG AGCAAAGCTT TTCCCCGCTC GATCTGTGCA AGAAGGCTGA CGTGTTGTTT AACGTGTTAC AAGGCACGAC AATTGAAGTC AGTGAAGCCT CGCCGGTTTC CTCTTTCGAC TTCAACTCGT TCTTACCGAG ACTGAGATCG CTCGGCATTA TTCGCATGGT GCACCAGCAA TCCAAGGTGT TCGAGACGAT GAAGATTGAT TCTTTGAAAT CCTCCGTGCC GTTCATGCCG TATCACGAAG TCGAGCGTAT TCTCGTACAA GCGATTCGAA GTGATTACAT TTCCGTTCGA ATCGACCACG AGACTGGATC GATGAACTTT GTCGGCGATC GCCTCGAGAC GGGTTTCGTC AAGACGCACC TCTCTCGCGC CGCTCGTCTT TTGCAAGAAG GCATGAGCAA GCTCGCGCCC AAGACACCCG CCGATGTCGG CGCTCGTGTA CTCGGTGCCG AATTGCGCGC CGCCATCGAG GCCGAACACA AGCGTGCGCT GGCGCGCAAG GTGGTCATCG AGCGCCGCAA GGAGGAAGCC GAGCGCGCCG CCGCTGAGCA AGAAAAGGAG GAAGAGGCTA AGCGCGTCGC CGCGCAGCGC AAGCACGAAG AAAACGAGGC CAAGCGCCTC GAACAAGAAG CTCGTGCTCG CGAAGAGAAG AGAATTCGTG CGGAAATGGA GGAAAAAGAG AAACAAGAGG CTCTCGAGTT GCTCGCCGAG CAAGCCAAGC GCGCTGGCAA AAAGGCGCCG ATTGTCCTCG AGGAGGGCGT CGTCCTCGAC AAGCGAGCCA TCATGCAAGA TGCTATTCAA GAACAAATCA AGGCGCGTCA AGAGCAAGAA CGCAAATTGA ACAGTCTCGC TAAACGCATG GATCACGTCG AACGTGCCAA GCGCGAAGAA AGTATCAGTC TCATCGAGAA GGCGTACAAG GAGCGATCGG TGGATGACGA AAAGTACCAC GCCGAACAAC AAGTCGCGAT GGCTGCGAAA CATCGCGCCA AGTGGGAGGC GGAGAGCGCA GAAAAGCAGC GTTTGATGCG TATGGAAGGT CCGCGTGCCG AGTTCGCGCG TGGTGTCATG ATTCGTCGGG CCGAGCAATT CGCCGCTCAA GAGGAGGAAC GCGCGAGAAA GCTCGAGCAA ATGAAGAAGC ACAAGGAGGC TGAACGCTTG CTCCAAGCGA AAAAAGATTA CATCAAGCGT CTCCAAGACA TCGAGGCTGC GGTGAAGCAC GAAAAGCGCG AAAAGGATCG CATCGCGCAA GCTGAAAGAC GTCGTCAACA AGAAGAAGAG GAAGCTGCTA GAAGACAACG CGAACAACCG CCGGCACGCG GCGGCGACGA TCGCTGGGGC GCGTTCGCTG GGAGCGCGTT CGGCGAGCGT CGTCGGGTCG ACGAAGCCCC AGCGCGACCT CCGTCAGGTG GTAGCCGATG GGAACCGTCA AGAGGTGGTG ATCGTTACGA GCCTCGTCGC GACGACGGCC GGGGCGGCGG ATTCGGAGAT CGCGATCGCG CGCCGTCGAA TAGATGGGAG CGAGGCGGCG GCGATCGCCC GCCGTCGAGA AGCGGCGATG ACCGCCCGCC GTCGAGAAGT GGCGGTGCTG ACGATGGGAA ATATCGCCCG CCTCGACGCG ACGATGCACC GTCTAGAGGA GGAGGTTGGT AGACACCGCA AACAACATCG CTTCACATTG ATTGTAAATC GAGAGCATCG ATCGTCACAT TTGCCGTCAC ATCCATTCAT TCATCACGCT CGGCGCCGCG TCATGTGCAC CGAATCTATA TTAATGTTAC GCGTCAGCGC TCGCGCGCGC AAAAATTTGA AAATATTCTC CAAGAGCGCT TCACAAACCT CGGCCGACGC CCCGGAGCCG GAGACGCCGA GATACGTAAT CGAATACGTC GCGTCTCCGG TCTTTTCCAC GTGCTTTTGA AGTCGCCCGA TCCCCGACCC CGACGTGCAC TTGTAATACG CTCTTGGATA CTTCTGCCCA CTCAAAAACT TATTTCCGTA CTTGCGCCAT CGAAACCCAT CGTCCAGCCG AGTCGGATCT TCGCTCGTTT TCACCTCGAC GATGCGCGCG TCGCCGCGCG CCGTCGCCGT CGTCGCCGTC GCCGTCGTCA TCTCGCGCCG CTTCCGCGAC GATTTCGCCG TCGACCGCGC CTTCTTCACC CCCCCCTCCT CCGACGCGCG ACTCAGCGGC GCCCTCACCC CCCTCCCACC CTCCCCGCGT CCCTCCGCGA ACCCCACGCA CAGCGCCTCG AGCGATCGCT CGAATTTCTC CAGCCGTCCC CCGAACGTCC CATTCGGATC GTCGTTCTGA TCCCGCGCCT CCCAGAACCC GTCGTCGTCC AGCGCGCTCC ACGTCGACGT CCCCTCGCGC GCCGCGCGCG TCGTCCCCAT CGCCGATCGC GCCTCCCCGT CGTCCCCGCG CGTCGTCCGC GCGACGCTAA ACATAGCCGC CTCGTCGTCC TCGCGCGCCG CGCGCGTCGT CGCCGCCGCC GACCCGCGTC CTCGTCGACG CCCTCGCGCT CGACGCGCGC GTCACTATTA TTATTATCGC GTGTCCACCG ATCGCGTTTT CGTCCCTCGT CGTCGTCGTC GTCGTCGTCG TCGTCGTCCG CCTCGAGTCC GCGTGTGCGA CCGGCGCCGA TCGCGCGCGT CGCGAATATC TCGTGGAAAA CGATCCCTCA CTGA
|
Protein sequence | MSRYMGNFAR PENAIKRAEE LINVGQKQAA LLSLHEIVTS RRNRQWTKVL EEVMFKYVEL CVELKKGRLC KDGLMQYRNT CLLVNVQSLE EVVKRFLKLS TERAETAQAE FGATLDADVD LEAEFTPESL LAKAYRLDHE NEATEKETVT PWFKFLWETY RNLLDILRNN NKLEGLYAMV VKDVFKFCLK HKRTTEFRRA CDLMRTHLNN MVKYKDMRDR PDLSLPETQN LYMEVRFEQL KAATTLEMWQ EAFRSIEDIH GLMLLLRRSP KPQMMALYFA KLTEIFWIGK NYLHAAYAWM KLYSVSKTYN RSLTPEDERA LASGVVLATM CITPYTEKSV FGDMDSDHQF DRDSRMASLL GYHIDRSRSI SDVLSRELLA AEIKRSGLLA KVDDDVKRLY ALMEQSFSPL DLCKKADVLF NVLQGTTIEV SEASPVSSFD FNSFLPRLRS LGIIRMVHQQ SKVFETMKID SLKSSVPFMP YHEVERILVQ AIRSDYISVR IDHETGSMNF VGDRLETGFV KTHLSRAARL LQEGMSKLAP KTPADVGARV LGAELRAAIE AEHKRALARK VVIERRKEEA ERAAAEQEKE EEAKRVAAQR KHEENEAKRL EQEARAREEK RIRAEMEEKE KQEALELLAE QAKRAGKKAP IVLEEGVVLD KRAIMQDAIQ EQIKARQEQE RKLNSLAKRM DHVERAKREE SISLIEKAYK ERSVDDEKYH AEQQVAMAAK HRAKWEAESA EKQRLMRMEG PRAEFARGVM IRRAEQFAAQ EEERARKLEQ MKKHKEAERL LQAKKDYIKR LQDIEAAVKH EKREKDRIAQ AERRRQQEEE EAARRQREQP PARGGDDRWG APRVRPAPIA RVANISWKTI PH
|
| |