Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_32074 |
Symbol | |
ID | 5002574 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | + |
Start bp | 226286 |
End bp | 228583 |
Gene Length | 2298 bp |
Protein Length | 714 aa |
Translation table | |
GC content | 50% |
IMG OID | 640417995 |
Product | predicted protein |
Protein accession | XP_001418186 |
Protein GI | 145347467 |
COG category | [R] General function prediction only |
COG ID | [COG3621] Patatin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0337279 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCTAT TAGCGCTGAG AACTTTGGGT ACTTTCGCTT TCGACGAAAC TAATAAACGT GCGATGCTCA AACTGCGCGA TCTCCACTCG ATTCTCGTCG TATTCGCGCT CCGCCCGGAA TTAAAGGCGG CGAGCGTTGC GGTGAAGAAG GAATCGATCC GAGTATTGGC TATTTTGGGC GAGAACGAGC TCGTGCGCCA GGCCACTGGT GCGCCTCCCA TCACTGGTCG TGGTATACGC ATTCTTGCGC TAGATGGTGG TGGTATACGC GGTAGAGCGA CGTTGAAGAT GTTGAAAAGA ATTGAGGTGA GGAATTATTT CATTTATTTA TTTAGAGCTT GAATTCACGA ATTTTTTGTT TCGTCGCGAT CAGGAAGGAA CTGGTCGTCC GATTCACGAA TCATTCGATT TGGTGTGCGG AACGTCTACT GGTGGCATAT TGGTACGAAA ACGCCCGTCT AGAGCGCACG AATTAATTAT TTCGCCGTGA ACAATTATTT TTTACTAACC AACACATTTT GCTCGTAGGC CACGGCGACG TCCATCAAAA AGCTCTCATT GGAGCACTGC GATAAGATTT ATGTAAATCT TGGCAGCAAA ATCTTCAGTC AAACAACGCA CAACGAAGAG ACCTCTGGAT CGAACTCTTG GCTCGGCAGT GTTGGTTCCA TGTACACGAG TGGAAAGCAA CAACTTCTCG CGACGACGCT CTATAGCAGC AAGCACAACA CCTCGACCTT TGAAACCCTC GTTAGACAAG AGTGCAACCC TGAAGCAGAA GAGCCAACGT GGATAGACAC CGCTGCGTCG GGCGGTCCGA AAGTTTTCTG CGTTTCTACC CAAACGAGTC AAAATCCGGC GCAGCCGTAC TTGTTCAGAA ATTACACGTA TCCGGCTGGC AGTACGAGTG CGTATTCCCA GGCAGGTAGT TGCGAATATC TATTGTGGCA GGGTGTCTGT GCATCCGCCG CTGCACCATA CTACTTGTAT GTTGACGCCT TTGCGATAGA AAACGAGCGC TGGGTTGATG GAGCCATGAC TTGCAACAAT CCGGCGATGA TGGGTGTCCA GGAAGCTCGA CGACTTTGGC CAGACAAGAA AATTGACTGT GTCGTGTCCC TCGGAAGTGG TAACTTCATC CCCCACGAGA GAGATCCACC CATTTCTCTC GTCGCTTTGG CCAAAGATGT CTTGTTTGAC AGCGCTTGCG ACACTGAACG CGTTCATGAA AGCTTGAGTA CGCTTTTGCC ACTCATACCC GGGGCGCAAT ATTTCAGGTT CAACCCGGTT GACGAGCGTT GCAAGATAGA AGTCGATGAA ACGGATGTTG GTGCACTCCA AGGCTTATTT GACGCTACTG AAGAGTACAT TGTGGCAGAG AAGGAGATGT TCGACAAGGT ATGCCATTTA TTGAGAGACG TCGACGACAC AGATGAGGTC ACCGCCAAAC TTCTCGACAC GGAAATTAGC GGAACGCGCA GTGGTGTTCT AGTTTTGGAA GCGCCCCGTT ACGAAGAGGA ACTTTCTGAA TGTACTTCTG CCTTGAAGAA TTTTTGTGCT TTAAGATCGA TATCGATACA GTGCGCAGAT TACTCCGCCA ATAGACCCAT GAATCCGGGC GAAGCACTGA GCCATCTGAA CACGGTGGCG GAGACGTCGA CTGCAGCAGT GATTCATTTC AACTGCCACG CCGATTCTGA CGGATTGATT CTCACTTGGC AGAAAGATGT CACTGCTATT GCTGAGCCTA GCTCGGTCGC TGAACTATTT CTCAGTAGGT CTGGTAGCCC GTATGCGTCA GTCAGCGAAC ACTGTGAAGC TGAGGCCCAC ATTGAGGTGC ATGGAATATT GCATACTTTC TCAGGCAAGC ACGTGCAAGT GAACGATGTC GGCGAGCGCA CGTCGTCATA CTTGTTTAAG CGTACTGTCC CAATGGACTA CCTGGACGGA TCGACGAGTC GAGAGTTATT TGGTTTATGG CGTGGGAAGA TCATCGTTTC TCAAAGTTCA CTTCCATCCT CACTCGTTGC GGCGTGGCTT GAAGCCGGGG CGAAATGTGT GGTCGCACCG TGCAAGGAAG GCGGCGTCGT TAACGTTGAA AGCGAACAGA CAGACTTTAT GGCTGCATTC TATCACGCAC TTTTTGTTGT GGGCGCGGAT GCTACTGCGG CGATGAGCGC AGCTGCCATC GTGCAGCCGG CGTGCTCGTA TTACCGTTGT CACGTTCTTG TTCAAGGTGG AATCGTGGCA CTTCGTCCTG ACGAAGAGTT TGACTATGAT TTGGATGCGC ACGTATAG
|
Protein sequence | MRLLALRTLG TFAFDETNKR AMLKLRDLHS ILVVFALRPE LKAASVAVKK ESIRVLAILG ENELVRQATG APPITGRGIR ILALDGGGIR GRATLKMLKR IEEGTGRPIH ESFDLVCGTS TGGILATATS IKKLSLEHCD KIYVNLGSKI FSQTTHNEET SGSNSWLGSV GSMYTSGKQQ LLATTLYSSK HNTSTFETLV RQECNPEAEE PTWIDTAASG GPKVFCVSTQ TSQNPAQPYL FRNYTYPAGS TSAYSQAGSC EYLLWQGVCA SAAAPYYLYV DAFAIENERW VDGAMTCNNP AMMGVQEARR LWPDKKIDCV VSLGSGNFIP HERDPPISLV ALAKDVLFDS ACDTERVHES LSTLLPLIPG AQYFRFNPVD ERCKIEVDET DVGALQGLFD ATEEYIVAEK EMFDKVCHLL RDVDDTDEVT AKLLDTEISG TRSGVLVLEA PRYEEELSEC TSALKNFCAL RSISIQCADY SANRPMNPGE ALSHLNTVAE TSTAAVIHFN CHADSDGLIL TWQKDVTAIA EPSSVAELFL SRSGSPYASV SEHCEAEAHI EVHGILHTFS GKHVQVNDVG ERTSSYLFKR TVPMDYLDGS TSRELFGLWR GKIIVSQSSL PSSLVAAWLE AGAKCVVAPC KEGGVVNVES EQTDFMAAFY HALFVVGADA TAAMSAAAIV QPACSYYRCH VLVQGGIVAL RPDEEFDYDL DAHV
|
| |