Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_52141 |
Symbol | |
ID | 5006883 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009375 |
Strand | + |
Start bp | 155701 |
End bp | 160702 |
Gene Length | 5002 bp |
Protein Length | 608 aa |
Translation table | |
GC content | 62% |
IMG OID | 640422304 |
Product | predicted protein |
Protein accession | XP_001422825 |
Protein GI | 145357233 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.70287 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00225388 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | CGCCGCGCGT CGCGCGCCGA CGCCGAATCG CGCCGACGTA GCACATCGCC GTCATCGCCT CCGCGTCGAT CGCGCGTCGG CTTTCGCGCA GGAGCTCGAA CGCGCGCGTC GCGCGCGGCG TCGCGCGCGC GAACGCGTCG AGCGCGTCGC GGTCGTCGGC GCTCGAGCGC GTCGCGGTCG CGTCGGCGCG CGCGGCGCGG CGGCGGCGAA TCGGCGCCGG TCGACGGCGC GCGACGGCGC GCGCGGGCGC GCTCGAGCGC GCGGGCGAGC GCGCGCGGGC GGCGGCGCGC ATGCGTCGAC GCGCGGGCGC GCCCGACGGC GCGCGCGCGG CAGAGCGCGA CGCGGTGTGC GGCGAAAGGA TTCGTCATGA CGCGCGGTGA CGGCGCGCGC GACGGCGCGT CGACGCCGCG CGCCATCGGG CGCCTGCCGA GCGACGTCGT CAATCGCGTC GCCGCGGGAG AGGTGCGTCG AGGGCCGGCG CGCGACGGCG CGCGCGATGA GAGATTGAAT GTTTGAACGT CAGCTCGACG ACGGACGACG CGAACGCGCG ACTGACGACG CGAGACGACG CGACGACCGA ACGCGCGCGC AGGTGATCCA TCGACCGTCG AACGCGCTGA AAGAGCTGGT GGAGAACTCG TTGGACGCGG GCGCGAAGTC GATCGCGGTG ACGACGAGGG AGGGCGGGAA TAAACTGTTG CGAGTGCAAG ACGACGGACA CGGAGTGCGA ATAGAGGACT TGCCGCTGCT GTGCGAGCGA CACGCGACGA GTAAGATTGA AAAGTTTGAG GATTTAGCGC GATGCGAGAG CTTTGGGTTT CGAGGAGAGG CGCTGGCGAG CATGAGCTAC GTGGCGCACG TGTCGGCGAC GACGATGGCG GCGGGGGCGA CGCACGCGAC TCGAGCGACG TATACGGATG GGAAGATGGA TGCGGAGGGG GCGAAACCGA TCGCGGGGGT GTTAGGAACT ACGATTAGCG TGGAGAACTT GTTTTATAAC GTCGTGACTC GAAGGAAGGC GTTGAAGAGC GCGTCAGAGG AGTACTCGAA AGTGCTCGAG GTGTTGCAGA GGTACGCGGC GTTGCGAACG GATGTGGCGT TCACGTGTCG GAAGCACGGT GAGTCGCGAG CGACGTTGCA CACTCCCGTG GCGCAATCGC GCGTCGAGCG GTTGCAGGCG ATTTACGGTC CCACGGTGGC GAGAGATTTG AAGAAGCTCG ACTTCGACAG CGAGCTGTCC AAGAAAAAGT TTGATTTCAA GCTGCAAGTG GACGGTTTAG TGAGCGGTGG GAATTATCAT TCAAAGAAGA CGACGTTCAT TTTGTTCATC AATTCGCGTT TAGTGGAGTG CGCGCCGCTC AAGCGCGCGT GTGAGTCGGT GTACGCGGCG ATACTCCCCA AGGCTGAGAA GCCGTTTGTA TTCATGCACC TCCGCCTGCC GTTTGAAGAC GTCGACGTCA ACGTGCATCC CACGAAACAG GAGGTGCACT TTCTGCACCA AGAAGCCATT GTGGAGTTGA TTCAGTCCAA ACTAGAGAAG ATTCTTCTCG CGACGAATTC GTCGCGAACA TTCACCGTGC AAACACTGCT TCCTGGCGCG GAGAAACTGG CAAAGAAGGA TGACGAAAAC GACGCCGAGC GAAGCGGCGA CAAGGAAAAT AGCGAAAAAG CGGACGAACC GCCGGCGTCG CAGGCGAAGA CGATGCGGAC ACAGCGCGAA CGCGCGGGTG GTGATCACAA GCTCGTTCGC ACGGATGCGA ATTTAGCAGC GGGGAGTTTG GACGCGTACT TGCAGCGAGC GATGAATTCC GAGGGACGCG AACACGAGAA AATAGAAGAG GTTCGACGCG CGGTGAGAGA GCGTCGAGGA CAGCGCACGG AACCCGAAGA CACGTACGTG TGCGAGTTGA CGTCTATTCG CCAGCTTAAC ACCGAAATCG CCAATCGCGC GCACAAGGAG CTCGGCGACG TGATTAAAAA TCACACACTC GTCGGCGCCG TGGACGCGCG CAAAGGCGTG TGGTTACTTC AGCACCAAAC CAAGCTCTTC ATGGTGGACG CCGTAAAGCT CACCGAGGAA ATGTTCCATC AAATGGCTTT GAAGAACTTC GCCAACTTTG GGTACCAATC GCTGCAAGAT CCCGCGTCTT TGGCCGAACT CGCGCTGTGC GCGCTGGAGG ATAAATTCGT CGACGACGAA GAGTGGGACG CGAGCGATGG CTCCAAGGAG GAAGTCGCAG AGAAAATCGC AGAGATGCTC GTCGAAAAGG CGGACATGCT CAAGGAGTAT CTCGGCGTCG TCATCGACAA GGAACGGCGT CAGATCACCG GAGTGCCGTC GATGCTTCCC GGGTACGCGC CGGAAATCGG CAAACTTCCC GAGTTCGTCC TCGCCCTCGC CGAAGACGTC GATTGGACGA GTGAAAAAGA GTGCTTCGAA ACCTGCGCTC GAGTCATCGG CGCATTTTTC GCCATGGACT GCTCTTTCCA CGATCCGAAA GCCGAAGAAG GCGACGCCGA GTCCGACGCT CGTCGCGTCG CTCGCCTCTG CGTCTTTCCC GCGATGAAGC GCCGTCTCGC CCCGCCTCGT CGTTTCGCCG ACGACGGCAC CGTCATTCAG ATCGCGTGCC TCGAGCAGTT GTACAAAATT TTCGAGCGCT GTTAGTCGGT CCGCGTCGTC GTCGTCGCCC TCGAACGGCG TCGGCGTTCC GAATCGTCTC GCGGCGCGTC GCCCGGTCGT AAAGCTTGAA ATCGTGAACC AACCAGCCGT TCGATTCAAC CGACGCGCGT CGACCGCGCG CGCCTCGGTC GTCACTCCCG CGCGCGCCAC CGGCTCGACG GACGGACCGA CGCGACCGAC GCGACCGACG CGCGCGTTCG AATCGTGCAT GCGCGCCGCG CGTCTCGTCG CCGCGCGCGT CGCCGCGAGA ACGCCGACGT CATCGCGTCG CCTCGCCGCT CGAAAACTCA AAACGACGAC CCGCGCGCGC GACGCCGCGA TCGCCGAGCG AGCGAGCATG ACGACGGGCG AACGCTCGAA CGCGTCGAAG CTCGCGGCGG TGCGAGAGGC GATGGCGAAG CGAGGGGTGC GAGCGGTCGT CGTGCCGTCG CAGGATCCGC ACTTTAGGCG CGTCGGCGAA GCGAAGGCGA ACGAACGAAA CGAGGAACGA CGACGCGCGC GACGGGAAAG ACTGACGAAC GGGCGAGGGC GTGTTTTTGT GGGGAACGCA GTGAGTACGT GGCGGCGTGC TTCGAGCGAC GACGATGGTT GAGCGATTTT ACGGGGTCGG CGGGGACGGT GGTGGTGACG GACGCGGCGG CGTTGTTGTG GACGGATGGA CGGTATTTCG TGCAGGCTGA AGACGAGCTG AGCGAGGACT GGACTCTGAT GCGAAGTGGG GTGAAGGATG TGCCGGACGT GAAGAAGTGG TTGTGCGCGG AGGAGGCGGG ACTGGCGTTT ACCGGAGCCA AGGTGGGCAT CGATCCAAAC GTGCACTCGG TGAGCGAGGC GCGAGGTTTG AGAGAAGCGT TGAGCGCGTG CGGGATCGAG TTGATGAGCG TCGAAGAGAA CTTGGTAGAT TTGGTTTGGA GCGATCGTCC ACCGTTCCCG AAGACGCCGC TCAGAGTGCA CCCGATGGAG TACGCGGGGA AGAGCGTGGC GGAAAAATTG GAAAACCTTC GAGAAAAAAT GAAGGAAAAC GACGCGCAGA AGCTCGTCGT GAGCTCGTTG GATGACGTCA TGTGGCTATG CAATGTTCGA GGCGGTGATG CACCGTGTAA TCCGGTGACG TTGTCTTACG TCTTGGTGGG TGAAAACGAC GCTTCGTTTT ACGTCGACAC GGACAAGGCG ACGCCTGAAG TCGTGGCGCA TCTCGCCGAG GCAAACGTGA CGATCAAGCC GTACGAAGAC ATGGCCAAAG ACGTGTATGC CGCGGCACAG CGCGGTGAGC GACTCTGGAT GGACGTCGAT AAGGTCTCCA TCGCCATGCT CGAACAGGCT GAAGCCGGAG CCGCCGAAGC GCCCAAGGAT GCGAAAAAGG TGAAGACGGA GAGCGCGCCG TCCGCCATCA AGGAGGGCAC GTGTCCGGTC CCGATCGCAA AGGCGGTGAA GAATGAGGCC GAGATGGCCG GTATGGTCGA AGCCCACCTC ATGGATGGCG CTGCGATGGC TGAATTCTGG TGCGCGATCG AGCGAGACGT CGCCGAGGGG CGCGCCATTG ACGAGTACGA AGCTGGCGAG AGGGTCTTGG CGTGCCGAGC CAAGCAAAAC GGTTTCTTCG AAGAATCGTT CCCGACGATC GCGGGTGAAG GTCCTCATGG CGCCGTGGTG CACTACCGTG CTTCGAAAAA GAGCGCGAGG GCTATCGGTA AGGACAGCTT ATTACTCTGC GACAGCGGCG GCCAGTACGC GTGTGGCACG ACGGATGTCA CTCGAACGGT GCACTTCGGA ACGCCCACCG CTCATCAAAA GGAGTGCTAC ACGCGCGTGC TCCAAGGTCA CATCGCACTC GACCAAATGG TTTTCCCTGT CGGCACGAAA GGTTTCGTTC TCGACGCCTT TGCGCGATCG CACCTGTGGG CCAACGGCTT GGATTACCGT CACGGCACCG GCCACGGCGT CGGCGCGGCG CTCAACGTGC ACGAAGGTCC GCAAGGAATC TCTCCGCGTT TTGGAAACAT GACGCCCCTT ATGCCAGGAA TGATCTTGAG CAACGAGCCG GGGTATTACG AAGACGGTGC GTTCGGTATC CGCATCGAGA CGCTTCTGCA AGTGAAGGAG GCGAAGACTG CGCACAACTT CGGAGACACT GGATTTTTAT GCTTTGACGT CTTGACGTTG ATCCCGATTC AAACGAAACT CATGGACTTG AGCATTATGA GTGAAAAAGA AATCGCGTGG GTGAACGCGT ATCACGAAAA AGTTTGGCAA CAAATTTCCC CGCGAGTGTC GGGGGAGACT AAAACGTGGC TCGAACGCGC GTGTGCAAAG ATTTCCAAGT AG
|
Protein sequence | MAKRGVRAVV VPSQDPHFRR YVAACFERRR WLSDFTGSAG TVVVTDAAAL LWTDGRYFVQ AEDELSEDWT LMRSGVKDVP DVKKWLCAEE AGLAFTGAKV GIDPNVHSVS EARGLREALS ACGIELMSVE ENLVDLVWSD RPPFPKTPLR VHPMEYAGKS VAEKLENLRE KMKENDAQKL VVSSLDDVMW LCNVRGGDAP CNPVTLSYVL VGENDASFYV DTDKATPEVV AHLAEANVTI KPYEDMAKDV YAAAQRGERL WMDVDKVSIA MLEQAEAGAA EAPKDAKKVK TESAPSAIKE GTCPVPIAKA VKNEAEMAGM VEAHLMDGAA MAEFWCAIER DVAEGRAIDE YEAGERVLAC RAKQNGFFEE SFPTIAGEGP HGAVVHYRAS KKSARAIGKD SLLLCDSGGQ YACGTTDVTR TVHFGTPTAH QKECYTRVLQ GHIALDQMVF PVGTKGFVLD AFARSHLWAN GLDYRHGTGH GVGAALNVHE GPQGISPRFG NMTPLMPGMI LSNEPGYYED GAFGIRIETL LQVKEAKTAH NFGDTGFLCF DVLTLIPIQT KLMDLSIMSE KEIAWVNAYH EKVWQQISPR VSGETKTWLE RACAKISK
|
| |