Gene Information |
Locus tag | OSTLU_19174 |
Symbol | PPR3_21 |
ID | 5006940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009375 |
Strand | + |
Start bp | 39426 |
End bp | 41315 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | |
GC content | 62% |
IMG OID | 640422361 |
Product | pentatricopeptide repeat (PPR) protein |
Protein accession | XP_001422791 |
Protein GI | 145357164 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00756] pentatricopeptide repeat domain (PPR motif) |
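The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a CDS that ends in a stop codon (the record does not state either explicitly). The function names are hypothetical and belong to no database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons; the terminal stop codon,
    if present, does not contribute a residue.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length(39426, 41315)
print(length)                  # 1890
print(protein_length(length))  # 629
```

Because the gene is listed as unspliced, the full 1890 bp span is coding: 630 codons, of which the last is the stop.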
| |
Plasmid Coverage information |
Num covering plasmid clones | 53 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0365143 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATGA ACGACGGCAC CGAACGCGAG ATCACGGTGG ACGCGCGGTT TAAGTGCGCG AAGTGCGCGC GCGGGAGCTC GTGCTTCGCC GTGCACCGAG CGACGAGCGC GGTGAGCGGG ATGCGAGCGA AACGGCGGTA TAACGAACGG AAAAAGGCGA AAAAGACGAA GAGTAAATAC GCGTCGGACG TCGAGGGCGG CGGGCGAGGG TCGAGGGCGC GCGCGGACGC GGATCCGGTC GGGGCGACGC TGGCGCTGAT ACGAGACGGA CACAGGCCGA CGGCGAAGAC GTTTACGGCG GTGATTAGCA CGCTAGGCGC GATGGGACGC GCGAAAGAGG CTTTGGACGA TGTGTTGCCG ATGATCGAGG AGTATGGCGA TGACGTCGAT GTGGCGGTGT GGAACGCCGT GGCGCACGCG TTTTGCGCCG GCGGAGACCC GGCGGGGGCG GAGAAAATCG TGGATAGAAT GGTCGGCGAA GACGGCGTGG GCGTGAACGG GGCGACGCAT CCGGAGATCA TATACACGTA CGCGAAGCGT GGAGAGGCAA ATAGAGTGTA TAGGTTGATC AGACGAATGT CGAGCGAGCA TGGAATAATC CCAGACGAAC GCGCGTATAA CGCGTTTTTG CGAGGCTTGT GCGAGCGGGA CGATCTCGAG GACGCCGAGG AAGTGCTCAG ACGATGGAAC AACGAAAAGT TCGACTTGGA GCGACAGAGT GACCGCGGTG GGCGAGTGTC GAAGCCGAGC GCGGCGTCAT ACGGGTTACT CATCGACGCG TGGACGCGTC GCGGAAACAT GCTCGCCGCG CGCAAGCTTT TACAACAGAT GCAATGGGAG CGCATCGCGC CCTCACTGCC GCTGTTTAAC ATGCTCATAG ACGGATACCT CAAGCAAGAA AACATGCGCG CTGCGGAAGG GCTGTTTCGC GAGCTCGAAT CGAGTGGAAC GTGGGATATG GAGTCGTTGG GGATCAAACC AGACAACGTG ACGTACACGT TGTTTTTAGA TTATTGGGCG AATCAAGGCC AAGTCGACGC GTGCGAGCGA ATCTTCAATC GCATGCATCG CAAGGAAGTC GCGCCAGACG TCACAGCTTA CGGGACGTTG GTAAAGGCGT ACGCGCGCGC GCGCGATTCC GACGGCGCTG AGGCGGTTTT GGATCGACTC GCCGAGGCAA AGGTGGCTCC GTCGGTGGCT ATTTACAGCG CCGTCGTCGC CGCACATTGT ACGATTGGTA ACATGTCGCG CGCGCGCGAC GTACTCGAGC GCATGTTCGA CGCGGGCTTG CGCCCGAACG AGCGTACGTT CGCTCATTTC GCGTGGGGAT ACGGCCAACT GGAAGACATC AACGGTATTG CCGAGGTGGC GAAGTTAATG CTCGCGAGCG GGCTCAAACT CAAGGGTGCG AACCGCACCG CCATCGTGCG CGCGTGCGAA GAGTGCGGAA TGAGCATGAG CGCCGTACAA GCGCTGCTGG ATCGAATCAA TCCCGAAATG ACGCAGCGCA AGGGCGTGTG GAAACGAGAC GGCGGCGAGC CGAAACCCAA ATCCAATCGA GCCGCCGCCG CCGCCGCCGA CGAGGACCTC GAACCGTCAA CGCTCGAACC GACGAAGACG AAAGAAATCT ACGGAGGTCC GGAGTCCACC GCGCGTCGGG TGTCGCTTTC CCAGCGTGTA GAGTCCCTCG ACGAAGACGA CGACGACGGC GATACGGGCG CCGTGGACGC GCCGCCGGCG AGCTCGGATT GGCCTCGCAA AGTCTCCACT CGCGCCGTCG CCATCGCGCA CCGCGCGTCC 
TCGCGCGCGC GCCCTATCCG TCGCACATTT ACACGAACGA ACGCGCGCGC GTTCGCGATC ACGCGAGCCC TCGGCGCTGC TTCGATGTAA
|
Protein sequence | MTMNDGTERE ITVDARFKCA KCARGSSCFA VHRATSAVSG MRAKRRYNER KKAKKTKSKY ASDVEGGGRG SRARADADPV GATLALIRDG HRPTAKTFTA VISTLGAMGR AKEALDDVLP MIEEYGDDVD VAVWNAVAHA FCAGGDPAGA EKIVDRMVGE DGVGVNGATH PEIIYTYAKR GEANRVYRLI RRMSSEHGII PDERAYNAFL RGLCERDDLE DAEEVLRRWN NEKFDLERQS DRGGRVSKPS AASYGLLIDA WTRRGNMLAA RKLLQQMQWE RIAPSLPLFN MLIDGYLKQE NMRAAEGLFR ELESSGTWDM ESLGIKPDNV TYTLFLDYWA NQGQVDACER IFNRMHRKEV APDVTAYGTL VKAYARARDS DGAEAVLDRL AEAKVAPSVA IYSAVVAAHC TIGNMSRARD VLERMFDAGL RPNERTFAHF AWGYGQLEDI NGIAEVAKLM LASGLKLKGA NRTAIVRACE ECGMSMSAVQ ALLDRINPEM TQRKGVWKRD GGEPKPKSNR AAAAAADEDL EPSTLEPTKT KEIYGGPEST ARRVSLSQRV ESLDEDDDDG DTGAVDAPPA SSDWPRKVST RAVAIAHRAS SRARPIRRTF TRTNARAFAI TRALGAASM
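The gene and protein sequences above can be cross-checked with a minimal sketch that computes GC content and translates the coding sequence. The Translation table field of this record is empty, so the standard genetic code is an assumption here. The helper names are hypothetical. Whitespace between the 10-character blocks is ignored.

```python
# Standard genetic code (assumed; the record leaves "Translation table" blank).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Codons containing anything other than A, C, G, T raise KeyError.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
prefix = "ATGACGATGA ACGACGGCAC CGAACGCGAG"
print(translate(prefix))  # MTMNDGTERE
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the reported GC content and the complete protein.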