Gene Information |
Locus tag | OSTLU_28748 |
Symbol | |
ID | 4999405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | - |
Start bp | 242111 |
End bp | 244918 |
Gene Length | 2808 bp |
Protein Length | 935 aa |
Translation table | |
GC content | 51% |
IMG OID | 640414826 |
Product | predicted protein |
Protein accession | XP_001415777 |
Protein GI | 145341353 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
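The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are inclusive and that the protein length excludes a single terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from inclusive start and end coordinates (assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Codon count minus one, assuming an unspliced CDS with one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(242111, 244918)
protein = expected_protein_length_aa(length)
print(length, protein)  # prints: 2808 935
```

Both results agree with the Gene Length (2808 bp) and Protein Length (935 aa) fields, which is consistent with the gene being listed as unspliced.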
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGGACG TCGCGCGAGG CGCGTTGACC GAGTTGGATT TGGAAAAGTA CGCGCGCTTG ACGACACTGC GCGTGGTGAA GTGCGGTGAT GTGGTGTCGT TGATCGGATT GGGGGCGTGC GAGAGTCTGC GGAACGTGAC GGTGGCGGAG TGCGGCCTGA GGTCTTTGGA AGGCGCGAGC GAGTGTCGCG GGTTGGAGGC GTTGTACGCG TACGGGAACG CGATTGATTC GCTCGATGGG CTCGCGGATG GACGGCTTGA GCGGCTGCAC ACGCTTTGGT TGAACGATAA TCGTTTAATG TCGCTCGATG GGATCGAGAC GTTGATGGCG CTGCGGGATC TGAACGTCGC GCGCAACCGA CTCGAAAGCG TTCCGGACGC CGATGCGTTG CGCGAGCTGA ATCACCTCAA CATCGCTGGT AATCCCATCA AGTCGCTCGC GTACGTTGGG GTGACTCGTT TCATACATTC GATCACGTTT AAGGACGACG TACACGGCGC GTGTGACTTT TGTCGAGGGC GATGGTATCG CTCGTACATC TTGCGAGCCG TGTCCAATTT GCGCGTGCTG GACGGAGTCG ATATATCGGA AGACGAGCGA GCGAGAGCGA GCGATGAACA CGCAGAAAGA GAGTTATGGT ATCATATGCG ATGCGCGAGG ATTCGCAAAT CGTTTGATGA CGCGCGTGAT GTCGCTCTGG AACATTTGCG TATACTCACG GCTTCCGTCG AACGCGATTT GTTGCGCGCC AGAGCCGAAG GAAGATGGCT TGATGTTGAG CGCGATCTTT CCGACATAAG TCACTTCGAA ATCTCTTTCA CGAAAATCGA GCGTCGAGCA CGCGAGGCGT CGCTCGTGCT GATCGCGCGC GTTTTTCAAG ATGCGGAAAT TGTCGCAGAG ACGGCGCGCG AAGAACGATT CGCATTCGCG AAAGCCGAAT CAGCTGCACA GAGCGTAGAC GACGTCTTCA TCGAGATCTT CGGTAAATTC GGACGTGTTC GTAAAAAACG TGCGCCTCAA AAACGTAAAA GCGCGTGGAT TTTGACGCGA AGGATAAGCG ACATTGTGAT CGCATCGAAC ATGACTGAGA TCAATCTGCA CTCCGCTGAG CTCCGCGGCA TTCCACAGGA AATCGGCGAG TGTGTAAACC TACAAACATT GATTCTGAGC GATAATGCAA TCAAACGGCT TCAAGGTCTC CCTCAAATGA ACAGTCTAAA ATGGTTGGAT CTCGGGAACA ATCACTTGTG GAATTCGGAG GATTTGAACG TCTTGACGAC GCGTGCGCGA AACGTCACCT CGTTGATTCT TCGAGGTAAT CGCAAGTGGT TGGCAAAGAG TAAATTCTAC GCCCCAATTC TGGTCAAACG CTCGCCAACC TTGACACACT TGGACGGCAT CGAAGTCACA TCTCGTATGC GAGCGCATTA TCGCGTCGTT GGATGCAGAC TGAACGCACG CGCAGTGAGA CGTCGTGGTA AACTGTACCG TGACGCCGCG CAGTCCGCGT CGGAAATCGT CGAATTCAGC TTGGAAGATG AATCTCTCAG GAAAGTAGAA CTGTGCGGAG AATTCAACAC GCTCATCGTC GCATACCTCA TGAATAACTG CCTTAAATCG ATCAAAAGTT TCGGCGTCAC CTGCCGCCAC CTGAGGCACT TATCCGTGGA GGGAAATGCG ATGATACGTT TGGACGGTTT GAGTTTGCTG AAGGAATTGA AATTCCTCAA CGTGCGCAAC TGCATGATCA AAATTCTTTC CCCATCCTGG TTCAAGCCAT TGATAAAGTT ACTCTTCATC 
AACGTCGAGC GAAACGAACT CCAATCGCTC AGTGGACTTG AACAGTGTCG CTCAATTCGC GAAATATACG CGTCGCACAA CATGTTGGCG GAAACGAGTA GCGTCGTCGC GCTCGCCGCA CTTCCCAACC TCAGGTTACT CACGTTGTAC GGGAACGTCA TGTGTGATTT GAAAACGTAC CCGCACTACG TGATATTCAA ATTCAATCAA TTGTCCATAT TGGATACCGA TTATATCAGC GAGGCAGCGC GAGCGGATGC GACAAAAATT TACTCTGGTC GTCTCACGGA GGACATGATT GTTTCCGGTG AACACGCCGC GGGAGGAGAT GAAATCTCAT TGGTCGCACT CGGATTGTTT CATCTTGATC ATAATGTCGT GAACGCGCGC TTCCAAGGAA TCAAACGCAT GAATTTGGAA AACAATAATC TCAGCGATGT TTCCGCGCTG GCTAGTTTGC CACGACTCAC GCGGCTCGAA TTGCGCAACA ATAGGGTTAA TGCCAAGTTC GGTCGCGCCA GTTCTTTCAG CAAACTCAAA TATCTTGATT TGAGCGGGAA CTACATATCA TCGCTCAGCG TGCTCGCGCT CGGTTCGTGC TCAAGTCTTC AAACGCTTCT TTTAAGCGAC AACTTTCTCA CCAGGCTCGA CGGCATAAAT CAGCTCAAGT CTTTGCGAGT ACTCAAGGTT GACAAAAACA AATTAAGTAG GATCGACTCA AGCACATTCG ATGGCTGTGA AAGTTTGCGG GTTTTATGCA TGCGTAAGAA CGCATTTCGA ACTCTGAAAC ACGTAACAAA GCTTGCCTAC GTGCGCGAGT TACATCTTGA CGAAAATCGA GTGGACGACT TGGAGGAGAT TAGTTGGTTG GCGTATCTGA CGCGCCTACG GATTTTGTCC CTCGCGCGTA ACAAAGTTTC AAGCGAATAT CAGAGATATG TCGAGTTCGT GACGACGTGC TGCAGAAGTC TGGAGCAACT CGATGGGGAA AAGCTAGTAT ATAAATAA
|
Protein sequence | MGDVARGALT ELDLEKYARL TTLRVVKCGD VVSLIGLGAC ESLRNVTVAE CGLRSLEGAS ECRGLEALYA YGNAIDSLDG LADGRLERLH TLWLNDNRLM SLDGIETLMA LRDLNVARNR LESVPDADAL RELNHLNIAG NPIKSLAYVG VTRFIHSITF KDDVHGACDF CRGRWYRSYI LRAVSNLRVL DGVDISEDER ARASDEHAER ELWYHMRCAR IRKSFDDARD VALEHLRILT ASVERDLLRA RAEGRWLDVE RDLSDISHFE ISFTKIERRA REASLVLIAR VFQDAEIVAE TAREERFAFA KAESAAQSVD DVFIEIFGKF GRVRKKRAPQ KRKSAWILTR RISDIVIASN MTEINLHSAE LRGIPQEIGE CVNLQTLILS DNAIKRLQGL PQMNSLKWLD LGNNHLWNSE DLNVLTTRAR NVTSLILRGN RKWLAKSKFY APILVKRSPT LTHLDGIEVT SRMRAHYRVV GCRLNARAVR RRGKLYRDAA QSASEIVEFS LEDESLRKVE LCGEFNTLIV AYLMNNCLKS IKSFGVTCRH LRHLSVEGNA MIRLDGLSLL KELKFLNVRN CMIKILSPSW FKPLIKLLFI NVERNELQSL SGLEQCRSIR EIYASHNMLA ETSSVVALAA LPNLRLLTLY GNVMCDLKTY PHYVIFKFNQ LSILDTDYIS EAARADATKI YSGRLTEDMI VSGEHAAGGD EISLVALGLF HLDHNVVNAR FQGIKRMNLE NNNLSDVSAL ASLPRLTRLE LRNNRVNAKF GRASSFSKLK YLDLSGNYIS SLSVLALGSC SSLQTLLLSD NFLTRLDGIN QLKSLRVLKV DKNKLSRIDS STFDGCESLR VLCMRKNAFR TLKHVTKLAY VRELHLDENR VDDLEEISWL AYLTRLRILS LARNKVSSEY QRYVEFVTTC CRSLEQLDGE KLVYK
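The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, assuming the standard genetic code (the Translation table field in this record is blank, so this is an assumption) and assuming the gene sequence is already given in coding orientation; `translate` and the constant names are hypothetical helpers.

```python
# Standard genetic code (assumed), laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # raises KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 18 nt of the gene sequence in this record.
print(translate("ATGGGGGACG TCGCGCGAGG CGCGTTGACC"))  # prints: MGDVARGALT
print(translate("AAGCTAGTAT ATAAATAA"))               # prints: KLVYK
```

The two fragments match the first ten and last five residues of the protein sequence above. Passing the full gene sequence to the same function would extend the check to all 935 residues.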
|
| |