Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31845 |
Symbol | SDG3511 |
ID | 5001720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 632728 |
End bp | 634692 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | |
GC content | 62% |
IMG OID | 640417141 |
Product | predicted protein |
Protein accession | XP_001418071 |
Protein GI | 145347218 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.120441 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCGC GCGACGACGC GCGGGACGAC GACGACGCGC GGCGCCTCGA GGCGGGCGCG CGCGCGCTCG AGGCGGCGCT GACGGTGGAC GACGACCCGC TGCGGTCGGG GGCGTCGGAC GTCGAGCGAC GACGACCGGC GTCGGCGCGG CGCGGACGAG GCGCGAGAGC GCTCGACGCG AGCGAGTGGG TGGCAAAGTT CGGCGTCGCG CGCAGGGGAG GGTGCGATGG ACGCGTCGGG ACGCGGGCGA CGGGGGCGAC CGAGGCGACG AGGGCGGCGA CGGCTGGAGG CGGCGGCGGC GAGGGAGGGA CGACGACGAC GAGCGGGAGG CTGGGATTGG TGGCGAAAGA ACGCGTGCGG GCGTGTGAGG TGCTGATCAG AGAGCGAGTG ACGTCGATTC AGGGGACGTT TCGGAGTCAA GGGGCGTGCC TGGCGAGACT CATGGAACGC TTGCGAGAAC GCGGAGAGCA AGACGCGAGT CGAGCGTGCG CGCCGTCGAG GGAGATGGCG AAGAAACTGC TCTCGAACGC TGAAGAAAGA GAGGAGGTTT TAGCGCTCGG GGCGAAATTA GGCGTGGAGG ACGAAGAGGA GTGGTTAAAG TTTGCGGCGT TTGTCAAATA TAACGCCGTG ACGAGAACGT GTCGACCGGC GACGACGAGC GCGGGCTCGA TGATAGGCGC CGATGCCTAC GTGCAGTGCT CAATGGTGAT GTTTGACGCA ATTAGCGCGT GCAACCACTC GTGTGACCCG AACGCGGAGG TGAGTCACGT GTCCGACGAG GGTGAGGTGT CGTTGTATTC GCTTCGCCCG ATTGAGCGCG GAGAGGGGAT AACAATTGCG TACGGAAAGC CTTCCCTCCG TTGGCTTCCA GCGCGCTGTC GAAAGAAAGC TTTACGCAGG GATTGGTATT TTGATTGCGC GTGCGCGCAG TGCAAGGCGG AAGTCGCCTC GGGATTAGCT GTAGATAAGC CTTTGGCGCG ACCGTGGGAC ATACAAGATC CGCGATGGTT CTTTTGCACA CACGACTACG TCACGGGATT CGAGTCGCAC TTTGACGGCG AAGGCAACTT ACTCTCGCTG AAATCGGCGA CTGCGTCTCG ATCGAATGTT TCACCGAGCG CGAGCGGGGC AAACACGTCT GAATCTGCCG CAGACGGCGT CGCCGCGGAA TTCGGCGCCA AGTTGCACGT TGGCGGCTTG GGACTCGCAG CCGATGCGAG CGGTAGTAGT AGTGGCTTAT GCGATTCATC TGATTCGTCT TCGTGCCAGT CGGTGGATTT AGACGAATAC GACCGCGAAT CGAGCGGAAG CGAGGACGAA GACGTCCTGC GATGGCACGA GCGCTGGCGC TCAAAACGCA TCAAGGCATA CAGCGTCAAG AACGTTGGAT TGTATACCCC TTTGCAGCTT TATCACGCCA TGCAGCGCTG TCAAATCAGA CAGGATCATT GGCAACTTCT TGTCGTGCGT GAGGCTTTGA TCAGTCAAAT CATGAACGAC GCGGCGATGA AAGGAAACGC GTCGTCTTCG CGCCCCAACG CTCCGAGTCT TGACGGCGGA TGGGGCGAGC GCGGGAAGTT CCAAGCCTTC AAGCTCATCT TAAATCAGTG TAGAAGTCTG GCGCGTATGG CGCCGAACAC AGGAAGTTTT GCAGAATTAT TTGCGACGCT CGAAAACATC GTGTACTGGT GGAGCACCGA CGGCTGGCAC TACGTCTCGA CGAAGCGCGA GCGCCGCGAG CAGGAGCGCG CTTTCTCGCG CGCTCGCGCG AGACGCGCGA GAAACGGCGC GGGAGAAACG GCGCGGGAAT CCGATCGCTC CGATTCCGTC GATGACGATG AATTTTTCGA AATTCCCGTC CGTTCGCGAT GGGCGTTTCG CTTGGAACGC CTGCGAGACG CCGCGCACGC CGACGTCTTG GCGTGGAACA TGCAGTTTGG GAAGCTGCCG AGCGCGCCGT TTTAG
|
Protein sequence | MTARDDARDD DDARRLEAGA RALEAALTVD DDPLRSGASD VERRRPASAR RGRGARALDA SEWVAKFGVA RRGGCDGRVG TRATGATEAT RAATAGGGGG EGGTTTTSGR LGLVAKERVR ACEVLIRERV TSIQGTFRSQ GACLARLMER LRERGEQDAS RACAPSREMA KKLLSNAEER EEVLALGAKL GVEDEEEWLK FAAFVKYNAV TRTCRPATTS AGSMIGADAY VQCSMVMFDA ISACNHSCDP NAEVSHVSDE GEVSLYSLRP IERGEGITIA YGKPSLRWLP ARCRKKALRR DWYFDCACAQ CKAEVASGLA VDKPLARPWD IQDPRWFFCT HDYVTGFESH FDGEGNLLSL KSATASRSNV SPSASGANTS ESAADGVAAE FGAKLHVGGL GLAADASGSS SGLCDSSDSS SCQSVDLDEY DRESSGSEDE DVLRWHERWR SKRIKAYSVK NVGLYTPLQL YHAMQRCQIR QDHWQLLVVR EALISQIMND AAMKGNASSS RPNAPSLDGG WGERGKFQAF KLILNQCRSL ARMAPNTGSF AELFATLENI VYWWSTDGWH YVSTKRERRE QERAFSRARA RRARNGAGET ARESDRSDSV DDDEFFEIPV RSRWAFRLER LRDAAHADVL AWNMQFGKLP SAPF
|
| |