Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_28133 |
Symbol | |
ID | 5006067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009370 |
Strand | - |
Start bp | 281912 |
End bp | 284734 |
Gene Length | 2823 bp |
Protein Length | 940 aa |
Translation table | |
GC content | 59% |
IMG OID | 640421488 |
Product | predicted protein |
Protein accession | XP_001422027 |
Protein GI | 145355558 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5113] Ubiquitin fusion degradation protein 2 |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.096306 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGGGG AGACGCTGGA GCGGGTCTTC TTCGCGCGGT TGGCGCGCGA CGACGGCGGC GGCGGCGGCG CGAACGCGGG GTTCGACGAG CGCGCGGAAC CGTACGCGTG GACGGTGGAG ACGTACCGGC GGGCGACGGA GGAACATCGA AGGTTGGGGA CGAAGAGCGA TGGGGCGTCG ACGGCGGCGC GGGAGGAGCT GCAGAGTTGC ATGGAATTTT GCGCGTCGTA CGGAGGGTTG TTGTTGAATC CGGCGCTCGC GGGGACGTTT CCGCAGAGCG AGTGGGCGGC CGGGCGAGGG GCGTGTCAGT TGTTGGACGC GATGCGGACG GTGGGTGGGA TACCGCACGG ATATTTGGAG CGATTGGCGA CGCGGTGCGA GGACGAAGGT TTGGACGAAA TCGCCGAGCG CGTGTTCGAC GAGTTGCGCG TGTCGACGCG AGGGATGAGT CCGTTGGGGG AGTTTGACGA GCACTTAAAG GTGATGTATC AGCTGTGCTC AGTGAAAGCG TTCGCGACGG CGCTCGTGAA GCACAAGCGG TGGGTGCCGA TGAAGAGTCA TCTGAGCGCG ATTAACGGGA GGCAGTTTGA GACGGAGAGC GTGCTCGGTT GGTTCTTCAG ACCGAGCGTG TTGCCGGACA TTCTCGGATG CGGCGAGCCC GACTGCGTGG GCCCGTACTT TAGTAACGTC ACGAAGCGAT TGAAGCGAGA CGTGGAGGCG TCGTACGGCA TGTTACGAGG CTGCGGCAAT CGCCTGGTCG AGGGACTGTA TCAGATTCTC TTTGTCATGT TGAAACACGG TGGCGACGTT CGCCAGGGCG TGCTGAACTA CCTAGATGCG TTCATGCGCG TCAACGCCGG GCGTGGCAAG ATGCGCATCC ATCCTCAAGT CGTCGCGTCG CACGGTGGTG CGCACAATTT GAGCATGGTG GCGTTGCGTC TGGCGATGCC GTTTTTGGAT CCGCAGAGCG GCAAGTACGA CAAAATCAGC CCGGCGTACG TACGAAGTCG CGCGTGCAGG ATCAATTTGA CGGACGAGAC GCGCGTCGCG TGCACCGCGG ACGAAGCTGT AGCGGCTAAA TTGTCGACGT CGGAAGACAA AGAAGATTGG GGATTCATTT GCGAGTGCTT TTACATCACC GGACGAGCGT TGCATTTGGG CTACGTCAAG TGCATCGCCG AATACGCGGC GTGCACGCGC GAGATCCAAG ACATGCGAGA GGCGGTGCGG GATTTACGAG GAATGTTAGA CCAGCAATTG ATGAGCTCGC CCGAGCGAGA GCGGTACGAG CGCAAACACG AAGAGATGAC TGCGGAGATT GAGCGCGCAC TCGAAAGAAA TTTGCAATTC GACTGCGCGC TTCGCGATCC GCGGCTGATC AGCGAGGCGA TGCAGTACTA TCGTCTCGTC GCTGTTTGGC TCATGCGTAT CGTCGCCACG AATGGGGACT ACGAGGCCGG GAACGGATTC ACCTTTGCTC AAATCACCAT GGACAAGTTC CCTCAGACGT GTCCGGTGGC GTTCGGGTGC TTGCCCGAGT ACGTCATCGA GGACTTGGTG GAGTTCATTC TGTACATCTC TCGCTACGCC CCCGACGCGC TCGATCACGA GCCGTTGGAT GAGATTATGA ACTTCTTCAT CACATTCATG GGCAACACGG CATTCGTGAA GAATCCGTAC TTGCGATGCA AATTCGTCGA AGTCTTGCGC CACTGGATCC CGTTTGAGGA TGGCTACCAA TCGCAAAAGC TCATGACCTT ATTCGAGGTG AACCCGGTGA GTTTGAAGAA CTTGATTCCG AGTCTGCTGT ATCTCTACGT CGACATCGAG TTCTCGGGAG GCGCGAACCA GTTCTACGAA AAGTTCAACG TTCGATATCA AATCGGTGAG CTTTGCGAAT ACTTGTGGTC CGTGCAATCG CACCGAAACG CATGGATCAA GCTCGCGAGC GAAGACCCTG AATTTTACAC TCGATTCCTG AACATGCTCA TCAATGATGC AATTTACTTA CTAGACGAGG CGATGAAAAA GCTTCCGGAG GTGCGCCAGA CGGAGACGGA CATGCAAGAC CAGGCGGCGT GGGAGGCGCG TCCGCAGCAA GAGCGCGAGG AACGCGAGAG CGAGTTTCGC CAGACACGGC GTCATTTGCG ATCTAACCTC ACGCTCGCCA TGGTGCACGT ACGCATGATG GCGTACACTT CGTGTGACAT CGCACATCCA TTCTTACGCC CCGAGATGGT CGAACGTGTC GCTGCGATGC TGAATTACTT CTTGCTCTTC CTCGCCGGTC CCGAGCGCCG AAAGCTGAAG ATTAAAAATC CCGAAAAGTA CGGCTGGGAA CCTAAAGAGC TTCTCGGCAT GATCACCGAC ATTTACGTCC AGATCTACGC CGCGGACAAG GACAAAGCGT TCATCGCCGC CATCGCCGCC GACGGCCGGT CCTATCGCGA CGAAGTCATG CTCGAAGCCG CCGCCATCGC GCGCGGTTTG CAGCTCCGCT CCGAGCGGCG CGTCGCCGCG TTCGAGAAAC TCGCCGCCGA CGCCCGCACG CGCGCCTCCG AGGACGAAGA AGAGGAGACC GATCTCGGCG ACATTCCCGA CGAGTTCCTC GACCCGATCT ACTGCACCCT CATGCGCGAT CCGGTCAAAC TTCCCAGCGG GCACTCGTGC GACAGGAGCA TCATCACTCG ACACTTGCTC AGCGACGAAA CCGACCCTTT CTCGCGCCAA CCCCTCACCG CGGACCAGCT CGTCCCGGAC GACGACTTAC GCGAGAAGAT CGCCGCCTTC ATCGCCGATC GCAAATCCGC GTCCGGGCGT TAG
|
Protein sequence | MTGETLERVF FARLARDDGG GGGANAGFDE RAEPYAWTVE TYRRATEEHR RLGTKSDGAS TAAREELQSC MEFCASYGGL LLNPALAGTF PQSEWAAGRG ACQLLDAMRT VGGIPHGYLE RLATRCEDEG LDEIAERVFD ELRVSTRGMS PLGEFDEHLK VMYQLCSVKA FATALVKHKR WVPMKSHLSA INGRQFETES VLGWFFRPSV LPDILGCGEP DCVGPYFSNV TKRLKRDVEA SYGMLRGCGN RLVEGLYQIL FVMLKHGGDV RQGVLNYLDA FMRVNAGRGK MRIHPQVVAS HGGAHNLSMV ALRLAMPFLD PQSGKYDKIS PAYVRSRACR INLTDETRVA CTADEAVAAK LSTSEDKEDW GFICECFYIT GRALHLGYVK CIAEYAACTR EIQDMREAVR DLRGMLDQQL MSSPERERYE RKHEEMTAEI ERALERNLQF DCALRDPRLI SEAMQYYRLV AVWLMRIVAT NGDYEAGNGF TFAQITMDKF PQTCPVAFGC LPEYVIEDLV EFILYISRYA PDALDHEPLD EIMNFFITFM GNTAFVKNPY LRCKFVEVLR HWIPFEDGYQ SQKLMTLFEV NPVSLKNLIP SLLYLYVDIE FSGGANQFYE KFNVRYQIGE LCEYLWSVQS HRNAWIKLAS EDPEFYTRFL NMLINDAIYL LDEAMKKLPE VRQTETDMQD QAAWEARPQQ EREERESEFR QTRRHLRSNL TLAMVHVRMM AYTSCDIAHP FLRPEMVERV AAMLNYFLLF LAGPERRKLK IKNPEKYGWE PKELLGMITD IYVQIYAADK DKAFIAAIAA DGRSYRDEVM LEAAAIARGL QLRSERRVAA FEKLAADART RASEDEEEET DLGDIPDEFL DPIYCTLMRD PVKLPSGHSC DRSIITRHLL SDETDPFSRQ PLTADQLVPD DDLREKIAAF IADRKSASGR
|
| |