Gene OSTLU_28133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_28133 
Symbol 
ID5006067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009370 
Strand
Start bp281912 
End bp284734 
Gene Length2823 bp 
Protein Length940 aa 
Translation table 
GC content59% 
IMG OID640421488 
Productpredicted protein 
Protein accessionXP_001422027 
Protein GI145355558 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5113] Ubiquitin fusion degradation protein 2 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.096306 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGGG AGACGCTGGA GCGGGTCTTC TTCGCGCGGT TGGCGCGCGA CGACGGCGGC 
GGCGGCGGCG CGAACGCGGG GTTCGACGAG CGCGCGGAAC CGTACGCGTG GACGGTGGAG
ACGTACCGGC GGGCGACGGA GGAACATCGA AGGTTGGGGA CGAAGAGCGA TGGGGCGTCG
ACGGCGGCGC GGGAGGAGCT GCAGAGTTGC ATGGAATTTT GCGCGTCGTA CGGAGGGTTG
TTGTTGAATC CGGCGCTCGC GGGGACGTTT CCGCAGAGCG AGTGGGCGGC CGGGCGAGGG
GCGTGTCAGT TGTTGGACGC GATGCGGACG GTGGGTGGGA TACCGCACGG ATATTTGGAG
CGATTGGCGA CGCGGTGCGA GGACGAAGGT TTGGACGAAA TCGCCGAGCG CGTGTTCGAC
GAGTTGCGCG TGTCGACGCG AGGGATGAGT CCGTTGGGGG AGTTTGACGA GCACTTAAAG
GTGATGTATC AGCTGTGCTC AGTGAAAGCG TTCGCGACGG CGCTCGTGAA GCACAAGCGG
TGGGTGCCGA TGAAGAGTCA TCTGAGCGCG ATTAACGGGA GGCAGTTTGA GACGGAGAGC
GTGCTCGGTT GGTTCTTCAG ACCGAGCGTG TTGCCGGACA TTCTCGGATG CGGCGAGCCC
GACTGCGTGG GCCCGTACTT TAGTAACGTC ACGAAGCGAT TGAAGCGAGA CGTGGAGGCG
TCGTACGGCA TGTTACGAGG CTGCGGCAAT CGCCTGGTCG AGGGACTGTA TCAGATTCTC
TTTGTCATGT TGAAACACGG TGGCGACGTT CGCCAGGGCG TGCTGAACTA CCTAGATGCG
TTCATGCGCG TCAACGCCGG GCGTGGCAAG ATGCGCATCC ATCCTCAAGT CGTCGCGTCG
CACGGTGGTG CGCACAATTT GAGCATGGTG GCGTTGCGTC TGGCGATGCC GTTTTTGGAT
CCGCAGAGCG GCAAGTACGA CAAAATCAGC CCGGCGTACG TACGAAGTCG CGCGTGCAGG
ATCAATTTGA CGGACGAGAC GCGCGTCGCG TGCACCGCGG ACGAAGCTGT AGCGGCTAAA
TTGTCGACGT CGGAAGACAA AGAAGATTGG GGATTCATTT GCGAGTGCTT TTACATCACC
GGACGAGCGT TGCATTTGGG CTACGTCAAG TGCATCGCCG AATACGCGGC GTGCACGCGC
GAGATCCAAG ACATGCGAGA GGCGGTGCGG GATTTACGAG GAATGTTAGA CCAGCAATTG
ATGAGCTCGC CCGAGCGAGA GCGGTACGAG CGCAAACACG AAGAGATGAC TGCGGAGATT
GAGCGCGCAC TCGAAAGAAA TTTGCAATTC GACTGCGCGC TTCGCGATCC GCGGCTGATC
AGCGAGGCGA TGCAGTACTA TCGTCTCGTC GCTGTTTGGC TCATGCGTAT CGTCGCCACG
AATGGGGACT ACGAGGCCGG GAACGGATTC ACCTTTGCTC AAATCACCAT GGACAAGTTC
CCTCAGACGT GTCCGGTGGC GTTCGGGTGC TTGCCCGAGT ACGTCATCGA GGACTTGGTG
GAGTTCATTC TGTACATCTC TCGCTACGCC CCCGACGCGC TCGATCACGA GCCGTTGGAT
GAGATTATGA ACTTCTTCAT CACATTCATG GGCAACACGG CATTCGTGAA GAATCCGTAC
TTGCGATGCA AATTCGTCGA AGTCTTGCGC CACTGGATCC CGTTTGAGGA TGGCTACCAA
TCGCAAAAGC TCATGACCTT ATTCGAGGTG AACCCGGTGA GTTTGAAGAA CTTGATTCCG
AGTCTGCTGT ATCTCTACGT CGACATCGAG TTCTCGGGAG GCGCGAACCA GTTCTACGAA
AAGTTCAACG TTCGATATCA AATCGGTGAG CTTTGCGAAT ACTTGTGGTC CGTGCAATCG
CACCGAAACG CATGGATCAA GCTCGCGAGC GAAGACCCTG AATTTTACAC TCGATTCCTG
AACATGCTCA TCAATGATGC AATTTACTTA CTAGACGAGG CGATGAAAAA GCTTCCGGAG
GTGCGCCAGA CGGAGACGGA CATGCAAGAC CAGGCGGCGT GGGAGGCGCG TCCGCAGCAA
GAGCGCGAGG AACGCGAGAG CGAGTTTCGC CAGACACGGC GTCATTTGCG ATCTAACCTC
ACGCTCGCCA TGGTGCACGT ACGCATGATG GCGTACACTT CGTGTGACAT CGCACATCCA
TTCTTACGCC CCGAGATGGT CGAACGTGTC GCTGCGATGC TGAATTACTT CTTGCTCTTC
CTCGCCGGTC CCGAGCGCCG AAAGCTGAAG ATTAAAAATC CCGAAAAGTA CGGCTGGGAA
CCTAAAGAGC TTCTCGGCAT GATCACCGAC ATTTACGTCC AGATCTACGC CGCGGACAAG
GACAAAGCGT TCATCGCCGC CATCGCCGCC GACGGCCGGT CCTATCGCGA CGAAGTCATG
CTCGAAGCCG CCGCCATCGC GCGCGGTTTG CAGCTCCGCT CCGAGCGGCG CGTCGCCGCG
TTCGAGAAAC TCGCCGCCGA CGCCCGCACG CGCGCCTCCG AGGACGAAGA AGAGGAGACC
GATCTCGGCG ACATTCCCGA CGAGTTCCTC GACCCGATCT ACTGCACCCT CATGCGCGAT
CCGGTCAAAC TTCCCAGCGG GCACTCGTGC GACAGGAGCA TCATCACTCG ACACTTGCTC
AGCGACGAAA CCGACCCTTT CTCGCGCCAA CCCCTCACCG CGGACCAGCT CGTCCCGGAC
GACGACTTAC GCGAGAAGAT CGCCGCCTTC ATCGCCGATC GCAAATCCGC GTCCGGGCGT
TAG
 
Protein sequence
MTGETLERVF FARLARDDGG GGGANAGFDE RAEPYAWTVE TYRRATEEHR RLGTKSDGAS 
TAAREELQSC MEFCASYGGL LLNPALAGTF PQSEWAAGRG ACQLLDAMRT VGGIPHGYLE
RLATRCEDEG LDEIAERVFD ELRVSTRGMS PLGEFDEHLK VMYQLCSVKA FATALVKHKR
WVPMKSHLSA INGRQFETES VLGWFFRPSV LPDILGCGEP DCVGPYFSNV TKRLKRDVEA
SYGMLRGCGN RLVEGLYQIL FVMLKHGGDV RQGVLNYLDA FMRVNAGRGK MRIHPQVVAS
HGGAHNLSMV ALRLAMPFLD PQSGKYDKIS PAYVRSRACR INLTDETRVA CTADEAVAAK
LSTSEDKEDW GFICECFYIT GRALHLGYVK CIAEYAACTR EIQDMREAVR DLRGMLDQQL
MSSPERERYE RKHEEMTAEI ERALERNLQF DCALRDPRLI SEAMQYYRLV AVWLMRIVAT
NGDYEAGNGF TFAQITMDKF PQTCPVAFGC LPEYVIEDLV EFILYISRYA PDALDHEPLD
EIMNFFITFM GNTAFVKNPY LRCKFVEVLR HWIPFEDGYQ SQKLMTLFEV NPVSLKNLIP
SLLYLYVDIE FSGGANQFYE KFNVRYQIGE LCEYLWSVQS HRNAWIKLAS EDPEFYTRFL
NMLINDAIYL LDEAMKKLPE VRQTETDMQD QAAWEARPQQ EREERESEFR QTRRHLRSNL
TLAMVHVRMM AYTSCDIAHP FLRPEMVERV AAMLNYFLLF LAGPERRKLK IKNPEKYGWE
PKELLGMITD IYVQIYAADK DKAFIAAIAA DGRSYRDEVM LEAAAIARGL QLRSERRVAA
FEKLAADART RASEDEEEET DLGDIPDEFL DPIYCTLMRD PVKLPSGHSC DRSIITRHLL
SDETDPFSRQ PLTADQLVPD DDLREKIAAF IADRKSASGR