Gene OSTLU_33783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33783 
Symbol 
ID5000675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp451129 
End bp453489 
Gene Length2361 bp 
Protein Length786 aa 
Translation table 
GC content52% 
IMG OID640416096 
Productpredicted protein 
Protein accessionXP_001416954 
Protein GI145344884 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5647] Cullin, a subunit of E3 ubiquitin ligase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.365007 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGCG CGGCGAACGC GCGCGACGCG GTCGACGGCG ACGACGCGCG CGCGCGCGCG 
GCGAGCGACG CGCGCGCGCG CGAGCGCGAG CGCGACGGCG TCGTGGGACG GAAACGCGAC
GCGTCGGGAC AGGCGTCGCG CGCGCGCTCG CGCGCGGCGA CGACGGCGAC GACGACGGCG
TTGGAACCGT TCAGACATCG CGTGGACGCC GATCCGTCGT TCGTGGAGAC GACGCTTCGA
ACGCTTCGAA CGGCGACGAC GGAGCTGTTG AACCTGAGCA GCGAAGGGTT GTCGTACGAG
GAGCTGTACG GTAAGGCGTA CGCGTTGGTG CTTCGGAAAC AGGGTGATGC GCTGTATAAC
ACCATAAGCG ACGCGGTAAC AGATCATTTG TGCTTGCACG TGGCGAGTAA GATCGCGGAC
GTGGTGGGGG ATGTTGAATT TTTGAAGGAT TTGGAGACGC GATTTGCGCG GCACAGAAAG
AGCGCGCAAA TGCTGACCGA CGTCTTCATC TATCTCGATC GCGTGCACTT GAAGCGGAGC
GGGAACGCGA ACTTAGAACC CGTGGGGGAT CTCGTAATAA CGCTTTGGAG GGAGTGTGTG
GTAAATAATC CACGCATTAG ACGACGGATG CACTCGTGCA TGTTAGATTT GATCCGCCGC
GAGCGCGACG GCGAGAGCGT GGATAGGGAC GCGTTGCAAA AGGTGACGTC CATGTTACTC
ACTTTGCACG AATCTGTTTA CGTAGATGAG TTCGAAGTTA AAATGCTCGA TGAGACAAGA
TCTTACTACA AGGCGGTGGC GCAGAAACGA ATCGATATCG ACGACTGCCC GACGTTCTTG
AGGATGGCGG AGGCGAGACT GGCGCAAGAG AAGGACCGAA GTGAAGCTTA CATGGCTCCT
CGAACAACAG GCCTTTTACT TGAGCAAGCG CGCAACCAGT TATTGAAGGA GATGTCACAA
TCACTATTGC ATAACGCTAC GAGTGGCATG GTGCACATGC TTCGAGCGAA CCAGATCGAA
AACTTGCGTC GCATGTACTC GCTGTTTTCG ACGATGGACG ATCTCGAGGG TATTCCAGAC
GTGATGTTCA ATCATCTTAA GGAAATCGGC AAGTCGATCG TGAATGATTT AGAAAATGAA
AAGAATCCGA CACAGTTTGT CGAAGAGCTT TTCAAATTCA AGGAGAAGTA TGACACAATC
TTGATCGAGG CGTTTGCAAA TAATCGCCTC ATCGAGTCGC AGTGCAATCA GGCGTATCAA
CTCGTCGCAA ACTTGAATCC TAGATCACCC GAGTATTTGT CGCTTTATTT GGATCACATG
TTGAGGAAAT CGTCAAAAGA CGCGAGTCAG AGCGAACTGG AGATCATCCT GAACCGATCG
ATGGGGCTCT TCCACCTATT TCACGAGAAG GATGTGTTTG AGAATTATTA TCGCCAACAC
TTGTCGAAGA GGTTGCTAAA CAAGCGCTCC GCAAGCGACG ATAACGAGCT CGCGTTCATC
GGTAAACTCA AGGACGATTG TGGATTTACA TTCACGAGTA GAATGGAGGG CATGTTCAAC
GACATGCTCA CTTCGGGTGA CTTGACGAGG GAGTTTGAAG GTGTTTACTC AAGAGGCTCG
GGATCGATGG AAGTAAATGT CTCGGTGTTG ACCACCGGAG CCTGGCCCTT GAAGGTGCAT
AAAACTCCGA TCAACTTACC CCATGAATGC GAGAGGACGT GCAAGGTTTT TGAAAATTTC
TACCTCTCGC GTCACGCCGG TCGAAAGCTC ACTTGGCAGG CGAACATGGG CCGGGCCGAC
ATTAAGGCTA GGTTTGCGAG CGGTGAATAT GAAATTTCTG CGTCGACGCT GCACATGTGT
GTCCTCATGC TTTTCAACAC GCACGAGACT TTGACCACAA AAGATATTTC GGATCTTACT
GGAATGATAG GCGACGAGCT CAAGGGTTGC TTGCAAGCGC TTTCGTGCGT CAAGGGGAAA
AATATTCTCA CAAAGTTGCC CGCTGGGAAA GATGTGAGCT TGGGAGACTC GTTTCAAGTC
AATCGAGACT TCTCATCTAA GACGACCAAG GTCAAAATCT TGTCCATTTC TGCCAAGCGA
GAGAACGATC ACGAAAGATC GTTGACGAAA AGCAAAATCG TAGACGATCG TAAGCCGCAG
ATCGAGGCCA CCATTGTGCG CGTGATGAAG GCGAAGAAGC GGCTCGATCA CAACAGCATC
GTCATGGAAG TCACGGCTCA AGTCAGGAAC CGTTTCATGC CCACACCCGC GGATATCAAA
AAACACATCG AGACCTTGAT TGAGCGTGAA TACATCGAGA GGGATCCGAG CGATCGAAAA
ATGTACGTTT ATCTCGCGTA G
 
Protein sequence
MTRAANARDA VDGDDARARA ASDARARERE RDGVVGRKRD ASGQASRARS RAATTATTTA 
LEPFRHRVDA DPSFVETTLR TLRTATTELL NLSSEGLSYE ELYGKAYALV LRKQGDALYN
TISDAVTDHL CLHVASKIAD VVGDVEFLKD LETRFARHRK SAQMLTDVFI YLDRVHLKRS
GNANLEPVGD LVITLWRECV VNNPRIRRRM HSCMLDLIRR ERDGESVDRD ALQKVTSMLL
TLHESVYVDE FEVKMLDETR SYYKAVAQKR IDIDDCPTFL RMAEARLAQE KDRSEAYMAP
RTTGLLLEQA RNQLLKEMSQ SLLHNATSGM VHMLRANQIE NLRRMYSLFS TMDDLEGIPD
VMFNHLKEIG KSIVNDLENE KNPTQFVEEL FKFKEKYDTI LIEAFANNRL IESQCNQAYQ
LVANLNPRSP EYLSLYLDHM LRKSSKDASQ SELEIILNRS MGLFHLFHEK DVFENYYRQH
LSKRLLNKRS ASDDNELAFI GKLKDDCGFT FTSRMEGMFN DMLTSGDLTR EFEGVYSRGS
GSMEVNVSVL TTGAWPLKVH KTPINLPHEC ERTCKVFENF YLSRHAGRKL TWQANMGRAD
IKARFASGEY EISASTLHMC VLMLFNTHET LTTKDISDLT GMIGDELKGC LQALSCVKGK
NILTKLPAGK DVSLGDSFQV NRDFSSKTTK VKILSISAKR ENDHERSLTK SKIVDDRKPQ
IEATIVRVMK AKKRLDHNSI VMEVTAQVRN RFMPTPADIK KHIETLIERE YIERDPSDRK
MYVYLA