Gene OSTLU_31158 details

Gene Information

Locus tag: OSTLU_31158
Symbol:
ID: 5001397
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009358
Strand:
Start bp: 411158
End bp: 413562
Gene Length: 2405 bp
Protein Length: 781 aa
Translation table:
GC content: 56%
IMG OID: 640416818
Product: predicted protein
Protein accession: XP_001417235
Protein GI: 145345476
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG5021] Ubiquitin-protein ligase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.000507428
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGTGG ACGACGAATC AATCGTCGAG GACGACGCGT CGTCTTCGGA TTCTTCCGAA 
TTGGATTTTG TCGTCGACGA CGACGAGGAC GAGACTGCGA CTGAAGTTGA CTTCACGCCT
CGAAGAACCC GGCTGAGTCT GGGATCGAGC GTTGACGAGG CTCTGAGCAT CGACGACGTC
GCAGCGTTGA AACGCGCTCT CTCACGTCTA GAGGAGACAC CTCGACCGGG TCGCGTGCTC
ATCTCCGCGG CCGCGCACAA CGCCATAAAC GTAGCGAGCG AGTTCTTGGC GAATGGTATT
CGAGTAAATG GTCAGTTGAA CGAGAGGGGA TCCAAGAAAC CGCACATCAC GGACATCGCC
GACGCGCTTC ACATCGCGTG CGAACGAGGC AACGTGGATT TCGTTCACGC CGTGAGGGAC
CAAATCGGAT TGGCGCAGTT CTGTCGAGGT GGTGACACCG GCGGTTGGCC GACCTCAGCG
ACGGGCGGCA CACTCGTCCA TTCGGCGGCG GATAACGAAA AAGCGAGCGC GGAGTGCGTT
GCAGCGGCGA TTATGGCGGA GTCACCACCG AAAGGGCGCG TCATGCCCGT GGCTGCGGTG
TTCGACGAAG ACGCTCGCGG CGCACGCACG CCGTTGATGA TCGCTGCGGG TGCAGGTCCC
GAGTGGGGGT CTTGCGTTGA AGCGCTCTTG AGGGATGCCA GAACATACGG CCAACGCGCA
GTGCGAATCG CGATACGATG TCAAGAGCCA CAAGAAGGAT GTTCTGCGGT GCACATCGCC
GCCGTCGCCG GAAATGTGGT TTGTTTGGAG AAATTTCTAA AGGAAGATCC ATCGTGCTTA
AAGGTGAAAG ACTACAACGA TTCGACGCCA TTGCATTGGG CATCTATGGA AGGACGGCAC
GAGACGATCC GAGTGTTGCT CGAGAAGGGC GCCAACAGAC TAGCGATTGA CAAAATGGGT
TGGATTCCGT TGCTCTACGC CAATTTTCAC GAAAAGAGTG AGGCAGTTTT ATACTTGCTC
GAGAAACAAG TCTCGGAGCA ATTGCAGACC ATGTTCACTG CGGTGGCATC GGAGGACGAT
GGGAAGTCCA AGTTGCAGGT ACAAAAGGTT TTGAATTTGC TCGCAGCCAA ACCAGCGTAC
TACGATGCGA TAAACAACTG CATCAGGAGA GACATGTCGC TCTTGGGCCC TGTCATGAAC
ATGCTGCGCG AATGTGGAGC GATTACTATT ATCAATTTGG CCAACCGCGT GTCGTTCTTG
CGCAATACGT GGCTCAATGA ATATGGATTG CAAAATTTAG CTCTCTCGAA AAGTTTCACC
TTGTCTGCGC AAGACGCTTG GCACGACTTT TTCGTAAATG TCTGGTCCGT CTCGCCAAAG
ATTTTGAAGA TTCGACATCT CAGTTGGGGC TTGCGGCAGC CATCGGGGGA TGTTTCAGTA
GGTCCGGGAC CAACGCGTGA AGCGTTCAGC TTGATGGCGA ACGAGTTGTG CGATGGAAAC
AGAGCCGACC GCCAGCTATT TACACAGGAT GACGCGCGCA CGTACAAACG TAAGGAAGGT
ATCGACCGCC CGAGCGTGCT GAAAGAGCTC CAGTGTTTCG GCGAACTTCT GGCGCACGTG
GTGCTCTTCG GCAGCGCCGT TTTACCGATT CCGTTCTCCA AAGTCTTCCT GAGGCGCGTC
ATCGCCAACG AAAAATGTGA CGCGTTCACT CTAGACGATC TCGCCGACGT TGAGCCCGCA
GTGGTGAAGT CGATCAAGGT TGTGCTTGAA ACCGACGACG TCGAATCGCT CTGTCTCACG
TTTATCGATC CGACGACGAA CGAAGAGACA GACGTCACGA AGGCAAACTT GCAAGCATTC
AAGCGACAAA AAATTCGCGA AATGGTGTCG AGCATCGTAG ACGACGCATC CGTGCACGCG
ATCCGCACAG GTTTGCTCCA AATCATGCGA AAATCACAGA TCGAGGTGCT GGCGCCCGAA
GAGTTTGGCC TCGTCGCCGC CGGCTCGCAA TCGATCGACC CCAAGGAATG GCGCCGACAC
GCCTCCTTCT CACCAAGTCC AGAGATGGAA TGGTTTTGGG ACGTGGTCGA ACGCATGAAC
AACGACGACA AATCTCGTCT CCTCCAATTC TCCACCGGGA GTAGTTTACT TCCGGTCGGT
TCTTTCGCCG CGCTGTGTCC TCCGTGGAGC ATCGAAGTCG GTCTTCACCG CGACGCCAGT
AAGCTCCCCA CCGCCTGGAC GTGCTTCAAC ACGCTTCAAA TGCCGCGCTA CCCATCGAAA
GAATTGCTCG AAGAGCGCTT GTTTTGCGCG CTGCGCCACG GAAGCGCTGG ATTCGCCTTT
GCGTGACGTT CGCCGAACGC GCCTCGCCTC GTAAACGCGC GTGTAACACG CTTTGATTCG
CCTCG
 
Protein sequence
MDVDDESIVE DDASSSDSSE LDFVVDDDED ETATEVDFTP RRTRLSLGSS VDEALSIDDV 
AALKRALSRL EETPRPGRVL ISAAAHNAIN VASEFLANGI RVNGQLNERG SKKPHITDIA
DALHIACERG NVDFVHAVRD QIGLAQFCRG GDTGGWPTSA TGGTLVHSAA DNEKASAECV
AAAIMAESPP KGRVMPVAAV FDEDARGART PLMIAAGAGP EWGSCVEALL RDARTYGQRA
VRIAIRCQEP QEGCSAVHIA AVAGNVVCLE KFLKEDPSCL KVKDYNDSTP LHWASMEGRH
ETIRVLLEKG ANRLAIDKMG WIPLLYANFH EKSEAVLYLL EKQVSEQLQT MFTAVASEDD
GKSKLQVQKV LNLLAAKPAY YDAINNCIRR DMSLLGPVMN MLRECGAITI INLANRVSFL
RNTWLNEYGL QNLALSKSFT LSAQDAWHDF FVNVWSVSPK ILKIRHLSWG LRQPSGDVSV
GPGPTREAFS LMANELCDGN RADRQLFTQD DARTYKRKEG IDRPSVLKEL QCFGELLAHV
VLFGSAVLPI PFSKVFLRRV IANEKCDAFT LDDLADVEPA VVKSIKVVLE TDDVESLCLT
FIDPTTNEET DVTKANLQAF KRQKIREMVS SIVDDASVHA IRTGLLQIMR KSQIEVLAPE
EFGLVAAGSQ SIDPKEWRRH ASFSPSPEME WFWDVVERMN NDDKSRLLQF STGSSLLPVG
SFAALCPPWS IEVGLHRDAS KLPTAWTCFN TLQMPRYPSK ELLEERLFCA LRHGSAGFAF
A
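
The sequences above are printed in 10-character chunks, so they need to be joined before any downstream use. The following is a minimal Python sketch, not part of the original record, for joining such a block, computing a GC percentage, and checking a listed length against its coordinates. The helper names (`clean_sequence`, `gc_percent`, `span_length`) are hypothetical, and the sketch assumes that Start bp and End bp are 1-based inclusive coordinates, which agrees with the listed Gene Length of 2405 bp.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated chunks into one string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Coordinates taken from the Gene Information fields.
print(span_length(411158, 413562))  # 2405, the listed Gene Length

# First line of the gene sequence, as printed in the record.
first_line = "ATGGATGTGG ACGACGAATC AATCGTCGAG GACGACGCGT CGTCTTCGGA TTCTTCCGAA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

To compare against the listed GC content, pass the full joined gene sequence to `gc_percent`; the record rounds the value to a whole percent.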