Gene OSTLU_30617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_30617 
Symbol 
ID5001034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp467129 
End bp468573 
Gene Length1445 bp 
Protein Length443 aa 
Translation table 
GC content59% 
IMG OID640416455 
Productpredicted protein 
Protein accessionXP_001416670 
Protein GI145344292 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1222] ATP-dependent 26S proteasome regulatory subunit 
TIGRFAM ID[TIGR01242] 26S proteasome subunit P45 family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.580577 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ACACGGCGAC GCGTCTCGAC GCGCGACGGC GACGGCGCGA GCGACGGCGC GAGCGACGGC 
GACGATCGAC GCGCGACGCG CGCGCGCGAG CGAACATGGG CCAGGGACAG AGCGCGGACG
GCGGCGACGG CGCCGACGGA CGCCACGGCC GGGGCAAGAA GAAGGAAAAG AAGAAGTACG
TCCCGCCCGC GCCGCCGATG CGCGTGGGAA AGAAGAAGAA GAAGACCGGG ATCGAGGGCA
GCACGCGATT GCCGAACGTC GCGCCGCAGT CGAAGTGTAA GCTGCGGATG CTGAAGCTGG
AGCGGGTGAA GGATTATTTG CTGATGGAGG AGGAGTTCGT GGGGAATCAG GAGCGGTTGA
AGCCGCGAGA GGAGCGGGAC GAGGACGAGC AGAGCAAGAT TGACGAGATG CGGGGGGCGC
CGATGAGCGT GGGGTCGTTG GAGGAGATCA TCGATGACAC GCACGGGATC GTGTCGTCGT
CGATCGGGCC GGAGTATTAC GTGAACATCG CGTCGTTCGT GGACAAGAGT CAGCTCGAAC
CGGGGTGCGC GGTGCTGTTG CATCACAAGA ATTCTGCCGT CGTGGGGACT CTGGCGGACG
ACGTCGATCC CATGGTGAGC GTGATGAAGG TTGATAAGGC GCCGTTGGAG TCGTACGCCG
ATGTTGGGGG ATTAGAGGAT CAGATTCAAG AGATCAAGGA AGCCGTGGAG TTGCCGCTGA
CGCACCCCGA ACTGTACGAA GACATCGGCA TCAAGCCGCC GAAAGGGGTG ATCTTGTACG
GAGCTCCGGG AACTGGGAAG ACGCTGTTAG CTAAGGCGGT GGCGAACTCA ACGAGCGCGA
CTTTTTTGCG CATCGTTGGA TCTGAATTGA TTCAAAAATA CTTGGGCGAC GGCCCGAAGC
TCGTGCGCGA GCTCTTCCGC GTCGCCGACG AGATGAGTCC CTCTATCGTT TTCATGGATG
AGATCGACGC CGTCGGTACG AAGCGATACG ATTCTCAATC GGGCGGCGAG CGCGAGATCC
AACGTACGAT GTTAGAGTTA CTGAACCAGA TGGATGGTTT TGACTCGCGC GGCGACGTCA
AGGTCATCAT GGCTACGAAT AGAATCGAAT CGCTCGACCC CGCGCTCTTA CGCCCGGGTC
GAATAGATCG AAAGATTGAA TTCCCTTTAC CGGACGTCAA GACAAAGCGA CACATTTTCA
ACATTCACAC CGGGCGCATG AACCTTTCCG CCGACGTACA GTTGGAGGAA TTTGTCATGG
CCAAGGACGA ACTCTCGGGC GCCGACATCA AGGCGCTTTG CACCGAAGCC GGTTTGCTCG
CCTTACGTGA GCGCCGAATG CAAGTAACGC ACGCCGACTT CAGCAAGGCT AAAGAAAAGG
TTTTGTACAA GAAGAAGGAA GGCGTGCCGG AGGGAATGTT TACGTGATTA GAACCGTTTT
AGGAG
 
Protein sequence
MGQGQSADGG DGADGRHGRG KKKEKKKYVP PAPPMRVGKK KKKTGIEGST RLPNVAPQSK 
CKLRMLKLER VKDYLLMEEE FVGNQERLKP REERDEDEQS KIDEMRGAPM SVGSLEEIID
DTHGIVSSSI GPEYYVNIAS FVDKSQLEPG CAVLLHHKNS AVVGTLADDV DPMVSVMKVD
KAPLESYADV GGLEDQIQEI KEAVELPLTH PELYEDIGIK PPKGVILYGA PGTGKTLLAK
AVANSTSATF LRIVGSELIQ KYLGDGPKLV RELFRVADEM SPSIVFMDEI DAVGTKRYDS
QSGGEREIQR TMLELLNQMD GFDSRGDVKV IMATNRIESL DPALLRPGRI DRKIEFPLPD
VKTKRHIFNI HTGRMNLSAD VQLEEFVMAK DELSGADIKA LCTEAGLLAL RERRMQVTHA
DFSKAKEKVL YKKKEGVPEG MFT