Gene OSTLU_119643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_119643 
SymbolPsma1 
ID5000161 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp684424 
End bp685465 
Gene Length1042 bp 
Protein Length236 aa 
Translation table 
GC content45% 
IMG OID640415582 
Productalpha-type C2 proteasome subunit 
Protein accessionXP_001416462 
Protein GI145343723 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0638] 20S proteasome, alpha and beta subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTCGCA ATCAATATGA TCAGGTAACT ATCTAAAATT GACGGCGCTC TCGTCTCTGA 
ACGACGTTCC TCTCGCTGAC GTCGTTGACC TACAGGACAT GTTGACATGG AGTCCACAGG
GACGCTTGCA TCAGGTGGAG TGTAAGTCTT TCACATATCT TCTATGAGTG ATGGACCTCA
AATGATCAAA AGATGCGATG GAAGCCGTGA AGCAAGGAAG TACCGTTGTA GCTTGTGGTG
TATGTGTCTA AACTTTCTGA AACTTGCTAC AAAGTTTGTT AATGGTGATC GCAGAATAAA
AAGTGCGCTG TGCTCGCTGC TTTGAAGCGC AAACAATCGA AGCTATCTTC GATGCAAGAA
AAGATTTTCG CAATAAACAC ACGAGTGGGC GTCGCAGTGT CAGGTCTTTC AACAGATGGG
GCAAAGTTAG TAAAACTTCT TCGCGAGGAA AGTACAAGAG AAGAGTTTGT GTACGGTCGA
AGCGCATGCC CTGGTCGGCT CGCTGCAGCA GCTTCAAAAT CATTACAGGT GAATTGATCA
CAGGTAGACT TGCTTTCTCT TATTGTTTTG AATAGATCTC TATGCGAAAA AGCTCAGGAC
GGCCTTGTGG AGTCGGTTTG GTACTTGTTG GTTCAAGTGA ACAGCTCGGT TGTCAAATCT
TCCAAACTTG TCCTAGTGGT GAAGTTCACC AAATGTCGTG CACTGCAATA GGTATGAGCC
AGTCAAGCAT TCGAATAATC CAAATAAGAC GTTGATGTGT GAATTAGGTG GGAGGTCACA
GTCAGCACGA ACATATCTTG AAAAAAATAT GGAGGCCCTC GAGACTGCCA GCGTTGACGA
GGTACGTTTG TTGTACGCTT TGCATTACTC CGTGAATGAT GTAGATGGCC AGATGATCAT
GATTGCACTG AGAGCGCTAA ACGAATCGAA TGTAGATGCT AGTGATCTCA CATCGGACAA
ATGCGAAGTG GCGATAGTAG AGCTAAACAC TTATACGTTG AAAAGCGCGG CAGAAGTGAC
TACCCTGTTG ACAAATCTTT AA
 
Protein sequence
MFRNQYDQDM LTWSPQGRLH QVEYAMEAVK QGSTVVACGN KKCAVLAALK RKQSKLSSMQ 
EKIFAINTRV GVAVSGLSTD GAKLVKLLRE ESTREEFVYG RSACPGRLAA AASKSLQISM
RKSSGRPCGV GLVLVGSSEQ LGCQIFQTCP SGEVHQMSCT AIGGRSQSAR TYLEKNMEAL
ETASVDEMIM IALRALNESN VDASDLTSDK CEVAIVELNT YTLKSAAEVT TLLTNL