Gene OSTLU_41449 details

Gene Information

Locus tag: OSTLU_41449
Symbol:
ID: 5002299
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009360
Strand:
Start bp: 51974
End bp: 53125
Gene Length: 1152 bp
Protein Length: 246 aa
Translation table:
GC content: 60%
IMG OID: 640417720
Product: predicted protein
Protein accession: XP_001418143
Protein GI: 145347374
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0638] 20S proteasome, alpha and beta subunits
TIGRFAM ID:
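
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention, but the reported gene length matches it). The function names `feature_length` and `min_cds_length` are hypothetical helpers introduced here for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def min_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Coding bases needed for a protein: 3 per residue, plus a stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = feature_length(51974, 53125)
cds_len = min_cds_length(246)

print(gene_len)            # 1152, matches the reported gene length
print(cds_len)             # 741 coding bases including the stop codon
print(gene_len - cds_len)  # 411 bases not accounted for by coding sequence
```

The 411 bp difference is consistent with the record marking the gene as spliced, on the assumption that the listed gene span covers only coding sequence plus introns.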


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.220218
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCGCG GTGCCGGCGC CGGATACGAT CGGCACATCA CCGTATTCTC TCCCGAAGGA 
CGTCTGTACC AAGTCGGTGC GTGTTCGCGC GCGCGCCGAC GCCTCCTCGC GCCGTCACCT
CGCCCGCGAC GCGCGACGTT CACCGACGAA ACGATCTCAC CGACTGACCG CGATTCGACG
ACTTTCGACG CGCAGAGTAC GCGTTCAAGG CGATCAAATC CGTCGGCGTC ACCACCATCG
GCGTGCGAGG GAAAGACTCC GTGTGCGTCG TGACGCAGAA GAAGATTCCG GTGCGCGAAA
CGGGCGAACG CGAGGGCGAA GAACGATCTC GTCGGCGAAC GCGAAGGCGC GAACGCGCTC
GAGACGACGC GGGAGGGATC GACCGCGCTG GATCCCGCGA ACGCGCGCGC GAGAGGAAGA
CTGACGCGTG GTGTGTGATC GGACGTCGCG CGAGCGCAGG ATAAATTGAT CGATGCGTCG
GATGTGACGC ACATGTATAA GATTACGAAA ACCGTGGGCA TGTGCGCGAC GGGAAAAGGA
CGTACGTTTT CGTTTTCGTC GCTCGGCGGC TTTGTTTGAA TGAACACGTA CGCGCGCCCG
CGCGAGCGCG ACTTTTCGCT CGACCGATCG ACTGACGACT GCTCGCGTCG ATTTCGTTTC
GCAGCGGATA TCCGAGACAT AGTACAAAAG GCGCGCAGAA AGGCGGCGGA TTTCAAGCAA
CACTACGGGT ACGAGGTCCC GGTGGACGTG TTGGCGAACA TACTCGCCGA TGAGTTCCAG
GTGTACACGC AGCACGCATA CATGCGTCCG CTCGCGGTGA TGGTGATATT AATCGCCGTA
GATGAGGATC GCGGGCCGAG TCTGTTCAAG TGCGATCCGG CAGGATACTT TGTCGGTTAC
AGCGCGACGA GCGCGGGGGC GAAGGAGGTC GAGGCGGTGA ACTTCTTGGA GAAGAAGGTC
AAGAGCGGCG CATCGTTCGA TGTGAATCAG ACGGCCCAGC TCGCGATCAG CGCTCTTCAG
CACGTGCTCG GGGAAGAGGT CAAGGCGAGC GAGTTAGAAG TCGCCGTCGT CACGGCGGAC
AATCCCAATT TCCGCGTCAT CAGCGAGAGC GAAGTGGAAG ATCATTTGAC ATCGATTTCT
GAGAGAGACT AG
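
A GC content figure like the one reported for this gene can be recomputed from a sequence printed in 10-base blocks such as the one above. This is a minimal sketch, not the database's own method: the helper `gc_fraction` is a hypothetical name, and it assumes the sequence contains only whitespace and the letters A, C, G, T.

```python
def gc_fraction(seq_text: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring spaces and newlines."""
    bases = "".join(seq_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence, used as a short example.
example = "ATGTCCCGCG GTGCCGGCGC"
print(round(gc_fraction(example), 2))  # 0.8
```

Passing the full gene sequence text to `gc_fraction` would give the fraction for the whole gene, which can then be compared with the rounded percentage in the record.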
 
Protein sequence
MSRGAGAGYD RHITVFSPEG RLYQVEYAFK AIKSVGVTTI GVRGKDSVCV VTQKKIPDKL 
IDASDVTHMY KITKTVGMCA TGKGPDIRDI VQKARRKAAD FKQHYGYEVP VDVLANILAD
EFQVYTQHAY MRPLAVMVIL IAVDEDRGPS LFKCDPAGYF VGYSATSAGA KEVEAVNFLE
KKVKSGASFD VNQTAQLAIS ALQHVLGEEV KASELEVAVV TADNPNFRVI SESEVEDHLT
SISERD