Gene OSTLU_27330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_27330 
Symbol 
ID5005530 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp77269 
End bp78364 
Gene Length1096 bp 
Protein Length297 aa 
Translation table 
GC content59% 
IMG OID640420951 
Productpredicted protein 
Protein accessionXP_001421201 
Protein GI145353826 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0638] 20S proteasome, alpha and beta subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.00217957 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.014135 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGGGC CGACTTTAGA TTTCAGCTTC CTCGACGCCA CCGCGCGCGG CTCGATGGCG 
TCGAAACACG TCGACGAAAG GTTCGTTTCA GATCGCGTCA ACGCGCGATC GACGACTGAC
GCCGCGTTTT AACGCGTTCA CAGCCTCGAC GCGATCGATG GAAACATGCA CGAGTGTAAC
TTTAAAGCCC CTGCGGTGGA AGACGTGCGT GAGCGACGCG AACGCGACGC GCACGACGAG
TGAGCGCTGG CGATCGGGAG AGCAACGGTT TGGTTAGGTC GCGGCGCTCG CGAAGAGTGA
AAATGGACGA CTGACGATGA CGGTGGACGA TTTGCGCGCG CAGTTTGAAG GATTCCAGCG
AGAGGTGATC AATTACGTCA AACCGAACCA CGGGACGACG ACGCTGGCGT TTATTTTTGA
GCACGGTATC GTCGTCGCGG TGGACTCTCG CGCGTCGCAA GGACCGTACA TTTCTTCGCA
GACGGTGAAA AAGGTGATCG AGATTAATCC GTTCTTGCTC GGGACCATGG CCGGGGGGGC
GGCAGATTGT CAGTTTTGGC AGCGAAACCT CGGGATTCAG TGCCGGTTGC ACGAGTTGGA
AAATGGGAAG CGAATCACGG TGCGCGCGGC GAGCAAGCTG TTGGCGAACA CGCTGTACAG
TTACAAGGGC AAGGGATTGT CCATGGGGAC GATGGTGGCT GGGTGGGATT TGAACGGGCC
TGGGCTGTAT TACGTCGATA GCGAGGGCAC ACGGTTGAAG GGGCAGCGGT TTAGCGTCGG
TTCGGGGTCG TTGTTTGCAT ACGGGGTGTT GGATCAAGGA TACAAGTGGG ACTTGACGGT
TGAGGAAGCG TGCGAACTCG GACGACGCGC GATTTATCAC GCCACGTTTC GCGACGCATT
TTCTGGTGGT ACTGTCAGTG TGTACCACGT CGGTGCGAAT GGGTGGACTA AGGTGACCGG
GGACGATGTC GGCGAGTTAC ACTTCTCGTA TTACCCGGCG ACGCCGGTCG ACGACGTCGA
TGCGCACTGC GGCGGAGAAG GCAAGAAGGA AGCCGAGGCG AGAGCGGCTA CGGAAGCGAG
CGCGATGGAG ACGTGA
 
Protein sequence
MNGPTLDFSF LDATARGSMA SKHVDESLDA IDGNMHECNF KAPAVEDFEG FQREVINYVK 
PNHGTTTLAF IFEHGIVVAV DSRASQGPYI SSQTVKKVIE INPFLLGTMA GGAADCQFWQ
RNLGIQCRLH ELENGKRITV RAASKLLANT LYSYKGKGLS MGTMVAGWDL NGPGLYYVDS
EGTRLKGQRF SVGSGSLFAY GVLDQGYKWD LTVEEACELG RRAIYHATFR DAFSGGTVSV
YHVGANGWTK VTGDDVGELH FSYYPATPVD DVDAHCGGEG KKEAEARAAT EASAMET