Gene OSTLU_36158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_36158 
Symbol 
ID5000470 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp854974 
End bp857004 
Gene Length2031 bp 
Protein Length618 aa 
Translation table 
GC content56% 
IMG OID640415891 
Productpredicted protein 
Protein accessionXP_001416271 
Protein GI145342788 
COG category[J] Translation, ribosomal structure and biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5117] Protein involved in the nuclear export of pre-ribosomes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCAG AGGCGCGACG CGAGAGCGTC AAGGTACGAA TTGCGTCGAC GTGTCAGAGC 
GTCATCGAGG ATCCAGAGTC GAAGTGGAAA GAGCTCAAAG ACATCGGCAC GCTTTGTGAA
GACCGAGACT CGGAAATCGC TCGATTAGCG TCGCTATCGC TCACCTTGGT CTACCGCGAC
ATTTGCCCGG GCTATCGCAT TCGACCTCCG ACGGAAAAGG AGTTGTCCAT GAAGGTGAGT
AAAGACGTGC TCAAGACGCG CGCTTTCGAG ACGGGCTTGC TTGAGCACTA CAAGTCGTAC
GTGAAAATGC TAGTGAGATG CTCTGGGGCG AAGAAATCTC GCGCACAGCG TGGCAAAGGC
GGACCAGATG CGGAGTCTGC GATCAAGTGT TTGTGCGCCT TGCTCATCGG ATTGCCGAGC
TTCAACTATC GCACGGATAT TTTATCCGCC ATCGTGCCCG TCTTCGATAA GCGAGACACC
AGTCACGCGC AGATCGTCAC TGATGCTCTG GTTGAGGTCG TGTCTAACGA CATTCGCGGC
GATCTCACGC TCGAGGCGTT GCACATGACG GCGCAGCTCG TGAAGCAGAG TAAATGCAAC
ATTCAGCCGT GCGCGTTTGC GTACTTCCTC AAGGTTCGCT TCGATGAGGG CATATTAGTG
CCTATGGTGC GTGACAGGAA AGAAATTCTT TCTCGTAAGC AGACGTTCAA GAAGAAGCAG
GAAGAGCGTG ATAAGATTCG CAGAGCTCGC GCGGAGAAGA CGAGGAAGCA ACAAGACAAG
GAACGCATGA AATCATTCGG GCACGTCGCG GACTCTTCAG ACGATAGCGA AGACGAAGAG
GCGGCGTTTC ATCGAGATTT AGACGAAGGT TCAGCGGTGA TGAGTTTCGG CGAGAAGAAA
AAAACGCAGA GTCGATTATT AGAGGCGACT TTCGAGATGT ACTTTCGCGT ATTGAAGAAC
GCCGCGAGTC CGGCGCCGAC GCCCGGGTTG CCGCTCTTGA GCGCCGCGCT CACGGGTTTG
GCAAAGTTCA CGCATTTAAT CTCCATCGAT TTCTTGGGTG ATCTCATGGA AGTGTTTAGA
AAGTTATTGG CCCAGGAAGA CTTGCTCTCG GACGCGCTCA AGGCGCAAAC TTTGCTGACG
GCGTGTGAAA TTTTGAGCGG TCACGGAGAG GTTTTGCAAG TGGATACCGG GGAATTCTAC
CGTCAGCTCT ATACGATGCT CGGCAAGCCC AGCGTAGGTG CAGCGGGGTG GCAAGATGGT
ATGTCTATAA CCGATCAACG CGCGCTCAAT CACGGCACGC TGCGCGTTCG TGCCATTCAA
AAGTTCATCG GCGGTTTCAA GCAAGTCGAT CAAGCACGCA TGGCGGCGTT TTCCAAGCGC
CTCACCTCGG CGTCGATCGG CATGGAAGCT GGCGAATGCC TGGGTTCGCT CGGTGTCGTC
CGCCAAATTC TCGCGTCGTA TCATCGCGTG CGTAACTTGC TCGAGAACGA GCGGATTGGG
AACGGAGTGT TTCAGATGGA TTTAGACGAT CCCGAGCACG CACAAGGCAT GTCCGCCGTG
CTCTGGGATC TTTGTCTACT TTCGCAACAC TACCATCCCA CGTGCGCGGC GGCGGCTCAC
GAGGTTGCCA ATTTACCACT CTCGGGCGCG ATAGCTCCAC CTCCCGGCTC ACACGCCCCG
AGCGAGCTCG CAAAGGCGTA CTCCACGCTC CGTGGAGACT TCAACCCGCC GATTCCCGAG
CCGCCGACGC AACGCAACAA ACCGCGAGCG CCGATCGATC AGAAAAAGTT CGTCGACGCC
TACGACGAAT CTTTCAAACG GAACGTCATC AGTAAGCTCG GCGACACCGT CGACACCGTC
GAAGCGCGTG CGTTCCGCCG ACATTTCCGT CGCGTTCGAG CGCACGGGGA AAACTTTCGG
TTACGAAGGG AGCGCGACGC CCTCGCGCGA AAAATTAGCG CCATGCGCGC GCACGAGCTC
GAAGCCAAGT CGAAGTCGAA ATCTAAAAAA TCGTCGTCCA AACGCACGTA G
 
Protein sequence
MSAEARRESV KVRIASTCQS VIEDPESKWK ELKDIGTLCE DRDSEIARLA SLSLTLVYRD 
ICPGYRIRPP TEKELSMKVS KDVLKTRAFE TGLLEHYKSY VKMLVRCSGA KKSRAQRGKG
GPDAESAIKC LCALLIGLPS FNYRTDILSA IVPVFDKRDT SHAQIVTDAL VEVVSNDIRG
DLTLEALHMT AQLVKQSKCN IQPCAFAYFL KVRFDEGILV PMVRDRKEIL SRSAVMSFGE
KKKTQSRLLE ATFEMYFRVL KNAASPAPTP GLPLLSAALT GLAKFTHLIS IDFLGDLMEV
FRKLLAQEDL LSDALKAQTL LTACEILSGH GEVLQVDTGE FYRQLYTMLG KPSVGAAGWQ
DGMSITDQRA LNHGTLRVRA IQKFIGGFKQ VDQARMAAFS KRLTSASIGM EAGECLGSLG
VVRQILASYH RVRNLLENER IGNGVFQMDL DDPEHAQGMS AVLWDLCLLS QHYHPTCAAA
AHEVANLPLS GAIAPPPGSH APSELAKAYS TLRGDFNPPI PEPPTQRNKP RAPIDQKKFV
DAYDESFKRN VISKLGDTVD TVEARAFRRH FRRVRAHGEN FRLRRERDAL ARKISAMRAH
ELEAKSKSKS KKSSSKRT