Gene OSTLU_119554 details


Gene Information

Locus tag: OSTLU_119554
Symbol: Mak16
ID: 5000120
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009356
Strand:
Start bp: 542749
End bp: 543850
Gene Length: 1102 bp
Protein Length: 226 aa
Translation table:
GC content: 44%
IMG OID: 640415541
Product: MAK16-like nucleolar RNA binding protein, putative
Protein accession: XP_001416423
Protein GI: 145343639
COG category: [R] General function prediction only
COG ID: [COG5129] Nuclear protein with HMG-like acidic region
TIGRFAM ID:
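
The listed gene length can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the values on this page; the function name `gene_length` is hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the Start bp / End bp fields of this record.
length = gene_length(542749, 543850)
print(length)  # 1102, matching the Gene Length field
```

Because the gene is marked as spliced, this span covers the whole locus on the replicon, including any introns, so it is not expected to equal three times the protein length.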


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.113172
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAGTG ACGAAGTAGT TTGGCAGGTG CGCCCGCACG ATACTTTTGA TCCTACTCTT 
CCCGTCATCC ATACTATAAC ACAGATCATT AACCATGGAC AATGTAGCTA TAGGGCTACA
TCAGAGACGT CCAATTTTTG TCGGAATGAA TTTAGCTTGA CTGGAATGTG CAACCGAAGT
TCTTGTCCAC TTTCAAACAG TCAGTACGCG ACGATTCGAG AAGAGTGTGG TGAGCTACTC
TTAGCGAACT GTGAAGTTCT GAAATTCACT TACTCAACTA GGCATCCTCA ACCTATACAC
AAAGACTGTA GAGCGTTCAC ATATGCCTTC AAAGTTGTGG GAGAAGACGG AACTCAGTCC
CAAATATGCG GAAGCTCTGG AACAGATAAA CTCTTCTCTG AGACATTGGT GAGTAAATTC
CGAAACGTTT GAACAGGATC GTGATAAGCG GATTAGGCCA AAGTTTCTAG TTCACAAGAG
CAAGCAACGT CTGACAAAGT TGACTCAGCT ACTCATACGT TCAAGGAAAT TAGAAAAAGT
TGGGAGGTGA GTCAGGTGCA ACTGGTTCGT GACGAGTTCA TTTACTCTGA CTTTAGGGAA
AAAATCCAAA CAATGCCAGC ACGACATACA CAGCGCGATG CGAGAGCGGA AAGTAAGGTA
AAGTCTCTCG CGCACACAAT TTGCATACGA ATATTAACTT TCGTGAAAAC AAGGCTCAAG
TGGCGGCGCG TTTGGACTCG AGTATTGAAA ATGAGTTGCT GGTATGCCTT ACTCCAGGTT
ATGACGGCGC GAATTTGAGA AACGATATGT AGGAACGATT GAACGCTGGC GTCTACGAAT
CTAGCTACCA ATTTTCTACG GCTAGATACT CACATGCTTT AGAGGTAGGC CAGACTCGCA
AAAATATTGG ATTGTATCTA ACGCTTCTCT CTGCGACAGG GAACAAGGAA AATGGGAAGT
CCAGAAACCA AAACTCCGCG GAAAATCCGC CAGCGCCGTC TACAACGGGA GATTGAATAT
GAACAGATCA GGTAAGTGAT TACTGGTATT TTCCATGGAG ATATATATTT TACTTTTGTC
CAGAACCTCA GTAGAACAGT GA
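
The gene sequence is displayed in blocks of 10 bases, so the spaces and line breaks have to be removed before its length or GC content can be checked. The following is a minimal sketch using the common definition of GC content (the fraction of G and C bases); the page does not state how its own GC figure was computed, and the helper names `clean_sequence` and `gc_content` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in seq; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence as shown on this page (six blocks of 10 bases).
first_row = "ATGCAAAGTG ACGAAGTAGT TTGGCAGGTG CGCCCGCACG ATACTTTTGA TCCTACTCTT"
seq = clean_sequence(first_row)
print(len(seq))                 # 60
print(f"{gc_content(seq):.1%}") # GC fraction of this row only, not the whole gene
```

Pasting the full sequence block into `clean_sequence` in place of `first_row` gives the values to compare with the Gene Length and GC content fields.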
 
Protein sequence
MQSDEVVWQI INHGQCSYRA TSETSNFCRN EFSLTGMCNR SSCPLSNSQY ATIREECGIL 
NLYTKTVERS HMPSKLWEKT ELSPKYAEAL EQINSSLRHW PKFLVHKSKQ RLTKLTQLLI
RSRKLEKVGR EKIQTMPARH TQRDARAESK AQVAARLDSS IENELLERLN AGVYESSYQF
STARYSHALE GTRKMGSPET KTPRKIRQRR LQREIEYEQI RTSVEQ