Gene OSTLU_93034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_93034 
Symbol 
ID5002725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp457100 
End bp458686 
Gene Length1587 bp 
Protein Length528 aa 
Translation table 
GC content60% 
IMG OID640418146 
Productpredicted protein 
Protein accessionXP_001418938 
Protein GI145349019 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0651398 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.100945 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGCGG CGAAGGCGGA GGAGGAAAAG ATTCGCAGTA CCGTTCGTGA ATTCATTTTG 
AAGGCGATGG AGATGGCGGC GGAGGAGACG AAAGCGAGCG GGCACGACGA AGCCAATGGA
ACGCCGAGCG AAGTCGCCGC GGCGGTCGAA TCGGCGCTGT ATAAAAAATG CGGTTCGGCT
GACAAAGAGT ACAGAACGCG CGCGCGGTCT TTAAAATCCA ACTTACAGGA CGTGCGCAAC
CCTCAATTAC GGGCGCGCGT GCTCGCGAAC GATTTGAAAG CTTCACAACT CGTGGACATG
TCTCCGCTGC AGTTGGCCAA CAAGGAGCTC GTCGAGTGGC GCAAGGCGCG ACAGGAAATC
GCGGGCGAGG GTGCCTTTAT GAAGGGAATC GCGCTTGAGG ATATAGTGGT GAAGAAAGAT
GGGAAGAATG AAATTCACGT CGAGCTGAAG CCAGAAGAAC CGGCGCCGTC GAAGCCCGTG
GAACAAACGC CGAGCGTCGA GGAAGAGCCG ACGCAAGTCA CCGAGATTGA CGTGACGTCT
GGGAACGACC AACTGTCAGA CGAGGAGCAC GAAGAAGCCG CACCGATGAA CGTCGACGGT
GATGATTCAG AAATGCTTTC TTTCGAAAAG TTTGCGAATG GAGGCGAAGA AGACAAGGAA
GAGGAGCAGG AAGAAGAGGA AGACGAGGAA GACGCCGCGC CGGAATACGA GCCAGAGCCC
GAGTACGAAC CGGAGGACGA ACCTGAGTAC GACCCGGAAG CGACTACGGC GGATGAGGTG
GAAGAAGAAG AATACAATCC CGCCGACGAC CCGATAGATG TTCCTCTTCC CGAAGGTGCG
TGGGAAGGCA CCGTAGATGT TCCAGGACTG CCTACGCTTC AGCTGCGAGC GGTGCCCATC
GGCGGCGAAG GCGCCCACGT CGGCGACATC TTGCCCGAGA GTTTGCACAT CAAAGGCCGC
GTCGACTACA AAGCCATGCA ATCTTTCGTC AAGCAAGTCC ACCGCTCGTC GACGTCGCGC
GCGGTGACGC TCGTGCATAT CTCGAGCGCG CCGAGCGGCG GAGACGACGC GGAAGCGGCG
ATGGCGAAGA TCGTCAAACA ATATCGCGAG CGCAAGCGAT GCGGCGTGGC GAAAACGGAG
GATGGCATCG AGCTGTATCT CGCGCCTCGC GGTCAGCACG CGGATAAAGT CATCTTAACC
GTCGACCTTA TCCCGGGACA CGTCCCACCG TCGACGGGGA TGATTGGAAT GGTGATTCAT
CCGCGAGGCA TCGGTCCAAG GAAAGTCGAC TCGAAGGAAC TTCATCGATC AAAGAAGACG
CGCGTCGAAG AGCACGTCGA CGAGGATGAG TACGCGCCCA ACGCGCCACC GGCGCAATTC
ATGGAGGTTC CGCCACCACC GCCACCCTCA GCGCAGATGG CGCCGCTCGC GCCCCCCCAA
ACTCTTCGAG AAGTTCCCCC GCCGCCGCCG CCCGCCGCCG CGCCGCCGGC GTTCCAAGCG
CAAGATCTCG CCGGTTTGAT CGCGACGCTT TCCGGCGCCC AGCAACCGGT GCGCATGAAC
GTACCACCGC CTCCTCCTCC TCCGTGA
 
Protein sequence
MEAAKAEEEK IRSTVREFIL KAMEMAAEET KASGHDEANG TPSEVAAAVE SALYKKCGSA 
DKEYRTRARS LKSNLQDVRN PQLRARVLAN DLKASQLVDM SPLQLANKEL VEWRKARQEI
AGEGAFMKGI ALEDIVVKKD GKNEIHVELK PEEPAPSKPV EQTPSVEEEP TQVTEIDVTS
GNDQLSDEEH EEAAPMNVDG DDSEMLSFEK FANGGEEDKE EEQEEEEDEE DAAPEYEPEP
EYEPEDEPEY DPEATTADEV EEEEYNPADD PIDVPLPEGA WEGTVDVPGL PTLQLRAVPI
GGEGAHVGDI LPESLHIKGR VDYKAMQSFV KQVHRSSTSR AVTLVHISSA PSGGDDAEAA
MAKIVKQYRE RKRCGVAKTE DGIELYLAPR GQHADKVILT VDLIPGHVPP STGMIGMVIH
PRGIGPRKVD SKELHRSKKT RVEEHVDEDE YAPNAPPAQF MEVPPPPPPS AQMAPLAPPQ
TLREVPPPPP PAAAPPAFQA QDLAGLIATL SGAQQPVRMN VPPPPPPP