Gene OSTLU_35087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_35087 
Symbol 
ID5003718 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp32131 
End bp33708 
Gene Length1578 bp 
Protein Length471 aa 
Translation table 
GC content61% 
IMG OID640419139 
Productpredicted protein 
Protein accessionXP_001419682 
Protein GI145350584 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAAGA CGAGCGCGAA ACAGCGACGG CGCGTTCGCG GCGTCGTGCG CGGCGCGGGG 
CGCGGGAGAG TGATTGGATT TTATCACCCG AGCGGTGGGG ACGGTGGCGG CGGCGAACGC
GTGCTGTTCG CCGCGATCGC GGCGGCGCAG CGAGGTGATT TCGATGGCGA TCGGCGCGGC
GCGCGAGGCG ACGACGCCGA CGCCGACGCG GCGGAGAAGG AAGGCGAGGC GGCGATGATG
TCGCGCGCGG ACGAGCGCCG GGACGCGCGG TCGGACGTGA GATCGAACAG TGCGCGCGAG
ACGCGCGAAT GGAGAGCGAC GGAGGGAAGG GATGTCACGT GTTGCGTGTA CGCGGGCGCG
ACGTCGGCGC GAGGCGACGA ACCCGTCGAG GACGGCGACG AACTCATCGC GCGCGCGCGC
GAACGGTTCG GGATCGAGTT GCGAGCGCCC ATAAACGTTA TCAGACTCAC GCGTGAGCGA
TGGGCGCGCG CGGAGACGTA CAAACGGTGC ACGATATTGG GGCAGTTTAT CGGCGGCGCG
TGGCTCGGGC TCGAGGCGCT GTGGACGTTC GCGCCGGATG TGTTCGTGGA TACCGTCGGA
CACGCGGCGA CGTATCCGAT CGCGCGATAT CTGTTTGGGT GCCAAACAGT GGCGTACGTG
CACTATCCGA CGGTGTCGAG GGACATGATC GCGCGCGTGG AGAGCGGAAG ACTGATGTAC
AACAACAGCC GCCTGTTCGC GTCGTCCAAG TTTTTGAGCG GCCTCAAGGT ACTTTATTAC
CGCGCCTTTG CCGTCGTGTA CGGGTGGTGC GGACGATCGT GTAAGTGCGT GATGGTGAAC
TCTTCGTGGA CTAAATCGCA CATCGACGCA TTGTGGCGGG TTGATTCACG GGTGGTGTAT
CCCCCGTGCA ACGTCGAAGA CTTGTCCAAA CTCCCGTTGA CGCGCCCGAG GTTGAACGCG
CGCGGGACGC CGGTGAAAAA GGACAAGTCG TCGATACGCG TTGTTAGCGT GGGGCAGTTT
CGACCGGAAA AAGCGCACGT GGTTCAAATC GCCGCCTGGA AGGCTTTGAA AAAGTTCAAG
ACATTATCGA GCAAGATTGA AAACGCCATT TTGGTCTTCG TCGGTGGATG CCGTGACGAA
GCCGACCGCG AGCGCTTGGC AGATTTGCAA CAAAGTGTCA AAGATCTGGA GCTCCAGGAT
AGCGTTCAGT TCCACGTCGA CGTGTCGTAC GACGAAGTCA AGCGCGAGCT GTCGCGCGCG
TCCATCGGCC TTCACTCCAT GATTGACGAG CATTTCGGTA TTTGTGTCGT CGAGTACATG
GCGGCAGGCG CGGTGCCCGT CGCTCACGCA TCTGGCGGAC CTTTTCTCGA TATCATACGC
GACCAACACG ACGGCCCGAC AGGTTTCACT GCGGATAGCG TGGCGACCTT CGCCGAAACG
CTCGAGCACT TGTTGCTCAT GCGCCGAACC GAGCGGGAGG AAATTTCAGC GCGCGCGCGC
GCGCGTAGTG ACATTTTCAG CGAAACAGAA TTCAACTCAA ACTTCATCGA CAGTCTCGTC
AACTCTGGCG TTCTCTAG
 
Protein sequence
MFKTSAKQRR RVRGVVRGAG RGRVIGFYHP SGGDGGGGER VLFAAIAAAQ RATEGRDVTC 
CVYAGATSAR GDEPVEDGDE LIARARERFG IELRAPINVI RLTRERWARA ETYKRCTILG
QFIGGAWLGL EALWTFAPDV FVDTVGHAAT YPIARYLFGC QTVAYVHYPT VSRDMIARVE
SGRLMYNNSR LFASSKFLSG LKVLYYRAFA VVYGWCGRSC KCVMVNSSWT KSHIDALWRV
DSRVVYPPCN VEDLSKLPLT RPRLNARGTP VKKDKSSIRV VSVGQFRPEK AHVVQIAAWK
ALKKFKTLSS KIENAILVFV GGCRDEADRE RLADLQQSVK DLELQDSVQF HVDVSYDEVK
RELSRASIGL HSMIDEHFGI CVVEYMAAGA VPVAHASGGP FLDIIRDQHD GPTGFTADSV
ATFAETLEHL LLMRRTEREE ISARARARSD IFSETEFNSN FIDSLVNSGV L