Gene OSTLU_26292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_26292 
Symbol 
ID5003936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp554755 
End bp555945 
Gene Length1191 bp 
Protein Length288 aa 
Translation table 
GC content66% 
IMG OID640419357 
Productpredicted protein 
Protein accessionXP_001420211 
Protein GI145351711 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.147213 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.185291 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGGTCGCGCG ACGCGCGTCG CGATGTCCGC GGCTTTCGCG TCGACGTCTA CATCGACGTC 
GACGTCGACG TCGCGCGCGC GGTTGGACGC GGCGGCGCGG CGGCGCGGTG GACGACGCGC
GAGCGCGCGC GCCGCGCCTC GGGCCTCGCG CCGTCGCGAG GCTTCGCGCG TCGCCGCCGC
CGCGCTCGAC GGCGGCGGGC TCGGTGGGTT CGGTGGGAAC GACGACGACG GCGCGCGGCG
TTGGCGCGAC GACGACGACG ATCCCGACCA CGACGGGGCG TCGAGGAAGT TTAACGCGTG
GATGAGGTTG TTCGCGGGGG CGGTGGCGCT CAACTGGACG ACGCACGAGG TCATCGCGTG
GAACGATCGA CCGGCGCACG ACGCGCCCGA TGACGACGCG TGGGAGCAAA AACGCGTCTC
GATCGTGGTC CCGGCTCGGA ACGAAAGCAA GGCGATCGGG CGGTTGTTGA AGCAACTGCG
ACGCGCGCTC GAGCCCGAGG CGGCGGAGGT TATAGTCAGC GTCGGCGACT CGGTGGATGA
CACGGCGGCG ATCGCCGCCG CGCACGGCGC GATCGTGGTT TCGGGAGCAA AGGGACGAGG
GAATCAGATG AACGCGGGCG CGCGAATCGC GACTGGGGAT TACGTATTGT TTTTGCACGC
GGATACGACG CCGCCGGCGG ACGTCGTGGA CGTCATTCGT CGACAGCTTC GCGATCAAAA
GACCGTCGTG GGCGGGTTCG TGTCGCTCAT AGAGACCAAG TCGCGCACGT TTTGGGCGAT
GTCGTATCAC AACGTCGTCA AGACGACGTA TTGCGCTGTG ATTAGTCGCC CTTTTGGCTA
CCTTCGAGGC TTTCGCATCC TGTTCGGCGA CCAAGCCATG TTTTGTCGCC TCGACGATTT
TAACGCCGTC GGCGGCTTCG ACGGTTCGCT GTCCATCATG GAAGACGCGG ATTTATGCGT
GCGCATGCAC GTCAAAGGTC GCGGGCGGTA CCGCGGACGA GTGAAGCTCT TGGACCGCGT
CGTCACCACC TCGGGCCGAC GTATAGAGCA ACTCGGCAAC TTCAAGGCGA CGTGCATACA
CGTCTTGATC GCGTGCAGCT GGAACTTTGG CGTCGGTCCG GAGAAATTGC GTAAACTGTA
CGACTGGTGT TACCGCGACG TGCGGTGACG CGCGCGGCGT CGGCGCCCTT T
 
Protein sequence
MRLFAGAVAL NWTTHEVIAW NDRPAHDAPD DDAWEQKRVS IVVPARNESK AIGRLLKQLR 
RALEPEAAEV IVSVGDSVDD TAAIAAAHGA IVVSGAKGRG NQMNAGARIA TGDYVLFLHA
DTTPPADVVD VIRRQLRDQK TVVGGFVSLI ETKSRTFWAM SYHNVVKTTY CAVISRPFGY
LRGFRILFGD QAMFCRLDDF NAVGGFDGSL SIMEDADLCV RMHVKGRGRY RGRVKLLDRV
VTTSGRRIEQ LGNFKATCIH VLIACSWNFG VGPEKLRKLY DWCYRDVR