Gene OSTLU_4812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_4812 
Symbol 
ID4999435 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009355 
Strand
Start bp744675 
End bp745715 
Gene Length1041 bp 
Protein Length314 aa 
Translation table 
GC content58% 
IMG OID640414856 
Productpredicted protein 
Protein accessionXP_001415920 
Protein GI145341653 
COG category[C] Energy production and conversion 
COG ID[COG0584] Glycerophosphoryl diester phosphodiesterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCGGGACTGT TCGAACCCGC CGGTTGCGCC GTCATCGGGC ACAGAGGTTT ACAAGCGAAT 
CGCGCTTCTG GCGCCGGTAT TCGTGAAAAC ACGCTAGCAT CGTTCAACGC GGCTTCTGCT
GGTGGCGCGG AGTGGTGTGA GTTTGACGTT CAAGTGACCG CAGACGGTGT TCCCGTCGCT
TGGCACGACG ACGTCGTCAT CATTCGTCGC GGACTCGGAC CTTTGGAGTC GTTTAGCGTT
CGAGAGATTG ACTGGGCGGA TCTGCGCGAA CTGTCTCGCG CCGCGCGCGC TACCGCCGCG
CGAGCGTCCA ACGCCCTCGG TGTTGAAAAG ACAGTTCCTT TAACCACCGA CGACGAAGAC
GACGAAGACG ACGACGATTA CGACGAAGAC GACAACAAGG TGACATTTTA TCGCGTGTTC
GGCGGCGATC TTGAACCTCA ACCGTGGGTC ATGGAAGTCG AAGATGAGAT CCCAACTTTG
GCACAGATTC TTGGAAACAC GCCGAAAGAG CTTGGCTTCA ACATCGAGCT CAAGTTCGAC
GAAGAGAACA GCTGTGAAAC GCGCCGCTTG GTCGCGGAAC TCCGCGCCAT TCTAGCGGTT
TGCATGGCGC AACCCAGTCG CAGAATCGTG TTCTCATCTT TCGATCCAGA TGCCGCTCTA
CTCATGCGTG CCATCCAGGG CTCATATCCA GTGATGATAT TGACCGATGC CGAGCCCCAT
CACGTCGACC CGCGTCGACG TTCAGTCGCT GCCGCGATGG AAGTCGCGCT CGAAGGTGGC
TTGTGTGGCG TTGTGTCGAA CGTCAAGGCG ATTATATCGC GCCCGTCCGA CGCGACCGAT
GTTCGAGACA GTGGTTTACT TCTCGCTACA TACGGCGAAG GTAACGATGA TGTCGCTGCA
TCGTCGACGC AAGTCGAGCT CGGCGTTTTC GGGATCATCA CAGACGCCGT GCCAGCCGTC
GCGAAGAAGT TCAATGCGAC GACTGTGAAT CCTGGCAACT TGGCTCCGGC GCTTGCGCCA
TTGGTATCAC CCTCAGTTGA C
 
Protein sequence
PGLFEPAGCA VIGHRGLQAN RASGAGIREN TLASFNAASA GGAEWCEFDV QVTADGVPVA 
WHDDVVIIPS NALGVEKTVP LTTDDEDDED DDDYDEDDNK VTFYRVFGGD LEPQPWVMEV
EDEIPTLAQI LGNTPKELGF NIELKFDEEN SCETRRLVAE LRAILAVCMA QPSRRIVFSS
FDPDAALLMR AIQGSYPVMI LTDAEPHHVD PRRRSVAAAM EVALEGGLCG VVSNVKAIIS
RPSDATDVRD SGLLLATYGE GNDDVAASST QVELGVFGII TDAVPAVAKK FNATTVNPGN
LAPALAPLVS PSVD