Gene OSTLU_33309 details


Gene Information

Locus tag: OSTLU_33309
Symbol:
ID: 5003797
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009363
Strand:
Start bp: 17339
End bp: 18386
Gene Length: 1048 bp
Protein Length: 340 aa
Translation table:
GC content: 59%
IMG OID: 640419218
Product: predicted protein
Protein accession: XP_001419676
Protein GI: 145350571
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
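
The start, end, and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the 340 aa protein is followed by a single stop codon. The function name span_length is illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(17339, 18386)
print(gene_length)  # 1048

# A 340 aa protein needs 340 codons plus one stop codon (assumed).
cds_length = (340 + 1) * 3
print(cds_length)  # 1023

# Bases in the listed gene span that are not part of the coding region.
print(gene_length - cds_length)  # 25
```

Under these assumptions the listed span is 25 bp longer than the coding region alone, which would be consistent with an unspliced gene that carries a few flanking bases on either side.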


Plasmid Coverage information

Num covering plasmid clones: 67
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CTGCGTTCGA TGCCTTCGCC ACCTCCGACC GTGCCCAAGG CGCGCCCGCG TTGCGGTGAA 
CCCCGCCGCG TGCTCGTCAC CGGTGGTGCC GGCTTCGTCG GGTCGCACCT CGTCGACGCG
CTGTTGAAAC GCGGCGACGA AGTCATCGTC ATGGACAACT TTTTCACCGG CTCGCAGCGC
AACTTGGAGC ACTTGAAGGG GAATCCAAAG TTTGAAATCA TCCGACACGA CATCGTGACG
CCGTTCTTGG TGGAGATCGA CGAAGTGTAT CACTTGGCGT GCCCGGCGTC GCCGATTCAT
TACAAATTCA ATCCGGTGAA GACGATCAAA ACCAACGTCT TGGGGACTAT GAACGCGCTC
GGGCTCGCGA AGCGGTGCAA GGCGAAATTC TTGCTCACGA GCACGTCTGA GGTGTACGGC
GATCCGCTCG AGCACCCGCA GACGGAATCG TACTGGGGCA ACGTCAATCC GATCGGCGAA
CGCGCGTGTT ACGATGAGGG CAAACGGTGC GCTGAAACGT TGGCGTTCGA TTACCATCGC
GAGCACGGTT TGGAAATTCG AGTGGCGAGA ATTTTCAACA CGTACGGACC GCGCATGGCG
ATGGATGACG GGCGCGTGGT GTCCAACTTC GTCGCGCAAG CGCTGGAGGG CAAACCTATG
ACAATCTACG GTGATGGCAC GCAGACGCGC TCGTTTCAAT ACGTCTCTGA CCTAGTCGCC
GGACTCATCG CGTTGATGGA CAACGACTCG GGCTTCGTCG GTCCGGTGAA TCTCGGTAAC
CCCGGTGAAT TCACGATGCT CGAACTCGCG GAGAAGGTGC GCGAAGTCGT GAACCCGAAC
GCGGAAATCG TGTTCTGCGA GAACACCTCG GACGATCCGA GTCGGCGCAA GCCAGACATT
TCGCTCGCGA AGGAAAAATT AGGCGGTTGG GAACCGAAGG TGAAGCTCGA GGACGGGCTC
AAGCTCATGG TGGAAGATTT CCGGGAGAGA ATCGAAGATA AGCGGGCGCG AGACGCGGCG
GGAGGACGAT GAGCGCCGTA CTTCTTAG
 
Protein sequence
MPSPPPTVPK ARPRCGEPRR VLVTGGAGFV GSHLVDALLK RGDEVIVMDN FFTGSQRNLE 
HLKGNPKFEI IRHDIVTPFL VEIDEVYHLA CPASPIHYKF NPVKTIKTNV LGTMNALGLA
KRCKAKFLLT STSEVYGDPL EHPQTESYWG NVNPIGERAC YDEGKRCAET LAFDYHREHG
LEIRVARIFN TYGPRMAMDD GRVVSNFVAQ ALEGKPMTIY GDGTQTRSFQ YVSDLVAGLI
ALMDNDSGFV GPVNLGNPGE FTMLELAEKV REVVNPNAEI VFCENTSDDP SRRKPDISLA
KEKLGGWEPK VKLEDGLKLM VEDFRERIED KRARDAAGGR
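
To see how the gene and protein sequences relate, the following minimal sketch translates the first line of the gene sequence and compares it with the start of the protein sequence. It assumes the standard genetic code (the record leaves the translation table blank) and uses the first ATG as the start codon, which is a simple heuristic and not necessarily how the original annotation was produced. The names CODON_TABLE and translate_from_first_atg are illustrative.

```python
from itertools import product

# Standard genetic code (assumed), with codons ordered over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_from_first_atg(dna: str) -> str:
    """Translate from the first ATG up to a stop codon or the end of the input."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    start = seq.find("ATG")
    if start == -1:
        return ""
    protein = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = (
    "CTGCGTTCGA TGCCTTCGCC ACCTCCGACC GTGCCCAAGG CGCGCCCGCG TTGCGGTGAA"
)
print(translate_from_first_atg(first_line))  # MPSPPPTVPKARPRCGE
```

The output matches the first 17 residues of the protein sequence, and the start codon sits 9 bases into the listed gene sequence. Passing the full gene sequence to the same function should stop at the first in-frame stop codon.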