Gene OSTLU_31018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31018 
Symbol 
ID5001419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp169596 
End bp170930 
Gene Length1335 bp 
Protein Length444 aa 
Translation table 
GC content61% 
IMG OID640416840 
Productpredicted protein 
Protein accessionXP_001417162 
Protein GI145345320 
COG category[S] Function unknown 
COG ID[COG0398] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.212376 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGCGC TTGAACTGGC GCGACGAGGA CCCGAACCGG GGGAGACGTG GACCGCGGAG 
CGAGAGGAGA TGCGAGCGCG GGCGCGGGCG TCGATTCCGA GCGCGGTGGG GACGTATTGG
CGAGCGAGGA GCGAGGCGCT GATTGAAAGT CAAGCGGTGC GGGACGCGGC GCCGTTGATG
TCGAGCGCGG TGAACGTGGC GATCGCGGGG GTGGTGCTGC GGTTGTTTTT GCCGCGCGTC
GCGGCGCTGC AAGCGGTGGG TGGGTTCGAC GAGTTGACTG AATTTTTTGG GTTGCCGCCG
AGGAGCGAGT TGAGCGGATA TTTGGATCAG TTGCGAGCGT TGCCCGTGGC GGCGGTGTTC
GCGGTGTACG TTGGGTTGTT CGTGGCGGAA AAGTTGACGA TGACGGATGA GTTTTTGCCG
ATCGGCTTCG TGCTGCCCGT GGTGTCGCCG GTGGTGTTCG GCGGCGTCTT CGGAGGGACG
ATGGTGACGT CGCTGGCGAG CACGCTGGCG GCGAGCGTCA ATTTCTTCCT CGCGCGGTAC
GTGTTGAAGG ATAAGATATT AGGCTTTAAG TGGGGCGAAA GTGATCCCGT GGGCGAGCAA
AAGTGGTTCG CCGCGCTGAG TCGAAGATTC GACTCGTCGC AATTCCCCGA GTCCACGGTG
CCCGAGGGGT TCAAGTCGGC GCTCTTGCTC AGGCTGTGCC CGATTTTACC GATTCCGATA
AGTGGGAACT GGTACGTGTG CGGGCTGACG CCTCTCAAAT TCAAAGAGTT CTTCGCCGCG
CACTTCATCG GAAGCTCGAA GACTGCGTTC ATCGACGCGT ATTTAGGTTC AATTTTGCTC
ACCGCGGTGT TCGACGAGTC ATCCGTCAAG GACCAGGCGC AAGGCGCGCT CGTGTTCGAA
ACCGTCGCCA TCATGGTTGT TTCCATCTTA GTCAGCACGT ACGCCACGGA GCTCTTCACG
CAGATTCTCG ACGAAGAAGG CGTCGACGCG GGGGCGATGA TGGGATTCGG TTCGGAATCC
AAAGACGAAG ACGAAGGCGA AGACGCCGTC GACGCCACCG TCGCCTTCAT CGCCGCCGCC
GCCCTGCCCG TGGAACCAGC CGGCTCCACG GCGACGAGCG ATGATGAACC GAAAGCCGAC
GACGACGAGA ACGACGACGA CGCCACGTCG AACGAACCTG AACTCATCCC AATCGAGCGC
ATGCCCGAGG ATGAAAAAGT TTTAATCGCC GAAGGCGAAG CGCTCTGGCG ACGCGCCGCG
CGCGTCGAAG CCGAGCGTCA AAAGCTCACC ATCGAAGAGA TGACCGATTA CGACTCCATG
GGACCAGACA TGTGA
 
Protein sequence
MAALELARRG PEPGETWTAE REEMRARARA SIPSAVGTYW RARSEALIES QAVRDAAPLM 
SSAVNVAIAG VVLRLFLPRV AALQAVGGFD ELTEFFGLPP RSELSGYLDQ LRALPVAAVF
AVYVGLFVAE KLTMTDEFLP IGFVLPVVSP VVFGGVFGGT MVTSLASTLA ASVNFFLARY
VLKDKILGFK WGESDPVGEQ KWFAALSRRF DSSQFPESTV PEGFKSALLL RLCPILPIPI
SGNWYVCGLT PLKFKEFFAA HFIGSSKTAF IDAYLGSILL TAVFDESSVK DQAQGALVFE
TVAIMVVSIL VSTYATELFT QILDEEGVDA GAMMGFGSES KDEDEGEDAV DATVAFIAAA
ALPVEPAGST ATSDDEPKAD DDENDDDATS NEPELIPIER MPEDEKVLIA EGEALWRRAA
RVEAERQKLT IEEMTDYDSM GPDM