Gene OSTLU_17590 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_17590 
Symbol 
ID5004667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009366 
Strand
Start bp258728 
End bp259948 
Gene Length1221 bp 
Protein Length406 aa 
Translation table 
GC content63% 
IMG OID640420088 
Productpredicted protein 
Protein accessionXP_001420637 
Protein GI145352619 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0167886 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCTCG TCGAGCGCGC GAGCGACGGC GGCGCGTCGC CGGCGCTCGT CGCGGCGTGC 
GAGACGCTCG GGGTTTTGAT CCACGGCGAG ACGGAGCGAC GCGCGACGGA GCGCGCGGAC
GAGGACGACG CGAGCGAGGC GCGGGAGACG GCGAAGGAGG GCGCGAGCGA CGTCGAGGAT
GGAGAGGCGG AGAAGTTCGC GAGGGCGCTG AGCGCGAAAC CGCGAGGGGA GGTTTTGGAG
GCGCTGACGG GACACGCGGA GACGCTGTTC GGCGACGGGG GGGATAAGGA GGTGAGCGGC
GTCGTGGCGG TGATGGCGAA TCTGGCGGGA GACGATGGGG GGGCGATGAA ACGCGTGATG
GAGTGCGTGA CGGCGAGCGT GAGCGAACGA GTGAGTCTGC GGGTGCGGTG CGCGATTGCG
TTGTACAACG ACGCCGACGC GAAGGATGGG GAGACAAAGC TTGAATTGTT TGAAAAGATT
GCGGCGTATT GCGTCGCGGC GAAACAAAAG CAAGTGTTAC CCATGCTCGT CGCGCACGCG
AGCGAGGCCA AGGCGTGGGG CGGCGAGGTC AAGACGCAGC GCAGAGTGTT GAAGTTGTGC
GTCGATTTGC TCCGAGAGTT GGAGGGTCGC GAGGAGGAGT TGTTCTCGTG CATGATCAAG
TATTTAGCGA CGTTTGAAAA CGACAGTGGA GCCGTCGCGG AGGCGGCGGA TATCGCCAAG
GAAACCGCGC GCATATTCAT CAGCTCGCCG ACGATGTTCC ACGGGGATTT CTTAGCGCTC
AAGGGGATGC AACATTTGCA ATCCGCCGAC GCCAACGCGC TCAAGCTCTT GTCCACCATG
CTCACGGGCT CCGTGGCCGA CTACAACGCC TTGGTCAAGG CGAACGGGTC GATCGTCTCC
GGTCTCGGCT TAGACGCCGA TGACTGCGTG GCAAAGATGC GCATGATGGC GCTGGCGGCG
CTCGGTAAGA AGGGCGACGC GTCGTACTCT GAAATCAAGG AGGCGATGCA GTGCGACGAC
GGTGAGGTTG AGGAATGGGT CGTGCGCGCA GTTGGCGCGG GTGTGGTCGA CGCGAAGATG
GATCAAATGA AGCAACGCGT TGTTTTCACT AGATGCACCG ACCGCGTATT CACTGGTGCC
GAGTGGCAAG AGTTGAGCTC GCGCATAACG CAGTGGCGAG GCAAGATCGC CGCGCTTCAA
AAGACGTTGT CGGCAAACTA A
 
Protein sequence
MPLVERASDG GASPALVAAC ETLGVLIHGE TERRATERAD EDDASEARET AKEGASDVED 
GEAEKFARAL SAKPRGEVLE ALTGHAETLF GDGGDKEVSG VVAVMANLAG DDGGAMKRVM
ECVTASVSER VSLRVRCAIA LYNDADAKDG ETKLELFEKI AAYCVAAKQK QVLPMLVAHA
SEAKAWGGEV KTQRRVLKLC VDLLRELEGR EEELFSCMIK YLATFENDSG AVAEAADIAK
ETARIFISSP TMFHGDFLAL KGMQHLQSAD ANALKLLSTM LTGSVADYNA LVKANGSIVS
GLGLDADDCV AKMRMMALAA LGKKGDASYS EIKEAMQCDD GEVEEWVVRA VGAGVVDAKM
DQMKQRVVFT RCTDRVFTGA EWQELSSRIT QWRGKIAALQ KTLSAN